Data Science Ontology

Welcome to the Data Science Ontology, with 153 data science concepts and 100 code annotations.
The Data Science Ontology is a knowledge base about data science that aims to:
  • catalog the concepts of data science
  • semantically annotate popular software packages for data science
  • power new AI assistants for data scientists

Concepts

Concepts formalize the abstract ideas of data science.
Sample concept
Name: lasso (lasso)
Kind: type
Description: A sparse linear regression model, fit by least squares with L1 penalty
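
The description of the lasso concept can be made concrete as an optimization problem. The formula below is one common way to write it, not a definition taken from the ontology itself. The symbols are assumptions introduced here: X is the n-by-p design matrix, y the response vector, beta the coefficient vector, and lambda >= 0 the penalty weight. Scaling conventions for the squared-error term vary between references.

```latex
\hat{\beta} \;=\; \arg\min_{\beta \in \mathbb{R}^{p}}
  \; \frac{1}{2n} \, \lVert y - X\beta \rVert_2^{2}
  \;+\; \lambda \, \lVert \beta \rVert_1
```

The L1 term is what makes the model sparse: for large enough lambda, some coefficients are driven exactly to zero.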

Annotations

Annotations translate data science code into concepts.
Sample annotation
Name: k-means clustering in R (r/stats/k-means)
Kind: type
Language: R
Package: stats
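
To illustrate how an annotation like this sample might be handled in code, here is a minimal Python sketch. It is not the ontology's actual schema or API: the `Annotation` class, its `from_id` helper, and `find_annotations` are hypothetical names, and the assumption that an identifier follows a `language/package/name` pattern is inferred only from the single sample identifier `r/stats/k-means`.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Annotation:
    """Hypothetical record mirroring the fields shown in the sample annotation."""
    name: str
    id: str
    kind: str
    language: str
    package: str

    @classmethod
    def from_id(cls, annotation_id: str, name: str, kind: str) -> "Annotation":
        # Assumption: identifiers look like "language/package/name".
        language, package, _ = annotation_id.split("/", 2)
        return cls(name=name, id=annotation_id, kind=kind,
                   language=language, package=package)


def find_annotations(annotations, language, package):
    """Return the annotations that belong to a given language and package."""
    return [a for a in annotations
            if a.language == language and a.package == package]


# Values copied from the sample annotation.
sample = Annotation.from_id("r/stats/k-means",
                            name="k-means clustering in R",
                            kind="type")

matches = find_annotations([sample], language="r", package="stats")
print(sample.language, sample.package, len(matches))
```

Deriving the language and package from the identifier keeps the record consistent with its own key, but a real loader would more likely read those fields directly from the annotation's source file.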