Data Science Ontology

Welcome to the Data Science Ontology, with 153 data science concepts and 100 code annotations
The Data Science Ontology is a knowledge base about data science that aims to
  • catalog the concepts of data science
  • semantically annotate popular software packages for data science
  • power new AI assistants for data scientists

Concepts

Concepts formalize the abstract ideas of data science.
Sample concept
Name
generalized linear model (glm)
Kind
type
Description
A generalized linear model (GLM), an exponential family model extending the linear model beyond the normal distribution, fit by maximum likelihood estimation.

Annotations

Annotations translate data science code into concepts
Sample annotation
Name
SQLAlchemy engine (python/sqlalchemy/engine)
Kind
type
Language
python Python
Package
sqlalchemy
Description
Database engine in SQLAlchemy