- catalog the concepts of data science
- semantically annotate popular software packages for data science
- power new AI assistants for data scientists
Data Science Ontology
Welcome to the Data Science Ontology, with
153 data science concepts and
100 code annotations
The Data Science Ontology is a knowledge base about data science that aims to
Concepts
Concepts formalize the abstract ideas of data science.
Sample concept
- Name
- generalized linear model (glm)
- Kind
- type
- Description
- A generalized linear model (GLM), an exponential family model extending the linear model beyond the normal distribution, fit by maximum likelihood estimation.
Annotations
Annotations translate data science code into concepts
Sample annotation
- Name
- SQLAlchemy engine (python/sqlalchemy/engine)
- Kind
- type
- Language
Python
- Package
- sqlalchemy
- Description
- Database engine in SQLAlchemy