A collection of tools for extracting data into tidy DataFrames.
tidyextractors makes extracting data from supported sources as painless as possible, delivering you a populated Pandas DataFrame in three lines of code. tidyextractors was inspired by Hadley Whickham’s (2014) paper which introduces “tidy data” as a conceptual framework for data preparation.
For more information, including code examples, API reference, and general documentation, click HERE.
- Extracts data with minimal effort.
- Creates readable code that requires minimal explanation.
- Exports Pandas Dataframes to maximize compatibility with the Python data science ecosystem.
Currently Implemented Data Sources
In the near future, tidyextractors will be distributed on PyPI and accessible via pip. For now, clone the repository and run pip install -e . in the cloned directory.