packing-box/python-dsff

DataSet File Format

Store a dataset in an XLSX-like format.

This library contains code for handling the DataSet File Format (DSFF), which is based on the XLSX format, and for converting it to ARFF (for use with the Weka framework), CSV, or a FilelessDataset structure (from the Packing Box).

pip install --user dsff

😎 Usage

Creating a DSFF from a FilelessDataset

>>> import dsff
>>> with dsff.DSFF() as f:
...     f.write("/path/to/my-dataset")  # folder of a FilelessDataset (containing data.csv, features.json and metadata.json)
...     f.to_arff()                     # creates ./my-dataset.arff
...     f.to_csv()                      # creates ./my-dataset.csv
...     f.to_db()                       # creates ./my-dataset.db (SQLite DB)
...
>>> # on leaving the context, ./my-dataset.dsff is created
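
The SQLite file produced by `to_db()` can be opened with Python's standard `sqlite3` module. The sketch below only lists whatever tables the database contains, since the table names DSFF chooses are not stated here. The helper name `list_tables` is hypothetical and not part of `dsff`.

```python
import sqlite3
from contextlib import closing


def list_tables(db_path):
    """Return the names of all tables in an SQLite database, sorted by name."""
    # closing() ensures the connection is released even if the query fails
    with closing(sqlite3.connect(db_path)) as conn:
        rows = conn.execute(
            "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
        ).fetchall()
    # Each row is a 1-tuple holding the table name
    return [name for (name,) in rows]
```

For example, `list_tables("my-dataset.db")` would return the table names found in the exported database. Note that `sqlite3.connect` creates an empty file if the path does not exist, so check the path first when that matters.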

Creating a FilelessDataset from a DSFF

>>> import dsff
>>> with dsff.DSFF("/path/to/my-dataset.dsff") as f:
...     f.to_dataset()  # creates ./my-dataset with data.csv, features.json and metadata.json
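
After `to_dataset()` has run, it can be useful to confirm that the output folder has the expected layout. The following is a minimal sketch using only the standard library; the three file names come from the description above, while the helper names `missing_files` and `load_json_parts` are hypothetical and not part of `dsff`. Nothing is assumed about the contents of the JSON files beyond being valid JSON.

```python
import json
from pathlib import Path

# File names taken from the FilelessDataset description above
EXPECTED_FILES = ("data.csv", "features.json", "metadata.json")


def missing_files(folder):
    """Return the expected FilelessDataset files that are absent from `folder`."""
    root = Path(folder)
    return [name for name in EXPECTED_FILES if not (root / name).is_file()]


def load_json_parts(folder):
    """Load features.json and metadata.json into a dict keyed by file name."""
    root = Path(folder)
    parts = {}
    for name in ("features.json", "metadata.json"):
        with open(root / name, encoding="utf-8") as fh:
            parts[name] = json.load(fh)
    return parts
```

An empty list from `missing_files("./my-dataset")` indicates that all three files are present.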

➕ Extensions

Install all available extensions

pip install --user dsff[all]

Dealing with Apache Arrow formats

pip install --user dsff[arrow]

>>> import dsff
>>> with dsff.DSFF("/path/to/my-dataset.dsff") as f:
...     f.to_feather()  # creates ./my-dataset.feather
...     f.to_orc()      # creates ./my-dataset.orc
...     f.to_parquet()  # creates ./my-dataset.parquet
