= tempfile.mkdtemp("mnist_experiment")
group
tiledb.group_create(group)= tiledb.Group(group, "w")
tdb_grp tdb_grp.close()
Machine Learning: Groups
In this tutorial, you will learn how to organize your Machine Learning (ML) assets under a single, logical group. These assets can be datasets, models (from different ML frameworks solving the same problem), or any other TileDB asset that you want to bundle under the same semantics. Storing ML datasets alongside ML models in a registry for experimentation offers several advantages:
- Enhanced reproducibility
- Version consistency
- Efficient experimentation
- Iterative model development
- Data versioning and lineage
- Collaboration and knowledge sharing
- Complete experiment documentation
- Reduced dependencies on external resources
The following tutorial assumes that you have successfully completed the tutorials for ML dataset ingestion and ML model ingestion in TileDB.
Create an ML group
You can create a group that will store your ML assets:
You can create also sub-groups according to your use case. Users have the flexibility to bundle multiple models and datasets, from the same or different frameworks, into a group. This group can encompass elements that address or correspond to the same application, solution, algorithm, or problem statement. For this tutorial, you will be creating two subgroups. This will allow you to store your datasets and models under the same group, which can serve as a directory for storing all your experiment’s related assets, following a logical hierarchy.
Add a dataset array to a sub-group
Add a model array to a sub-group
Iterate group members
Delete groups
For more details on TileDB groups, visit Arrays Tutorials: Groups.