BerryDB Python SDK¶
Welcome to the documentation of BerryDB Python SDK.
Introduction 👋¶
The BerryDB Python SDK offers a complete toolkit for developers and data scientists to interact with the BerryDB data platform. It’s designed to streamline the entire machine learning (ML) workflow, from managing datasets and annotations to handling model development and evaluation. 🚀
Core Components 🏗️¶
These modules provide foundational access to your data and annotation projects.
BerryDB
andDatabase
: These are the primary entry points for connecting to and managing your BerryDB databases, allowing for fundamental data access and manipulation. 💾Annotation Project
andAnnotations Config
: Central to creating and configuring data labeling tasks, these modules help define the structure and rules for annotating datasets, crucial for building high-quality training data. 🏷️
Model Management 🤖¶
This suite of modules offers a robust framework for versioning, storing, and configuring your machine learning models.
Model Repo
,Model Config
, andModel
: Together, these components facilitate the complete lifecycle of a model within the BerryDB ecosystem, ensuring models are reproducible and simplifying the path to deployment. 🔄
Evaluation and Settings 📊¶
These modules focus on assessing model performance and customizing the SDK’s behavior.
Evaluator
: An essential tool for a critical step in the ML lifecycle, providing functionality to run evaluations, compare model performance, and analyze results. ✅Settings
: Allows for the customization of the SDK’s behavior, enabling you to tailor various configurations to your specific project requirements. ⚙️
Table of Contents¶
Contents
- Installation
- BerryDB
- Database
Database.settings()
Database.enable_fts()
Database.database_name()
Database.get_all_documents()
Database.get_all_documents_with_col_filter()
Database.get_document_by_object_id()
Database.query()
Database.upsert()
Database.ingest_pdf()
Database.embed()
Database.chat()
Database.chat_for_eval()
Database.similarity_search()
Database.evaluator()
Database.ner()
Database.text_classification()
- Annotation Project
- Annotations Config
- Model Repo
- Model Config
- Model
- Settings
- Evaluator