1. Catalog
  2. Code
  3. Notebooks
  • Home
  • What is TileDB?
  • Get Started
  • Explore Content
  • Accounts
    • Individual Accounts
      • Apply for the Free Tier
      • Profile
        • Overview
        • Cloud Credentials
        • Storage Paths
        • REST API Tokens
        • Credits
    • Organization Admins
      • Create an Organization
      • Profile
        • Overview
        • Members
        • Cloud Credentials
        • Storage Paths
        • Billing
      • API Tokens
    • Organization Members
      • Organization Invitations
      • Profile
        • Overview
        • Members
        • Cloud Credentials
        • Storage Paths
        • Billing
      • API Tokens
  • Catalog
    • Introduction
    • Data
      • Arrays
      • Tables
      • Single-Cell (SOMA)
      • Genomics (VCF)
      • Biomedical Imaging
      • Vector Search
      • Files
    • Code
      • Notebooks
      • Dashboards
      • User-Defined Functions
      • Task Graphs
      • ML Models
    • Groups
    • Marketplace
    • Search
  • Collaborate
    • Introduction
    • Organizations
    • Access Control
      • Introduction
      • Share Assets
      • Asset Permissions
      • Public Assets
    • Logging
    • Marketplace
  • Analyze
    • Introduction
    • Slice Data
    • Multi-Region Redirection
    • Notebooks
      • Launch a Notebook
      • Usage
      • Widgets
      • Notebook Image Dependencies
    • Dashboards
      • Dashboards
      • Streamlit
    • Preview
    • User-Defined Functions
    • Task Graphs
    • Serverless SQL
    • Monitor
      • Task Log
      • Task Graph Log
  • Scale
    • Introduction
    • Task Graphs
    • API Usage
  • Structure
    • Why Structure Is Important
    • Arrays
      • Introduction
      • Quickstart
      • Foundation
        • Array Data Model
        • Key Concepts
          • Storage
            • Arrays
            • Dimensions
            • Attributes
            • Cells
            • Domain
            • Tiles
            • Data Layout
            • Compression
            • Encryption
            • Tile Filters
            • Array Schema
            • Schema Evolution
            • Fragments
            • Fragment Metadata
            • Commits
            • Indexing
            • Array Metadata
            • Datetimes
            • Groups
            • Object Stores
          • Compute
            • Writes
            • Deletions
            • Consolidation
            • Vacuuming
            • Time Traveling
            • Reads
            • Query Conditions
            • Aggregates
            • User-Defined Functions
            • Distributed Compute
            • Concurrency
            • Parallelism
        • Storage Format Spec
      • Tutorials
        • Basics
          • Basic Dense Array
          • Basic Sparse Array
          • Array Metadata
          • Compression
          • Encryption
          • Data Layout
          • Tile Filters
          • Datetimes
          • Multiple Attributes
          • Variable-Length Attributes
          • String Dimensions
          • Nullable Attributes
          • Multi-Range Reads
          • Query Conditions
          • Aggregates
          • Deletions
          • Catching Errors
          • Configuration
          • Basic S3 Example
          • Basic TileDB Cloud
          • fromDataFrame
          • Palmer Penguins
        • Advanced
          • Schema Evolution
          • Advanced Writes
            • Write at a Timestamp
            • Get Fragment Info
            • Consolidation
              • Fragments
              • Fragment List
              • Consolidation Plan
              • Commits
              • Fragment Metadata
              • Array Metadata
            • Vacuuming
              • Fragments
              • Commits
              • Fragment Metadata
              • Array Metadata
          • Advanced Reads
            • Get Fragment Info
            • Time Traveling
              • Introduction
              • Fragments
              • Array Metadata
              • Schema Evolution
          • Array Upgrade
          • Backends
            • Amazon S3
            • Azure Blob Storage
            • Google Cloud Storage
            • MinIO
            • Lustre
          • Virtual Filesystem
          • User-Defined Functions
          • Distributed Compute
          • Result Estimation
          • Incomplete Queries
        • Management
          • Array Schema
          • Groups
          • Object Management
        • Performance
          • Summary of Factors
          • Dense vs. Sparse
          • Dimensions vs. Attributes
          • Compression
          • Tiling and Data Layout
          • Tuning Writes
          • Tuning Reads
      • API Reference
    • Tables
      • Introduction
      • Quickstart
      • Foundation
        • Data Model
        • Key Concepts
          • Indexes
          • Columnar Storage
          • Compression
          • Data Manipulation
          • Optimize Tables
          • ACID
          • Serverless SQL
          • SQL Connectors
          • Dataframes
          • CSV Ingestion
      • Tutorials
        • Basics
          • Ingestion with SQL
          • CSV Ingestion
          • Basic S3 Example
          • Running Locally
        • Advanced
          • Scalable Ingestion
          • Scalable Queries
      • API Reference
    • AI & ML
      • Vector Search
        • Introduction
        • Quickstart
        • Foundation
          • Data Model
          • Key Concepts
            • Vector Search
            • Vector Databases
            • Algorithms
            • Distance Metrics
            • Updates
            • Deployment Methods
            • Architecture
            • Distributed Compute
          • Storage Format Spec
        • Tutorials
          • Basics
            • Ingestion & Querying
            • Updates
            • Deletions
            • Basic S3 Example
            • Running Locally
          • Advanced
            • Versioning
            • Time Traveling
            • Consolidation
            • Distributed Compute
            • RAG LLM
            • LLM Memory
            • File Search
            • Image Search
            • Protein Search
          • Performance
        • API Reference
      • ML Models
        • Introduction
        • Quickstart
        • Foundation
          • Basics
          • Storage
          • Cloud Execution
          • Why TileDB for Machine Learning
        • Tutorials
          • Ingestion
            • Data Ingestion
              • Dense Datasets
              • Sparse Datasets
            • ML Model Ingestion
          • Management
            • Array Schema
            • Machine Learning: Groups
            • Time Traveling
    • Life Sciences
      • Single-cell
        • Introduction
        • Quickstart
        • Foundation
          • Data Model
          • Key Concepts
            • Data Structures
            • Use of Apache Arrow
            • Join IDs
            • State Management
            • TileDB Cloud URIs
          • SOMA API Specification
        • Tutorials
          • Data Ingestion
          • Bulk Ingestion Tutorial
          • Data Access
          • Distributed Compute
          • Basic S3 Example
          • Multi-Experiment Queries
          • Appending Data to a SOMA Experiment
          • Add New Measurements
          • SQL Queries
          • Running Locally
          • Shapes in TileDB-SOMA
          • Drug Discovery App
        • Spatial
          • Introduction
          • Foundation
            • Spatial Data Model
            • Data Structures
          • Tutorials
            • Spatial Data Ingestion
            • Access Spatial Data
            • Manage Coordinate Spaces
        • API Reference
      • Population Genomics
        • Introduction
        • Quickstart
        • Foundation
          • Data Model
          • Key Concepts
            • The N+1 Problem
            • Architecture
            • Arrays
            • Ingestion
            • Reads
            • Variant Statistics
            • Annotations
            • User-Defined Functions
            • Tables and SQL
            • Distributed Compute
          • Storage Format Spec
        • Tutorials
          • Basics
            • Basic Ingestion
            • Basic Queries
            • Export to VCF
            • Add New Samples
            • Deleting Samples
            • Basic S3 Example
            • Basic TileDB Cloud
          • Advanced
            • Scalable Ingestion
            • Scalable Queries
            • Query Transforms
            • Handling Large Queries
            • Annotations
              • Finding Annotations
              • Embedded Annotations
              • External Annotations
              • Annotation VCFs
              • Ingesting Annotations
            • Variant Statistics
            • Tables and SQL
            • User-Defined Functions
            • Sample Metadata
            • Split VCF
          • Performance
        • API Reference
          • Command Line Interface
          • Python API
          • Cloud API
      • Biomedical Imaging
        • Introduction
        • Foundation
          • Data Model
          • Key Concepts
            • Arrays
            • Ingestion
            • Reads
            • User Defined Functions
          • Storage Format Spec
        • Quickstart
        • Tutorials
          • Basics
            • Ingestion
            • Read
              • OpenSlide
              • TileDB-Py
          • Advanced
            • Batched Ingestion
            • Chunked Ingestion
            • Machine Learning
              • PyTorch
            • Napari
    • Files
  • API Reference
  • Self-Hosting
    • Installation
    • Upgrades
    • Administrative Tasks
    • Image Customization
      • Customize User-Defined Function Images
      • AWS ECR Container Registry
      • Customize Jupyter Notebook Images
    • Single Sign-On
      • Configure Single Sign-On
      • OpenID Connect
      • Okta SCIM
      • Microsoft Entra
  • Glossary

On this page

  • Create notebook
  • Overview
  • Preview
  • Sharing & Activity
  • Settings
  • Versioning
  • Download notebook
  • Copy notebook
  • Launch notebook
  • Rename notebook
  • Delete notebook
  1. Catalog
  2. Code
  3. Notebooks

Notebook Assets

notebooks
catalog
Notebooks offer an easy way to analyze your data in the TileDB securely governed and compliant environment.
Data Science made easy

TileDB allows you to create, manage, and launch Jupyter notebooks inside its secure infrastructure. Notebooks are an easy way to perform exploratory, interactive data analysis.

Create notebook

From the Assets page (found in the left navigation menu), select the Add Asset button, and select Code and then Notebook. You have two choices:

  1. Create an empty notebook
  2. Upload an existing notebook from your machine

Choosing the option to either create a new empty notebook, or add an existing one from your machine. Choosing the option to either create a new empty notebook, or add an existing one from your machine.

When creating an empty notebook, you need to provide the physical storage path, notebook name, and cloud credentials that can access the physical path.

Creating an empty notebook from the UI console. Creating an empty notebook from the UI console.

When uploading an existing notebook, you need to provide the local file, the physical storage path, notebook name, and cloud credentials that can access the physical path.

Uploading an existing notebook from the UI console. Uploading an existing notebook from the UI console.

Once created, your notebook will appear under Assets -> Code -> Notebooks.

Browse all your notebooks in a single place. Browse all your notebooks in a single place.

Overview

The Overview tab provides basic information about a notebook:

  • Description - If you provided a description to the notebook (e.g., from Settings), it is visible here. The description is indexed and searchable in the catalog. Therefore, it’s recommended to add a meaningful description for all your assets.
  • TileDB URI - The unique resource identifier for TileDB, based on which you can refer to the notebook. It contains the namespace and the UUID of the asset.
  • UUID - The unique identifier for the notebook.
  • Original URI - The location on cloud storage where the asset is stored. This property is visible only to the admin of the asset.
  • Permissions - What rights the current user has on this asset.
  • Image - This is the default server environment under which the notebook will run. It is configurable through Settings. Launching a notebook will ask for this image.
  • Server profile - This shows the specifications of the server that will run the notebook upon its launch, among the options given by TileDB. It is configurable like the image option in Settings.
  • License - If available, under which license the asset is available. Editable through Settings, if you are the admin of the asset.
  • Tags - Any tags on the asset, if available, which will be searchable in the TileDB catalog.

The basic information about the notebook. The basic information about the notebook.

Preview

You can see a human-readable rendering of the notebook under the Preview tab.

Rendering the notebook in human-readable form. Rendering the notebook in human-readable form.

Sharing & Activity

The Sharing screen allows you to securely share your notebook with other TileDB users, whereas the Activity screen shows you the various accesses performed on the notebook by you or any other user with whom you have shared your notebook. They are both covered in detail in the Collaborate section.

Settings

In the notebook settings, you can modify the following:

  • Dashboard options - A notebook can be converted into a dashboard - visit the Dashboards section for more details.
  • Versioning options - You can manage the number of versions for your notebook - visit the Versioning subsection below.
  • Description - Note that this is indexed and, thus, searchable in the TileDB catalog.
  • License - The type of license for the notebook, especially if you are making this publicly available.
  • Tags - These can be used for efficient search in the catalog.
  • Mark as read-only - This is useful if you want to prevent any notebook changes by you or someone with whom you shared the notebook.
  • Make public - If you wish to share the notebook with all the TileDB users. This will appear in the Marketplace tab in the left navigation menu. If you make a notebook public, you can easily change it back to private in the same manner.
  • Cloud credentials - Credentials should be provided so that TileDB can securely access the notebook on the cloud store where it is physically stored.
  • Server type - Change the server with which to launch the notebook.
  • Default region - The region in which you wish to launch the notebook server by default.
  • Rename notebook - Read the Rename notebook subsection below.
  • Delete notebook - Read the Delete notebook subsection below.

Tweaking the notebook settings. Tweaking the notebook settings.

Versioning

TileDB supports versioning for notebooks. Every time you edit a notebook (e.g., after launching it on a TileDB server - read the Analyze section for details), TileDB creates a new version. You can select which version to preview by selecting the button next to the notebook name, starting with Latest version - .... This brings up a modal on the right, which allows you to browse and select different notebook versions. Then you can preview or download the selected notebook version.

Browsing and selecting different notebook versions. Browsing and selecting different notebook versions.

You can prune past versions from the Settings tab. You can also choose to auto-prune, which happens once a day and will keep only the number of versions you select in the drop-down list next to the Prune button.

You can prune past notebook versions. You can prune past notebook versions.

Download notebook

You can download the notebook in the native, Jupyter-readable format by selecting the Download button next to Launch.

Copy notebook

TileDB allows you to duplicate a notebook by copying it in another physical storage location and with a different name. Select the Copy notebook button that is next to the Download button. In the emerging window, you can change the owner (e.g., to an organization of which you are a member), add a new physical storage path, and change the name.

You can duplicate the notebook by creating a clone. You can duplicate the notebook by creating a clone.

Launch notebook

TileDB offers a powerful compute infrastructure, in which you can securly launch your notebook by selecting the Launch button. Visit the Analyze section for more details on launching notebooks in TileDB.

Rename notebook

You can rename a notebook from the Settings tab. This action does not alter or copy the contents of the notebook; it just registers the asset in the catalog under a different name.

You can programmatically rename a notebook as follows:

  • Python
tiledb.cloud.asset.update_info(
    "`tiledb://<account>/<previous_name>`",
    name="<new_name>",
    access_credentials_name="<acn>",  # Optional - The cloud credentials that access the notebook (should already exist in your account settings)
)
Warning

Take caution when renaming notebooks, as any URIs including the previous notebook name will no longer work.

Delete notebook

When deleting a notebook, you have two options:

  • Unregister: This operation removes the notebook from the TileDB catalog, but it does not physically remove it from the object store. Since the notebook will persist on storage, you can register it again in the TileDB catalog in the future.
  • Delete: This operation both unregisters and physically removes the notebook from storage. Note that this operation cannot be undone.

You can delete the notebook from the Settings tab, which will prompt you to choose among the two operations above.

The two options when removing a notebook. The two options when removing a notebook.

You can also programmatically delete or unregister a notebook as follows:

  • Python
# Unregister a notebook
tiledb.cloud.asset.deregister(uri="tiledb://<account>/<notebook_name>")

# Delete a notebook
tiledb.cloud.asset.delete(uri="tiledb://<account>/<notebook_name>")
Code
Dashboards