diff --git a/documentation/docs/guide/datasets/data-connectors-format.md b/documentation/docs/guide/datasets/data-connectors-format.md index 4f07753bf8..be0efd8014 100644 --- a/documentation/docs/guide/datasets/data-connectors-format.md +++ b/documentation/docs/guide/datasets/data-connectors-format.md @@ -7,7 +7,10 @@ H2O LLM Studio supports the following data connectors to access or upload extern - **Upload**: Upload a local dataset from your machine. - **Local**: Specify the file location of the dataset on your machine. - **AWS S3 (Amazon AWS S3)**: Connect to an Amazon AWS S3 data bucket. -- **Kaggle**: Connect to a Kaggle dataset. +- **Azure Datalake**: Connect to a dataset in Azure Datalake. +- **H2O Drive**: Upload a dataset from H2O Drive. +- **Kaggle**: Connect to a dataset hosted on Kaggle. +- **Hugging Face**: Connect to a dataset on Hugging Face. ## Data format diff --git a/documentation/docs/guide/datasets/import-dataset.md b/documentation/docs/guide/datasets/import-dataset.md index ddd11097fd..b10eb328dd 100644 --- a/documentation/docs/guide/datasets/import-dataset.md +++ b/documentation/docs/guide/datasets/import-dataset.md @@ -90,6 +90,30 @@ Follow the relevant steps below to import a dataset to H2O LLM Studio. + +
    +
  1. + Enter values for the following fields: +
      +
    • + Datalake connection string:

      + Enter your Azure connection string to connect to Datalake storage. +
    • +
    • + Datalake container name:

      + Enter the name of the Azure Data Lake container where your dataset is stored, including the relative path to the file within the container. +
    • +
    • + File name:

      + Specify the exact name of the file you want to import. +
    • +
    +
  2. +
  3. + Click Continue. +
  4. +
+
  1. @@ -126,6 +150,30 @@ Follow the relevant steps below to import a dataset to H2O LLM Studio.
+ +
    +
  1. + Enter values for the following fields: +
      +
    • + Hugging Face dataset:

      + Enter the name of the Hugging Face dataset. +
    • +
    • + Split:

      + Enter the specific data split you want to import (e.g., "train", "test"). +
    • +
    • + Hugging Face API token (optional):

      + Enter your Hugging Face API token to authenticate access to private datasets or datasets with gated access. +
    • +
    +
  2. +
  3. + Click Continue. +
  4. +
+
:::