Overview

The 4Minds platform integrates with popular data platforms to streamline your workflow. Instead of manually downloading and uploading files, connect directly to your existing data sources and import what you need.

Supported Integrations

Azure Blob Storage

Connect to Azure Blob Storage to access your cloud-stored datasets and files. Import data directly from your Azure containers without manual downloads, maintaining your existing cloud storage infrastructure.

Amazon S3

Connect to Amazon S3 to access your cloud-stored datasets and files. Import data directly from your S3 buckets without manual downloads, leveraging your existing AWS storage infrastructure.

Box

Connect to Box to access your cloud-stored documents and files. Import data directly from your Box folders without manual downloads, leveraging your existing Box storage infrastructure for seamless collaboration and file management.

CoreWeave Storage

Connect to CoreWeave Storage to access your datasets stored on CoreWeave’s high-performance cloud infrastructure. Import data directly from your CoreWeave storage without manual transfers.

Databricks Workspaces

Link your Databricks workspace to pull data from tables, notebooks, and files. This integration lets you leverage existing data pipelines without duplicating storage or creating manual export processes.

Hugging Face Hub

Connect to the Hugging Face Hub to access thousands of public datasets and models. Browse available resources, preview metadata, and import datasets directly into your 4Minds workspace.

Note: Parquet files are not supported when importing datasets from Hugging Face. Use CSV, JSON, or another supported format instead.

SharePoint

Connect to SharePoint to access documents and files stored in your organization’s SharePoint sites. Import data directly from SharePoint libraries, maintaining your existing document management workflows.

Google Cloud Storage

Connect to Google Cloud Storage to access your cloud-stored datasets and files. Import data directly from your GCS buckets without manual downloads, maintaining your existing Google Cloud storage infrastructure.

Snowflake

Connect to Snowflake to access your data warehouse tables and views. Import structured data directly from Snowflake without manual exports, leveraging your existing data warehouse infrastructure.

Office 365

Connect to Office 365 for seamless access to emails, calendars, OneDrive files, Teams messages, and OneNote. Integrate your Microsoft 365 ecosystem to leverage organizational knowledge and collaboration data.

Microsoft Fabric

Connect to Microsoft Fabric to access your unified analytics platform data. Browse and import directly from your Fabric Lakehouses (files) and Warehouses (tables), leveraging Microsoft’s end-to-end analytics solution for seamless data integration.

NetApp

Connect to NetApp storage systems to access your enterprise data and files. Import data directly from NetApp storage volumes, leveraging your existing enterprise storage infrastructure for seamless data integration.

How it Works

  1. Connect - Authenticate with your external platform using API keys or OAuth
  2. Browse - Explore available datasets and resources from the connected source
  3. Import - Select the data you want and initiate the import process
  4. Track - Monitor import progress with real-time status updates
  5. Validate - Confirm successful connections and data integrity before training

Upload Size Limit

  • You can upload up to 100 MB of data at a time. This limit applies whether you upload a single file, multiple files, or an integration dataset. A progress bar displays the total upload size.
  • To upload more data, reopen the dataset and upload the next batch of up to 100 MB.
    The limit applies to each upload batch only; there is no limit on the overall dataset size.
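
If you prepare uploads with a script, the per-batch limit can be planned for ahead of time. The following is a minimal sketch, not part of the 4Minds platform: the function name `plan_upload_batches` is hypothetical, and it assumes 100 MB means 100,000,000 bytes (the more conservative reading, since the exact byte count is not stated above). It groups files, in order, into batches that each stay within the limit.

```python
# Hypothetical helper for planning upload batches; not a 4Minds API.
# Assumption: "100 MB" is treated as 100,000,000 bytes (conservative reading).
BATCH_LIMIT_BYTES = 100 * 1_000_000


def plan_upload_batches(
    files: list[tuple[str, int]],
    limit: int = BATCH_LIMIT_BYTES,
) -> list[list[str]]:
    """Group (name, size_in_bytes) pairs into ordered batches within the limit."""
    batches: list[list[str]] = []
    current: list[str] = []
    current_size = 0

    for name, size in files:
        if size > limit:
            # A single file above the limit cannot fit in any batch.
            raise ValueError(f"{name} is {size} bytes, above the {limit}-byte batch limit")
        if current and current_size + size > limit:
            # Adding this file would exceed the limit, so close the current batch.
            batches.append(current)
            current, current_size = [], 0
        current.append(name)
        current_size += size

    if current:
        batches.append(current)
    return batches
```

Each inner list is one upload: upload the first batch, reopen the dataset, then upload the next. The grouping is a simple in-order pass, so it does not try to minimize the number of batches.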

Managing Connections

All active integrations appear in your Control Center. You can:
  • View connection status and last sync time
  • Test connections to verify they’re working
  • Update credentials or permissions
  • Disconnect sources you no longer need
Once imported, datasets from external sources work exactly like uploaded files: they’re processed through the same ETL and Graph engines to build your model’s knowledge base.

Automatic Data Synchronization (Rsync)

Keep your datasets current with automatic synchronization from connected integrations. When you create a dataset from an integration source (like a OneDrive folder), 4Minds saves a manifest tracking the files in that location. Each time you log in, the platform checks your connected sources for new files and automatically fetches any additions to your dataset.

How it works:
  1. Create a dataset from an integration source (e.g., a OneDrive folder)
  2. 4Minds stores a manifest of the files at creation time
  3. On each login, the platform compares the current source contents against the manifest
  4. New files are automatically retrieved and added to your dataset
This keeps your datasets up to date with files added at the source, as of your most recent login, without manual re-imports.
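
The manifest comparison described above can be pictured as a set difference. The sketch below is illustrative only and is not the platform's implementation: the function names `find_new_files` and `sync_once` are hypothetical, files are identified by name alone, and updating the manifest after each sync is an assumption. It handles additions only, since that is the only case described here.

```python
# Illustrative sketch of manifest-based sync; not the 4Minds implementation.
# Assumptions: files are identified by name, and the manifest is updated
# after each sync so the same file is not fetched twice.


def find_new_files(manifest: set[str], source_files: set[str]) -> list[str]:
    """Return files present at the source but missing from the manifest."""
    return sorted(source_files - manifest)


def sync_once(manifest: set[str], source_files: set[str]) -> tuple[list[str], set[str]]:
    """Compare source contents against the manifest (as on each login).

    Returns the files to fetch and the manifest after adding them.
    """
    new_files = find_new_files(manifest, source_files)
    updated_manifest = manifest | set(new_files)
    return new_files, updated_manifest
```

For example, with a manifest of `{"q1.csv"}` and a source folder that now holds `q1.csv` and `q2.csv`, `sync_once` reports `q2.csv` as the file to fetch, and a second call with the updated manifest reports nothing new.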