Skip to content
Change the repository type filter

All

    Repositories list

    • zerox

      Public
      PDF to Markdown with vision models
      Python
      MIT License
      590000Updated Nov 17, 2024Nov 17, 2024
    • SDG is a specialized framework designed to generate high-quality structured tabular data.
      Python
      Apache License 2.0
      551000Updated Nov 2, 2024Nov 2, 2024
    • An open-source framework that simplifies implementation of data solutions.
      TypeScript
      Apache License 2.0
      23000Updated Oct 31, 2024Oct 31, 2024
    • In this repository you may find KQL (Kusto Query Language) queries and Watchlist schemes for data sources related to Microsoft Sentinel (a SIEM tool).
      MIT License
      23000Updated Oct 31, 2024Oct 31, 2024
    • A utility for Migrating Data between Oracle, Postgres, MySQL MariaDB, Snowflake. Stage Data from supported database to Amazon S3 and Azure Blob Storage in JSON and CSV Formats
      JavaScript
      MIT License
      10000Updated Oct 31, 2024Oct 31, 2024
    • Data Pipeline based on Medallion Architecture using Azure Data Factory, Databricks and DBT.
      Python
      1000Updated Oct 31, 2024Oct 31, 2024
    • AI-in-a-Box leverages the expertise of Microsoft across the globe to develop and provide AI and ML solutions to the technical community. Our intent is to present a curated collection of solution accelerators that can help engineers establish their AI/ML environments and solutions rapidly and with minimal friction.
      Jupyter Notebook
      MIT License
      189000Updated Oct 28, 2024Oct 28, 2024
    • One framework to develop, deploy and operate data workflows with Python and SQL.
      Python
      Apache License 2.0
      57000Updated Oct 28, 2024Oct 28, 2024
    • Fast data quality framework for modern data infrastructure
      Scala
      GNU Lesser General Public License v3.0
      5000Updated Oct 24, 2024Oct 24, 2024
    • In this project we are going to create an end-to-end data platform right from Data Ingestion, Data Transformation, Data Loading and Reporting.
      Jupyter Notebook
      6000Updated Oct 19, 2024Oct 19, 2024
    • RLS (Row-level Security) Implementation on Unity Catalog initiated Databricks. Using ROW FILTER
      1000Updated Oct 18, 2024Oct 18, 2024
    • Deploy a multi-account cloud foundation to support highly-regulated workloads and complex compliance requirements.
      TypeScript
      Apache License 2.0
      472000Updated Oct 17, 2024Oct 17, 2024
    • Databricks Platform - Architecture, Security, Automation and much more!!
      Jupyter Notebook
      27000Updated Oct 16, 2024Oct 16, 2024
    • Bicep
      MIT License
      17000Updated Oct 16, 2024Oct 16, 2024
    • The Security Reference Architecture (SRA) implements typical security features as Terraform Templates that are deployed by most high-security organizations, and enforces controls for the largest risks that customers ask about most often.
      HCL
      Other
      45000Updated Oct 15, 2024Oct 15, 2024
    • This Sample Datawarehouse Project is an integration from Informatica Cloud(IICS) to Snowflake and vice versa for ETL/ELT.
      1000Updated Oct 14, 2024Oct 14, 2024
    • Azure Cognitive Search + Azure OpenAI Accelerator
      Jupyter Notebook
      MIT License
      956000Updated Oct 7, 2024Oct 7, 2024
    • Lakehouse Medallion Architecture using modern Data Stack tools such as Fivetran, Snowflake and dbt.
      Python
      5000Updated Sep 30, 2024Sep 30, 2024
    • This project implements a Lakehouse Medallion Architecture using modern Data Stack tools such as Fivetran, Snowflake and dbt. The ficticious organization is an e-commerce company.
      Python
      5000Updated Sep 30, 2024Sep 30, 2024
    • A Streamlit app for assessing data quality in Snowflake
      Python
      MIT License
      2000Updated Sep 29, 2024Sep 29, 2024
    • The ADF Universal Framework is an open-source project designed to provide a comprehensive and flexible solution for building scalable and efficient data integration workflows using Azure Data Factory (ADF).
      TSQL
      The Unlicense
      3000Updated Sep 6, 2024Sep 6, 2024
    • The ADF Universal Framework is an open-source project designed to provide a comprehensive and flexible solution for building scalable and efficient data integration workflows using Azure Data Factory (ADF).
      TSQL
      The Unlicense
      3000Updated Sep 6, 2024Sep 6, 2024
    • Building a Data Lakehouse using the Medallion architecture.
      Jupyter Notebook
      3000Updated Sep 1, 2024Sep 1, 2024
    • Python scripts for Azure Blob Storage data ingestion into Snowflake. Includes a manual version and an HTTP request version for Azure Functions.
      Python
      2000Updated Aug 24, 2024Aug 24, 2024
    • Azure MLOps (v2) solution accelerators. Enterprise ready templates to deploy your machine learning models on the Azure Platform.
      Python
      MIT License
      704000Updated Aug 7, 2024Aug 7, 2024
    • Data Pipeline with Delta Lake using Medallion architecture
      Jupyter Notebook
      2000Updated Aug 6, 2024Aug 6, 2024
    • ML Ops Accelerator: Databricks & Azure Machine Learning Unification
      Python
      MIT License
      68000Updated Aug 5, 2024Aug 5, 2024
    • Azure Analytics End to End with Azure Synapse - Deployment Accelerator
      Bicep
      MIT License
      124000Updated Jul 16, 2024Jul 16, 2024
    • 🦔 PostHog provides open-source product analytics, session recording, feature flagging and A/B testing that you can self-host.
      Python
      Other
      1.5k000Updated Jul 13, 2024Jul 13, 2024
    • End-to-End data pipeline framework for data quality and validation including Flask API
      Python
      1000Updated Jul 13, 2024Jul 13, 2024