GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

ingestion-pipeline

Website
Wikipedia
https://static.github-zh.com/github_avatars/bruin-data?size=40
bruin-data / ingestr

ingestr is a CLI tool to copy data between any databases with a single command seamlessly.

BigQuerycopy-databasedata-ingestiondata-integrationdata-pipelineduckdbingestion-pipelinesql-serverPostgreSQLsnowflake
Python 2.97 k
4 天前
https://static.github-zh.com/github_avatars/opensemanticsearch?size=40
opensemanticsearch / open-semantic-etl

#自然语言处理#Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipelin...

etlPythonOCRenrichmentsolrelasticsearchextractextract-textextractorextract-informationRDF (Resource Description Framework)documentspdfnamed-entity-recognitionannotationingestion-pipeline自然语言处理
Python 268
3 年前
https://static.github-zh.com/github_avatars/AstraBert?size=40
AstraBert / ingest-anything

From data to vector database effortlessly

自动化ingestion-pipelinellamaindexpdfqdrantvector-database
Python 65
1 个月前
https://static.github-zh.com/github_avatars/KnudsenMorten?size=40
KnudsenMorten / AzLogDcrIngestPS

AzLogDcrIngestPS - Unleashing the power of Log Ingestion API with Azure LogAnalytics custom table v2, Azure Data Collection Rules and Azure Data Ingestion Pipeline

Azuredataingestion-pipelinelogmanipulationPowerShell
PowerShell 32
5 个月前
https://static.github-zh.com/github_avatars/Morphl-AI?size=40
Morphl-AI / MorphL-Model-User-Search-Intent

#计算机科学#Google Cloud Storage connector, pre-processor and model for predicting user search intent based on keywords

机器学习自然语言处理pysparkpreprocessoringestion-pipelinepredict
Python 25
6 年前
https://static.github-zh.com/github_avatars/Morphl-AI?size=40
Morphl-AI / MorphL-Model-Publishers-Churning-Users

#计算机科学#Google Analytics connector, pre-processor and model for predicting churning users for digital publishers.

google-analyticspredictionpreprocessorpyspark机器学习ingestion-pipeline
Python 10
6 年前
https://static.github-zh.com/github_avatars/Clarifai?size=40
Clarifai / clarifai-python-datautils

Extract Transform and Load unstructured data into the Clarifai's AI platform

dataengineeringingestionunstructured-dataingestion-pipeline
Python 6
1 个月前
https://static.github-zh.com/github_avatars/akshaybahadur21?size=40
akshaybahadur21 / Emancipitaion-of-Apache-Spark

My experiments with Apache Spark for Humans ⭐

Apache Sparkarchitectureingestion-pipeline
Java 6
2 年前
https://static.github-zh.com/github_avatars/azuregig?size=40
azuregig / work_with_OrdnanceSurvey_data

Sample Azure Data Factory pipeline for ingesting Data Packages directly from the Download API of the Ordnance Survey Data Hub into Azure Storage.

geospatial-dataAzureingestion-pipelineingestion
4
3 年前
https://static.github-zh.com/github_avatars/tmcgrath?size=40
tmcgrath / cassandra-ingest

DataStax or Cassandra Ingest from Relational Databases with StreamSets

Apache Cassandraingestion-pipelinerdbmscdc
PLSQL 4
6 年前
https://static.github-zh.com/github_avatars/anhtuan284?size=40
anhtuan284 / chest-xray-multi-disease

#计算机科学#Multi-disease segmentation chest X-rays by YOLO and DenseNet121, CoAtNet models

机器视觉深度学习flask-apiflutter-appsyoloingestion-pipelinellamaindexollamarag
Jupyter Notebook 4
1 个月前
https://static.github-zh.com/github_avatars/xinmiao14?size=40
xinmiao14 / opensky-flight-pipeline

Real-time flight data fetching, cleaning, and analytics API using FastAPI, Pandas, PostgreSQL, and Python.

backend-apidata-engineeringFastAPIingestion-pipelinePostgreSQL
Python 3
1 个月前
https://static.github-zh.com/github_avatars/CyberCRI?size=40
CyberCRI / welearn-datastack

#学习与技能提升#Data stack for WeLearn LPI projects. This pipeline can collect, vectorize and store data from various sources.

dataingestion-pipelinelearningsdg
HTML 2
12 天前
https://static.github-zh.com/github_avatars/siddharth271101?size=40
siddharth271101 / Stock-Exchange-Analysis

Created a data pipeline using sqoop to ingest data from sql server into the hive table and used hive for feature engineering and analysis.

big-datahivesqoopMySQLingestion-pipeline
Shell 2
5 年前
https://static.github-zh.com/github_avatars/Charanaicore?size=40
Charanaicore / multinational-retail-data-centralisation

The multinational retail data contralisation project is a data warehousing project that focuses on ingesting data from disparate sources to create a centralised warehouse

data-warehouseingestion-pipeline
Python 1
2 年前
https://static.github-zh.com/github_avatars/rohitshubham?size=40
rohitshubham / Cloud-pipeline

A real-life end-to-end cloud sub-system scenario

Docker ComposeApache Sparkspark-streamingkafkakafka-brokersMongoDBingestion-pipelinestreaming-analytics
Python 1
1 年前
https://static.github-zh.com/github_avatars/SiyaMathe?size=40
SiyaMathe / Building-A-Scalable-Data-Architecture-With-Microservices

Explores the design and implementation of a modern, adaptable data infrastructure using microservices.

big-datadata-engineeringetlingestion-pipeline微服务bigdataanalytics
1
3 个月前
https://static.github-zh.com/github_avatars/SandeepGitGuy?size=40
SandeepGitGuy / Insurance_Documents_QA_Chatbot_RAG_LlamaIndex_LangGraph

#大语言模型#A Question Answering(Q/A) Chatbot on Insurance Documents. Powered by Retrieval Augmented Generation(RAG), LlamaIndex and LangGraph. Inspired from my Upgrad_IIITB PG Course.

聊天机器人chromadbdiskcacheingestion-pipelinelangchainlanggraphllama-index大语言模型openaiopenaiapiPythonquestion-answeringvector-store
Jupyter Notebook 1
6 个月前
https://static.github-zh.com/github_avatars/rachita27?size=40
rachita27 / AUTOMATING

Automating Ingestion Excel Files On To Azure Data Studio (SQL-Server)

AzurefunctionsPythonJupyter Notebookingestion-pipelineingestionSQLsql-serverpandasexcel
Jupyter Notebook 1
3 年前
https://static.github-zh.com/github_avatars/SandeepGitGuy?size=40
SandeepGitGuy / Insurance_Documents_QA_Chatbot_RAG_LlamaIndex_LangChain

#大语言模型#A Question Answering(Q/A) Chatbot on Insurance Documents. Powered by Retrieval Augmented Generation(RAG), LlamaIndex and LangChain. Inspired from my Upgrad_IIITB PG Course.

聊天机器人chromadbdiskcacheingestion-pipelinelangchainllama-index大语言模型openaiPythonquestion-answeringvector-store
Jupyter Notebook 1
6 个月前
loading...