#大语言模型#Official Repository of "LLM × DATA" Survey Paper
DSIR large-scale data selection framework for language model training
Graphical tool for data manipulation written in C++/Qt.
GUNDAM is a data management system that prioritizes data using language models.
⏳ Provide filtering, sanitizing, and conversion of Golang data. 提供对Golang数据的过滤,净化,转换。
A GraphQL like interface to map a request to eloquent query with data transformation for Laravel.
Exponentially Weighted Moving Average Filter
Official implementation of our paper "Finetuned Multimodal Language Models are High-Quality Image-Text Data Filters".
#数据仓库#Framework for processing and filtering datasets
R Tutorial: useful R codes for cleaning and filtering data from Qualtrics surveys, and for creating new variables in the dataframe. With step-by-step explanations.
This repository contains all (Python 3) code and libraries required for the 2022-2023 Notre Dame Rocketry Team (NDRT) Apogee Control System (ACS). It also contains sensor/actuator example code and fli...
EpiMethEx (Epigenetic Methylation and Expression), a R package to perform a large-scale integrated analysis by cyclic correlation analyses between methylation and gene expression data.
Data extraction from smartphones and GPS and Accelerometer data "fusion" with Kalman filter.
SCAR: Efficient Instruction-Tuning for Large Language Models via Style Consistency-Aware Response Ranking
Base-call error-filtering and read preprocessing pipeline for fastq libraries
Anonymises data inside text files and in sheet files. It recognises and removes various sorts of personally identifiable information (PII). Each removed part is replaced with a suitable generic text, ...
#安卓#CDC Connect is a cross-platform mobile application built in React Native using JavaScript. The app is designed for data collection with a focus on surveys.
Make the data grid's Auto Filter Row insensitive to accents.
PHP | SQL - DESC LIMIT ile istenilen sayıda veri çekme işlemi