#自然语言处理#General technology for enabling AI capabilities w/ LLMs and MLLMs
📃Language Model based sentences scoring library
ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward experts. We released a collection of ModuleFormer-based Language M...
ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations
#大语言模型#The LM Contamination Index is a manually created database of contamination evidences for LMs.
#自然语言处理#Bangla-Bert is a pretrained bert model for Bengali language
Korean text normalization and language preparation package for LM in Kaldi-based ASR system
#自然语言处理#🐍 Python library for n-gram models in ARPA format
#自然语言处理#Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks, an initial version of library focus on the Polish Language
Codes for the experiments in our EMNLP 2021 paper "Open Aspect Target Sentiment Classification with Natural Language Prompts"
The official repo for "TheoremQA: A Theorem-driven Question Answering dataset" (EMNLP 2023)
Automatically extracts NT and LM hashes from Windows memory dumps based on volatility.