#Web crawler#A configurable web spider with an easy-to-use web console
#Natural language processing#NEWS: JATE2.0 Beta.11 Released, see details below.
#Natural language processing#Extension of the SentenceSimplification project
#Natural language processing#The code base of the front-end of nocodefunctions.com
#Natural language processing#Document Enrichment plugin for Elasticsearch
TextDigester: document summarization Java library
Functional and structural analysis of tables in research papers (Table disentangling)
Java implementation of the Rapid Automatic Keyword Extraction (RAKE) algorithm
JRuby gem to convert PDF to text while keeping the layout of the original PDF file
QTLTableMiner++ tool for mining tables in scientific articles
AdSeeker is an advertisement engine (sentiment analysis) developed using Java Jersey RESTful, Apache Jena & Hibernate ORM
Sample OCR using OpenCV (just a toy project), since 2017
SparseTP: Efficient Topic Modeling on Phrases via Sparsity
Maven project for Yonsei Hands-on Text Mining course
#Natural language processing#A RapidMiner extension for easy Chinese language processing and text mining
#Natural language processing#Recovery of ActiveWatch statistical text analysis from 20th-century Java code saved on a CD-ROM disk. This probably should be rewritten, but can now demonstrate AW mapping of dynamic text content and ...
Implementation of text mining of medical records in Java
#Natural language processing#All the lab work for the NLP course that I took.