Skip to content
View jacksonllee's full-sized avatar

Highlights

  • Pro

Organizations

@pycantonese @conda-forge @linguistica-uchicago

Block or report jacksonllee

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Word segmentation models

Python 3 1 Updated Mar 11, 2023

ISO 639 language codes

Python 34 3 Updated May 9, 2024

Linguistica 5: Unsupervised Learning of Linguistic Structure

Python 30 16 Updated Jul 4, 2019

Massively multilingual pronunciation mining

Python 315 71 Updated Sep 21, 2024

Cantonese Linguistics and NLP

Python 357 39 Updated May 23, 2024

Language Acquisition Research Tools

Python 37 18 Updated Mar 29, 2024

Automated testing for the examples in your documentation.

Python 69 14 Updated Sep 22, 2024

Rime Cantonese input schema | 粵語拼音輸入方案

Python 538 62 Updated Sep 19, 2024

A python port of the glmnet package for fitting generalized linear models via penalized maximum likelihood.

Python 262 59 Updated Jul 24, 2024

Spoken mandarin Chinese from Hong Kong.

9 3 Updated May 5, 2024

Spoken Cantonese from Hong Kong.

28 1 Updated May 5, 2024

Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)

5,736 961 Updated Feb 15, 2023

Benchmarks of approximate nearest neighbor libraries in Python

Python 4,876 734 Updated Sep 2, 2024

Multilayer Feed-Forward Neural Network predictive model implementations with TensorFlow and scikit-learn

Python 45 18 Updated Nov 29, 2022

A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files

Python 8,102 1,393 Updated Sep 28, 2024

Community Curated NLP List

195 33 Updated Jul 25, 2022

English data

Python 199 42 Updated Aug 28, 2024

A topic-centric list of HQ open datasets.

60,272 9,860 Updated Sep 6, 2024

Repository for Frequency Word List Generator and processed files

C# 1,164 554 Updated Feb 7, 2022

Unsupervised learning of root-and-pattern morphology in Python

Perl 4 1 Updated Apr 2, 2016

Resources for Instrumented Item-and-Pattern morphology

Jupyter Notebook 5 1 Updated Feb 25, 2016

Python tools for linguistics research

Python 11 Updated Aug 22, 2016

Stand-alone language identification system

Python 2,301 317 Updated Jan 1, 2020

[Deprecated] New development at https://github.com/linguistica-uchicago/lxa5

Python 5 2 Updated Feb 25, 2016

NLTK Source

Python 13,463 2,866 Updated Sep 25, 2024