Language Breakdown
Lines of code distribution across 82 owned repositories
I-Shaped Developer
I-shapedSpecialist — deep expertise in HTML
Collaboration Network
Global Impact visualization
Repos
333
PRs
0
Growth
+18%
Top Collaborators
No collaborator data yet.
Coding Streak
Contribution activity over the past year
Bowen Han
@bugparty
hulk
@git-hulk
Christina Lin
@weimeilin79
Asuka Minato
@asukaminato0721
Derui Yang
@YdrMaster
Top Repositories
使用 Prompts 和 Chains 让 ChatGPT 成为神奇的生产力工具!Unlocking the power of LLMs.
SpaCy 中文模型 | Models for SpaCy that support Chinese
A transparent, minimal, and hackable agent framework. ~300 lines of readable code. Full control, no magic.
汉字拆字库,可以将汉字拆解成偏旁部首,在机器学习中作为汉字的字形特征 | Hanzi Decomposition Library allows Chinese characters to be broken down into radicals and components, which can be used as character shape features in machine learning.
汉字字符特征提取器 (featurizer),提取汉字的特征(发音特征、字形特征)用做深度学习的特征 | A Chinese character feature extractor, which extracts the features of Chinese characters (pronunciation features, glyph features) as features for deep learning
人民日报语料处理工具集 | Tools for Corpus of People's Daily
一个基于 Rasa 的中文天气情况问询机器人(chatbot), 带 Web UI 界面
The ATIS (Airline Travel Information System) Dataset
一个轻量且功能全面的中文分词器,帮助学生了解分词器的工作原理。MicroTokenizer: A lightweight Chinese tokenizer designed for educational and research purposes. Provides a practical, hands-on approach to understanding NLP concepts, featuring multiple tokenization algorithms and customizable models. Ideal for students, researchers, and NLP enthusiasts..
rasa_chinese 专门针对中文语言的 rasa 组件扩展包,提供了许多针对中文语言的组件
Open Source Impact
Contributions to external projects