InfLLM-V2: Dense-Sparse Switchable Attention for Seamless Short-to-Long Adaptation • arXiv:2509.24663 • Published Sep 29, 2025
SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse-Linear Attention • arXiv:2509.24006 • Published Sep 28, 2025
Sparsing Law: Towards Large Language Models with Greater Activation Sparsity • arXiv:2411.02335 • Published Nov 4, 2024
Configurable Foundation Models: Building LLMs from a Modular Perspective • arXiv:2409.02877 • Published Sep 4, 2024
MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies • arXiv:2404.06395 • Published Apr 9, 2024
Advancing LLM Reasoning Generalists with Preference Trees • arXiv:2404.02078 • Published Apr 2, 2024