Glyph: Scaling Context Windows via Visual-Text Compression Paper • 2510.17800 • Published 23 days ago • 66
GAPrune: Gradient-Alignment Pruning for Domain-Aware Embeddings Paper • 2509.10844 • Published Sep 13 • 2
Embedding Model Datasets Collection A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers • 70 items • Updated 21 days ago • 149
Large Language Models are Superpositions of All Characters: Attaining Arbitrary Role-play via Self-Alignment Paper • 2401.12474 • Published Jan 23, 2024 • 36
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper • 2402.13753 • Published Feb 21, 2024 • 116
Extending Context Window of Large Language Models via Positional Interpolation Paper • 2306.15595 • Published Jun 27, 2023 • 53