TensorBLEU: Vectorized GPU-based BLEU Score Implementation for Per-Sentence In-Training Evaluation Paper • 2510.05485 • Published Oct 7 • 7 • 2
Reactive Transformer (RxT) -- Stateful Real-Time Processing for Event-Driven Reactive Language Models Paper • 2510.03561 • Published Oct 3 • 24 • 2
Sparse Query Attention (SQA): A Computationally Efficient Attention Mechanism with Query Heads Reduction Paper • 2510.01817 • Published Oct 2 • 15 • 2