kas's picture

kas

shing3232

·

AI & ML interests

None yet

Recent Activity

liked a Space about 2 months ago

akhaliq/voxel-deepseek-terminus

liked a model 2 months ago

Aleph-Alpha/llama-tfree-hat-pretrained-7b-dpo

new activity 3 months ago

deepseek-ai/DeepSeek-V3.1:tool call for reasoning mode

View all activity

Organizations

None yet

upvoted a paper 7 months ago

TransMLA: Multi-head Latent Attention Is All You Need

Paper • 2502.07864 • Published Feb 11 • 58

upvoted an article 7 months ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Sep 18, 2024

•

271

upvoted 2 papers 7 months ago

Hogwild! Inference: Parallel LLM Generation via Concurrent Attention

Paper • 2504.06261 • Published Apr 8 • 110

VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks

Paper • 2504.05118 • Published Apr 7 • 26

upvoted a collection about 1 year ago

Qwen2.5-Coder

Code-specific model series based on Qwen2.5 • 40 items • Updated Jul 21 • 347

upvoted 3 papers over 1 year ago

BASS: Batched Attention-optimized Speculative Sampling

Paper • 2404.15778 • Published Apr 24, 2024 • 11

ReFT: Representation Finetuning for Language Models

Paper • 2404.03592 • Published Apr 4, 2024 • 101

ChatEDA: A Large Language Model Powered Autonomous Agent for EDA

Paper • 2308.10204 • Published Aug 20, 2023 • 1

upvoted 3 collections over 1 year ago

Camelidae

5 items • Updated Aug 22 • 2

Microsoft Research Papers

#PapersToRead from Microsoft Research in the broad space of Generative AI, Multi-agent systems, responsible AI practices, LLM Ops, and language models • 20 items • Updated Jun 26, 2024 • 5

Papers

Large Language Model (LLM) and NLP related papers. • 331 items • Updated 15 days ago • 13

upvoted a paper over 1 year ago

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 625

upvoted a paper almost 2 years ago

LLM Augmented LLMs: Expanding Capabilities through Composition

Paper • 2401.02412 • Published Jan 4, 2024 • 38