Talha Rüzgar Akkuş's picture

Talha Rüzgar Akkuş

Q-bert

·

AI & ML interests

AI, NLP, Math Hypothesis,NP Problems,Competitive programming

Recent Activity

updated a dataset about 15 hours ago

Q-bert/InstrucTurca-formatted-50k

published a dataset about 15 hours ago

Q-bert/InstrucTurca-formatted-50k

updated a dataset 1 day ago

Q-bert/InstrucTurca-formatted

View all activity

Organizations

upvoted a paper 4 months ago

NeuralOS: Towards Simulating Operating Systems via Neural Generative Models

Paper • 2507.08800 • Published Jul 11 • 79

upvoted a paper 5 months ago

Wait, We Don't Need to "Wait"! Removing Thinking Tokens Improves Reasoning Efficiency

Paper • 2506.08343 • Published Jun 10 • 54

upvoted a paper 6 months ago

Reasoning Models Can Be Effective Without Thinking

Paper • 2504.09858 • Published Apr 14 • 12

upvoted 2 papers 7 months ago

Thought Manipulation: External Thought Can Be Efficient for Large Reasoning Models

Paper • 2504.13626 • Published Apr 18 • 7

PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters

Paper • 2504.08791 • Published Apr 7 • 137

upvoted 3 papers 8 months ago

MoCha: Towards Movie-Grade Talking Character Synthesis

Paper • 2503.23307 • Published Mar 30 • 138

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Paper • 2503.09573 • Published Mar 12 • 73

Forgetting Transformer: Softmax Attention with a Forget Gate

Paper • 2503.02130 • Published Mar 3 • 32

upvoted 2 papers 9 months ago

Chain of Draft: Thinking Faster by Writing Less

Paper • 2502.18600 • Published Feb 25 • 50

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 248

upvoted an article 10 months ago

Article

Welcome to Inference Providers on the Hub 🔥

Jan 28

• 490

upvoted a paper 10 months ago

Enhancing Human-Like Responses in Large Language Models

Paper • 2501.05032 • Published Jan 9 • 58

upvoted a collection 11 months ago

Human-Like LLMs

Human-Like LLMs series. • 5 items • Updated Jan 20 • 13

upvoted 2 papers about 1 year ago

Were RNNs All We Needed?

Paper • 2410.01201 • Published Oct 2, 2024 • 53

Diffusion Models Are Real-Time Game Engines

Paper • 2408.14837 • Published Aug 27, 2024 • 126

upvoted 2 papers almost 2 years ago

TURNA: A Turkish Encoder-Decoder Language Model for Enhanced Understanding and Generation

Paper • 2401.14373 • Published Jan 25, 2024 • 11

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Paper • 2312.00752 • Published Dec 1, 2023 • 146

upvoted a collection almost 2 years ago

Mamba

Mamba SSM Models with hf_integration. • 7 items • Updated Dec 28, 2023 • 7

upvoted a paper about 2 years ago

Retentive Network: A Successor to Transformer for Large Language Models

Paper • 2307.08621 • Published Jul 17, 2023 • 172