A Survey on Hypothesis Generation for Scientific Discovery in the Era of Large Language Models Paper • 2504.05496 • Published Apr 7, 2025
Beyond Recognition: Evaluating Visual Perspective Taking in Vision Language Models Paper • 2505.03821 • Published May 3, 2025 • 25
Joint MoE Scaling Laws: Mixture of Experts Can Be Memory Efficient Paper • 2502.05172 • Published Feb 7, 2025 • 2
Mixture of Tokens: Efficient LLMs through Cross-Example Aggregation Paper • 2310.15961 • Published Oct 24, 2023 • 1
Writing in the Margins: Better Inference Pattern for Long Context Retrieval Paper • 2408.14906 • Published Aug 27, 2024 • 144
MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts Paper • 2401.04081 • Published Jan 8, 2024 • 73