Shira Guskin

sguskin · shira-g

AI & ML interests

None yet

Recent Activity

upvoted an article about 1 month ago

Accelerating Qwen3-8B Agent on Intel® Core™ Ultra with Depth-Pruned Draft Models

published an article about 1 month ago

Accelerating Qwen3-8B Agent on Intel® Core™ Ultra with Depth-Pruned Draft Models

updated a model 12 months ago

OpenVINO/Llama-3.1-8B-Instruct-FastDraft-150M-int8-ov

upvoted an article about 1 month ago

Accelerating Qwen3-8B Agent on Intel® Core™ Ultra with Depth-Pruned Draft Models

Article • Sep 29 • 20

upvoted a collection 12 months ago

Speculative Decoding Draft Models

Collection of OpenVINO optimized efficient draft models for speculative decoding • 4 items • Updated Sep 16 • 10

upvoted a paper about 1 year ago

RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation

Paper • 2408.02545 • Published Aug 5, 2024 • 39