9 6 20

Bilge Yücel

bilgeyucel

AI & ML interests

NLP, Semantic Search, LLMs

Recent Activity

updated a Space about 1 month ago

bilgeyucel/captionate

upvoted an article about 2 months ago

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

liked a model 2 months ago

google/embeddinggemma-300m

View all activity

Organizations

updated a Space about 1 month ago

Captionate

📸

Generate Instagram captions from images

upvoted an article about 2 months ago

Article

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

Jul 29

• 191

liked a model 2 months ago

google/embeddinggemma-300m

upvoted an article 2 months ago

Article

Welcome EmbeddingGemma, Google's new efficient embedding model

Sep 4

• 253

reacted to anakin87's post with ❤️ 3 months ago

Post

1087

Haystack can now see 👀

The latest release of the Haystack OSS LLM framework adds a long-requested feature: image support!

📓 Notebooks below

This isn't just about passing images to an LLM. We built several features to enable practical multimodal use cases.

What's new?
🧠 Support for multiple LLM providers: OpenAI, Amazon Bedrock, Google Gemini, Mistral, NVIDIA, OpenRouter, Ollama and more (support for Hugging Face API coming 🔜)
🎛️ Prompt template language to handle structured inputs, including images
📄 PDF and image converters
🔍 Image embedders using CLIP-like models
🧾 LLM-based extractor to pull text from images
🧩 Components to build multimodal RAG pipelines and Agents

I had the chance of leading this effort with @sjrhuschlee (great collab).

📓 Below you can find two notebooks to explore the new features:
󠁯•󠁏󠁏 Introduction to Multimodal Text Generation https://haystack.deepset.ai/cookbook/multimodal_intro
󠁯•󠁏󠁏 Creating Vision+Text RAG Pipelines https://haystack.deepset.ai/tutorials/46_multimodal_rag

(🖼️ image by @bilgeyucel )

reacted to anakin87's post with 🔥 3 months ago

Post

388

🕵️🌐 Building Browser Agents - notebook

No API? No problem.
Browser Agents can use websites like you do: click, type, wait, read.

📓 Step-by-step notebook: https://colab.research.google.com/github/deepset-ai/haystack-cookbook/blob/main/notebooks/browser_agents.ipynb

🎥 In the video, the Agent:
- Goes to Hugging Face Spaces
- Finds black-forest-labs/FLUX.1-schnell
- Expands a short prompt ("my holiday on Lake Como") into a detailed image generation prompt
- Waits for the image
- Returns the image URL

## What else can it do?
Great for information gathering and summarization

🗞️🗞️ Compare news websites and create a table of shared stories with links
▶️ Find content creator social profiles from YouTube videos
🛍️ Find a product's price range on Amazon
🚂 🚌 Gather public transportation travel options

## How is it built?
🏗️ Haystack → Agent execution logic
🧠 Google Gemini 2.5 Flash → Good and fast LLM with a generous free tier
🛠️ Playwright MCP server → Browser automation tools: navigate, click, type, wait...

Even without vision capabilities, this setup can get quite far.

## Next steps
- Try a local open model
- Move from notebook to real deployment
- Incorporate vision

And you? Have you built something similar? What's in your stack?

liked a model 4 months ago

Trendyol/Trendyol-LLM-8B-T1

Text Generation • 8B • Updated Jul 17 • 206 • • 23

liked a dataset 4 months ago

Pablinho/movies-dataset

Viewer • Updated Jul 31, 2024 • 9.84k • 341 • 8

liked a model 10 months ago

sentence-transformers/static-retrieval-mrl-en-v1

liked a model 12 months ago

microsoft/Phi-3.5-vision-instruct

Image-Text-to-Text • 4B • Updated Sep 26, 2024 • 380k • 709

reacted to alielfilali01's post with 👀 about 1 year ago

Post

1773

I feel like this incredible resource hasn't gotten the attention it deserves in the community!

@clefourrier and generally the HuggingFace evaluation team put together a fantastic guidebook covering a lot about 𝗘𝗩𝗔𝗟𝗨𝗔𝗧𝗜𝗢𝗡 from basics to advanced tips.

link : https://github.com/huggingface/evaluation-guidebook

I haven’t finished it yet, but i'am enjoying every piece of it so far. Huge thanks @clefourrier and the team for this invaluable resource !