
Jian Liao PRO

imjliao

AI & ML interests

None yet

Recent Activity

liked a model about 2 months ago

jinaai/jina-embeddings-v4

liked a model 2 months ago

ChatDOC/OCRFlux-3B

liked a model 3 months ago

microsoft/VibeVoice-1.5B


Organizations

upvoted a paper 3 months ago

Beyond Context Limits: Subconscious Threads for Long-Horizon Reasoning

Paper • 2507.16784 • Published Jul 22 • 120

upvoted 2 articles 5 months ago

Article

GRPO for GUI Grounding Done Right

Jun 11 • 34

Article

Tiny Agents: an MCP-powered agent in 50 lines of code

Apr 25 • 301

upvoted a paper 6 months ago

Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis

Paper • 2505.13227 • Published May 19 • 45

upvoted a paper 7 months ago

T1: Tool-integrated Self-verification for Test-time Compute Scaling in Small Language Models

Paper • 2504.04718 • Published Apr 7 • 42

upvoted 3 papers 8 months ago

upvoted a collection 9 months ago

Awesome Computer Use Agents

Collection

https://github.com/ranpox/awesome-computer-use • 25 items • Updated Dec 18, 2024 • 17

upvoted a paper 9 months ago

Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction

Paper • 2412.04454 • Published Dec 5, 2024 • 71

upvoted a collection 10 months ago

Qwen2.5-VL (All Versions)

Collection

All versions of Qwen2.5-VL including the new 32B version and 4-bit, 16-bit and more! • 16 items • Updated 13 days ago • 21

upvoted a paper 12 months ago

FlowMind: Automatic Workflow Generation with LLMs

Paper • 2404.13050 • Published Mar 17, 2024 • 34

upvoted 3 articles about 1 year ago

Article

Visually Multilingual: Introducing mcdse-2b

Oct 27, 2024 • 41

Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

May 14, 2024 • 273

Article

Docmatix - a huge dataset for Document Visual Question Answering

Jul 18, 2024 • 78

upvoted a paper over 1 year ago

THOUGHTSCULPT: Reasoning with Intermediate Revision and Search

Paper • 2404.05966 • Published Apr 9, 2024 • 2

upvoted a collection over 1 year ago

🤖 Agents

Collection

21 items • Updated Dec 31, 2024 • 167

upvoted a paper over 1 year ago

FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models

Paper • 2402.10986 • Published Feb 16, 2024 • 81

upvoted 2 papers almost 2 years ago

WebLINX: Real-World Website Navigation with Multi-Turn Dialogue

Paper • 2402.05930 • Published Feb 8, 2024 • 39

Meta-Prompting: Enhancing Language Models with Task-Agnostic Scaffolding

Paper • 2401.12954 • Published Jan 23, 2024 • 33
