23 59 54

Joya Chen PRO

chenjoya

https://chenjoya.github.io/

chenjoya

AI & ML interests

Video LLM

Recent Activity

upvoted a paper about 9 hours ago

Revisiting Multimodal Positional Encoding in Vision-Language Models

upvoted a paper about 23 hours ago

VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation

upvoted a paper 7 days ago

ChronoPlay: A Framework for Modeling Dual Dynamics and Authenticity in Game RAG Benchmarks

View all activity

Organizations

Collections 1

Stream webcam images and chat in real-time

Runtime error

LiveCC

🐠

LiveCC-7B-Instruct

Runtime error

Videollm Online

🏢

Upload a video and ask questions in real-time

models 5

chenjoya/LiveCC-7B-Base

8B • Updated Apr 25 • 11 • 6

chenjoya/LiveCC-7B-Instruct

8B • Updated Apr 25 • 814 • 40

chenjoya/Qwen2-VL-7B-LLaVAInstruct

8B • Updated Apr 16 • 3 • 1

chenjoya/Qwen2-VL-7B-LiveCCInstruct

8B • Updated Apr 14 • 1 • 1

chenjoya/videollm-online-8b-v1plus

Video-Text-to-Text • Updated Jul 13, 2024 • 13.4k • 30

datasets 4

chenjoya/spc_demo_videos

Viewer • Updated Sep 15 • 5 • 9

chenjoya/Live-WhisperX-526K

Preview • Updated Aug 4 • 13.1k • 6

chenjoya/Live-CC-5M

Preview • Updated May 2 • 1.2k • 4

chenjoya/videollm-online-chat-ego4d-134k

Updated Jun 18, 2024 • 83 • 13

Joya Chen PRO

AI & ML interests

Recent Activity

Organizations

Collections 1

Papers 10

spaces 3 Sort: Recently updated

SimpleStreamTrigger

LiveCC

Videollm Online

models 5 Sort: Recently updated

datasets 4 Sort: Recently updated

spaces 3

models 5

datasets 4