Paper2Poster

community

https://paper2poster.github.io/

Paper2Poster/Paper2Poster

AI & ML interests

None defined yet.

Recent Activity

KevinQHLin authored a paper 10 days ago

Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans?

KevinQHLin submitted a paper 11 days ago

Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans?

KevinQHLin authored a paper 11 days ago

Show-o: One Single Transformer to Unify Multimodal Understanding and Generation

View all activity

KevinQHLin

authored a paper 10 days ago

Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans?

Paper • 2512.13281 • Published 13 days ago • 63

KevinQHLin

submitted a paper to Daily Papers 11 days ago

Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans?

Paper • 2512.13281 • Published 13 days ago • 63

KevinQHLin

authored 2 papers 11 days ago

Show-o: One Single Transformer to Unify Multimodal Understanding and Generation

Paper • 2408.12528 • Published Aug 22, 2024 • 51

UI-Vision: A Desktop-centric GUI Benchmark for Visual Perception and Interaction

Paper • 2503.15661 • Published Mar 19 • 2

KevinQHLin

authored a paper about 1 month ago

Computer-Use Agents as Judges for Generative User Interface

Paper • 2511.15567 • Published Nov 19 • 52

KevinQHLin

authored a paper about 2 months ago

Grounding Computer Use Agents on Human Demonstrations

Paper • 2511.07332 • Published Nov 10 • 105

HideOnBush

authored 2 papers about 2 months ago

Grounding Computer Use Agents on Human Demonstrations

Paper • 2511.07332 • Published Nov 10 • 105

InteracSPARQL: An Interactive System for SPARQL Query Refinement Using Natural Language Explanations

Paper • 2511.02002 • Published Nov 3 • 1

KevinQHLin

authored a paper about 2 months ago

VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation

Paper • 2511.02778 • Published Nov 4 • 101

weipang142857

authored 3 papers about 2 months ago

Rethinking Spectral Augmentation for Contrast-based Graph Self-Supervised Learning

Paper • 2405.19600 • Published May 30, 2024

DREAM: Improving Video-Text Retrieval Through Relevance-Based Augmentation Using Large Foundation Models

Paper • 2404.05083 • Published Apr 7, 2024

LazyVLM: Neuro-Symbolic Approach to Video Analytics

Paper • 2505.21459 • Published May 27

HideOnBush

authored a paper about 2 months ago

The Underappreciated Power of Vision Models for Graph Structural Understanding

Paper • 2510.24788 • Published Oct 27 • 35

weipang142857

authored a paper about 2 months ago

The Underappreciated Power of Vision Models for Graph Structural Understanding

Paper • 2510.24788 • Published Oct 27 • 35

HideOnBush

authored 6 papers 2 months ago

BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks

Paper • 2412.04626 • Published Dec 5, 2024 • 14

AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding

Paper • 2502.01341 • Published Feb 3 • 39

Rethinking Spectral Augmentation for Contrast-based Graph Self-Supervised Learning

Paper • 2405.19600 • Published May 30, 2024

Communication-Efficient Decentralized Online Continuous DR-Submodular Maximization

Paper • 2208.08681 • Published Aug 18, 2022

Roughness Index for Loss Landscapes of Neural Network Models of Partial Differential Equations

Paper • 2103.11069 • Published Mar 20, 2021

DREAM: Improving Video-Text Retrieval Through Relevance-Based Augmentation Using Large Foundation Models

Paper • 2404.05083 • Published Apr 7, 2024