Tianyu Zhang's picture

4 23 8

Tianyu Zhang

TianyuZhang

·

https://ai.t-zhang.com

AI & ML interests

Vision Language Modeling, Model Merging, Zero Sum Game, Climate Change

Recent Activity

upvoted a paper 12 days ago

The Underappreciated Power of Vision Models for Graph Structural Understanding

liked a model 16 days ago

ByteDance/Ouro-2.6B-Thinking

liked a model 16 days ago

ByteDance/Ouro-2.6B

View all activity

Organizations

authored 11 papers 17 days ago

ClimateGAN: Raising Climate Change Awareness by Generating Images of Floods

Paper • 2110.02871 • Published Oct 6, 2021

MuPT: A Generative Symbolic Music Pretrained Transformer

Paper • 2404.06393 • Published Apr 9, 2024 • 16

Large-scale Contrastive Language-Audio Pretraining with Feature Fusion and Keyword-to-Caption Augmentation

Paper • 2211.06687 • Published Nov 12, 2022 • 4

BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks

Paper • 2412.04626 • Published Dec 5, 2024 • 14

STRICT: Stress Test of Rendering Images Containing Text

Paper • 2505.18985 • Published May 25

A Single Merging Suffices: Recovering Server-based Learning Performance in Decentralized Learning

Paper • 2507.06542 • Published Jul 9

MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation

Paper • 2406.07529 • Published Jun 11, 2024

Improving GUI Grounding with Explicit Position-to-Coordinate Mapping

Paper • 2510.03230 • Published Oct 3 • 3

Chronological Thinking in Full-Duplex Spoken Dialogue Language Models

Paper • 2510.05150 • Published Oct 2

Scope: Selective Cross-modal Orchestration of Visual Perception Experts

Paper • 2510.12974 • Published Oct 14

Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published 18 days ago • 211

authored a paper 9 months ago

AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding

Paper • 2502.01341 • Published Feb 3 • 39

authored a paper over 1 year ago

VCR: Visual Caption Restoration

Paper • 2406.06462 • Published Jun 10, 2024 • 13