Collections including paper arxiv:2504.04842
-
One Shot, One Talk: Whole-body Talking Avatar from a Single Image
Paper • 2412.01106 • Published • 24
MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation
Paper • 2412.04448 • Published • 10
IDOL: Instant Photorealistic 3D Human Creation from a Single Image
Paper • 2412.14963 • Published • 6
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models
Paper • 2502.01061 • Published • 222
-
CoLLM: A Large Language Model for Composed Image Retrieval
Paper • 2503.19910 • Published • 15
LOCATEdit: Graph Laplacian Optimized Cross Attention for Localized Text-Guided Image Editing
Paper • 2503.21541 • Published • 1
HumanDreamer-X: Photorealistic Single-image Human Avatars Reconstruction via Gaussian Restoration
Paper • 2504.03536 • Published • 13
FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis
Paper • 2504.04842 • Published • 35
-
VividFace: A Diffusion-Based Hybrid Framework for High-Fidelity Video Face Swapping
Paper • 2412.11279 • Published • 13
MagicFace: High-Fidelity Facial Expression Editing with Action-Unit Control
Paper • 2501.02260 • Published • 5
GaussianAvatar-Editor: Photorealistic Animatable Gaussian Head Avatar Editor
Paper • 2501.09978 • Published • 6
FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation
Paper • 2502.13995 • Published • 9
-
EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
Paper • 2402.17485 • Published • 195
VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior
Paper • 2312.01841 • Published • 1
MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
Paper • 2311.16498 • Published • 1
GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians
Paper • 2312.02134 • Published • 2