Qwen Image Edit (exps) Collection adapter LoRA developed for Qwen’s Qwen-Image-Edit-2511 image-to-image model • 8 items • Updated about 20 hours ago • 2
YOLO26 Models Collection YOLO26 models: detection, segmentation, classification, pose, and OBB variants with demo Space. • 31 items • Updated 4 days ago • 15
A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Doubao 1.8, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5 Paper • 2601.10527 • Published 4 days ago • 19
FlowAct-R1: Towards Interactive Humanoid Video Generation Paper • 2601.10103 • Published 4 days ago • 18
RL-AWB: Deep Reinforcement Learning for Auto White Balance Correction in Low-Light Night-time Scenes Paper • 2601.05249 • Published 11 days ago • 45
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization Paper • 2601.05242 • Published 11 days ago • 194
UltraShape 1.0: High-Fidelity 3D Shape Generation via Scalable Geometric Refinement Paper • 2512.21185 • Published 26 days ago • 29
Nemotron Speech Collection Open, state-of-the-art, production‑ready enterprise speech models from the NVIDIA Speech research team for ASR, TTS, Speaker Diarization and S2S • 16 items • Updated 3 days ago • 18
Qwen Image Edit (Object-Manipulator) Collection Add or remove the specified objects, flexible for both single-image and multi-image modes. • 2 items • Updated 15 days ago • 9
view article Article The Engineering Handbook for GRPO + LoRA with Verl: Training Qwen2.5 on Multi-GPU 17 days ago • 12
DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models Paper • 2512.24165 • Published 20 days ago • 49
Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models Paper • 2512.24618 • Published 19 days ago • 136
Dream-VL & Dream-VLA: Open Vision-Language and Vision-Language-Action Models with Diffusion Language Model Backbone Paper • 2512.22615 • Published 23 days ago • 44