Improve model card: Update pipeline tag, add comprehensive details and demos
#2, opened by nielsr (HF Staff)
This PR significantly enhances the model card for WebDancer by:
- Updating the `pipeline_tag` from `text-generation` to `image-text-to-text` to accurately reflect the model's multimodal capabilities in processing visual inputs (e.g., GUI screenshots) for web interaction. This ensures the model is discoverable under the correct pipeline at https://huggingface.co/models?pipeline_tag=image-text-to-text.
- Adding relevant `tags` such as `web-agent`, `gui-agent`, `multimodal`, `reinforcement-learning`, and `react` for improved discoverability and context.
- Including the paper abstract for a concise overview.
- Explicitly linking to the GitHub repository for code and project details.
- Incorporating the detailed "Features" section, "Quick Start" instructions, and embedded "Demos" from the project's GitHub README, so the model is easier to understand and use.
- Adding the academic citation for proper attribution.
This update makes the model card more informative, accessible, and aligned with Hugging Face Hub best practices.
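
For reviewers who want to see the metadata changes concretely, here is a minimal sketch of the YAML front matter that the `pipeline_tag` and `tags` bullets above describe. The helper `render_front_matter` is hypothetical and written only for illustration; the actual model card may contain additional metadata fields and a different field order, and only the values listed in this PR description are used.

```python
# Hypothetical helper (not part of this PR): renders a model card's YAML
# front matter from a pipeline tag and a list of tags, using only the stdlib.
def render_front_matter(pipeline_tag, tags):
    lines = ["---", f"pipeline_tag: {pipeline_tag}", "tags:"]
    # Each tag becomes one YAML list item.
    lines += [f"- {tag}" for tag in tags]
    lines.append("---")
    return "\n".join(lines)


# Values taken from the PR description; the real card may carry more fields.
front_matter = render_front_matter(
    "image-text-to-text",
    ["web-agent", "gui-agent", "multimodal", "reinforcement-learning", "react"],
)
print(front_matter)
```

The printed block is what would sit at the very top of the model card's `README.md`, ahead of the prose sections.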