Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

PeterLee6094's picture

1

PeterLee6094

PeterLee6094

·

AI & ML interests

NLP && CV

Organizations

None yet

Collections 2

Intuitive Fine-Tuning: Towards Unifying SFT and RLHF into a Single Process

Paper • 2405.11870 • Published May 20, 2024
MIRAGE: Exploring How Large Language Models Perform in Complex Social Interactive Environments

Paper • 2501.01652 • Published Jan 3
RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios

Paper • 2412.08972 • Published Dec 12, 2024 • 10

Large Language Diffusion Models

Paper • 2502.09992 • Published Feb 14 • 122
MM-RLHF: The Next Step Forward in Multimodal LLM Alignment

Paper • 2502.10391 • Published Feb 14 • 34
Diverse Inference and Verification for Advanced Reasoning

Paper • 2502.09955 • Published Feb 14 • 18
Selective Self-to-Supervised Fine-Tuning for Generalization in Large Language Models

Paper • 2502.08130 • Published Feb 12 • 9

Intuitive Fine-Tuning: Towards Unifying SFT and RLHF into a Single Process

Paper • 2405.11870 • Published May 20, 2024
MIRAGE: Exploring How Large Language Models Perform in Complex Social Interactive Environments

Paper • 2501.01652 • Published Jan 3
RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios

Paper • 2412.08972 • Published Dec 12, 2024 • 10

Large Language Diffusion Models

Paper • 2502.09992 • Published Feb 14 • 122
MM-RLHF: The Next Step Forward in Multimodal LLM Alignment

Paper • 2502.10391 • Published Feb 14 • 34
Diverse Inference and Verification for Advanced Reasoning

Paper • 2502.09955 • Published Feb 14 • 18
Selective Self-to-Supervised Fine-Tuning for Generalization in Large Language Models

Paper • 2502.08130 • Published Feb 12 • 9

models 1

PeterLee6094/UltralHermes-2.5-Mistral-7B

Updated Apr 23 • 1

datasets 0

None public yet

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs