arxiv:2502.09604
							
						Shannon Shen
shannons
		AI & ML interests
None yet
		Recent Activity
						upvoted 
								a
								paper
							
						21 days ago
						
					
						
						
						SPG: Sandwiched Policy Gradient for Masked Diffusion Language Models
						
						published
								a dataset
							
						3 months ago
						
					
						
						
						
						shannons/ot3-1.2m-10k
						
						updated
								a dataset
							
						3 months ago
						
					
						
						
						
						rl-rag/combined-sft-training-data-v20250724