The Curious Case of Neural Text Degeneration
			Paper
			•
			1904.09751
			•
			Published
				
			•
				
				3
			
Figure 5: The probability mass assigned to partial human sentences. Flat distributions lead to many moderately probable tokens, while peaked distribut