Ultra-FineWeb-classifier / assets /ultra-fineweb-pipeline.png
BigDong's picture
add Ultra-FineWeb lighteval task python file
4ed02d8
ultra-fineweb-pipeline.png

Large File Pointer Details

( Raw pointer file )
SHA256:
f690af4e2e1a5e7e319f64f5bad27ee955371cd9feebb620262b5e12c0c17cb2
Pointer size:
131 Bytes
·
Size of remote file:
279 kB
·
Xet backed hash:
f74459ede37e629d45bc271a60314dca6261dd22c12c07e56d33778169c1c77a

Xet efficiently stores Large Files inside Git, intelligently splitting files into unique chunks and accelerating uploads and downloads. More info.