ByT5: Towards a token-free future with pre-trained byte-to-byte models Paper • 2105.13626 • Published May 28, 2021 • 5
pplx-embed Collection Diffusion-Pretrained Dense and Contextual Embeddings • 7 items • Updated 27 days ago • 93
LateOn-Code 💻 Collection State-of-the-art late interaction code retrieval models • 6 items • Updated 23 days ago • 17
view article Article LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling Feb 12 • 51
view article Article LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family Jan 19 • 88