Pixel-based Encoding of Language

Recipient
Desmond Elliott
University of Copenhagen
Project number:
00053122
Grant amount
5.990.311 DKK
Year
2022

Project description

Natural language processing plays an increasingly important part of our digital lives but it works best for high-resource languages due to the amount of data needed to train models. This project will create new models that can process any language rendered in an image, based on the insight that written languages share visual similarities, resulting in high-quality models for thousands of languages. This grant will fund two PhD students, one postdoc, and equipment.