Olam Dictionary Conversion Project

I’ve started a project to import digitized and transcribed dictionaries into Olam. Here’s the repo: GitHub - beniza/dict-dictpress-conversions

Anyone who’s interested is invited to join the project.

Sabdatharavali data has already been transformed and is now ready to be imported (it’s not fully tested, but Kailash looked at a sample and confirmed that it looks okay).

I’m currently working on Bailey’s 1849 English-Malayalam Dictionary. I have also started processing Hermann Gundert’s Dictionary.

2 Likes

This is great @benVar!

We can stick to the src:$dict tag (tag field) instead of meta JSON. Eg: src:gundert, src:bailey etc.

Semantically, tagging is the better choice here than meta to mark the source. Tags also have first-class support in the search API.

1 Like