This study describes the on-going development of the finite-state description for an endangered minority language, KomiZyrian. This work is located in the context where large written and spoken language corpora are available, which creates a set of unique challenges that have to be, and can be, addressed.
We describe how we have designed the transducer so that it can benefit from existing open-source infrastructures and therefore be as reusable as possible.