OpenPecha is an etext and annotations store made available on GitHub and through a set of APIs.
The project’s primary aim is to facilitate the collection, proofreading, and enrichment of etexts by leveraging language technology and collaboration.
New to OpenPecha? Here are a few places to get started using our data and tools:
Download a featured dataset
Get the latest Pecha datasets to train Tibetan-language AI models.
OCR scanned BDRC books
Use the OCR Pipeline to OCR scans in the BDRC collection.
Get Pecha Toolkit
pipand get up and running in minutes.
Get the latest news
Read our blog to learn the latest from OpenPecha and the Tibetan AI space.