Appen is creating a pool of qualified native speakers of Pashto (as spoken in Afghanistan) with an academic background in Linguistics for ongoing linguistic projects. Registering may not result in immediate engagement in one of our projects, but we will reach out to you if you are shortlisted and once a project is scheduled to start.
You may be asked to complete linguistic screeners so that we can better assess your skills and decide which task is the right match.
Most opportunities are work-from-home, contract-based projects.
The work will involve assisting in the creation of the Lexicon component of a Speech Database (SDB). A Lexicon is an electronic pronunciation dictionary (in text format) consisting of phonetic representations of all words transcribed from the audio files in an SDB. Lexicons also include syllable boundaries and stress or tone mark-up. They usually include variant pronunciations and dialectal or regional variation, and can include additional labelling such as ‘foreign word’, ‘Proper Name’, ‘acronym’, ‘homonym’, etc.
The work will also involve creating documents that describe the language for the purpose of language technology development, and assisting with the resolution of any spelling standardisation or dialectal variation issues in the language. The majority of the linguist’s time will be spent checking and correcting transcriptions of words in phonetic script. Training in Appen processes will be provided.
Preferred Knowledge, Skills and Abilities: