NoCaCoDa logo

Лаборатория типологического изучения языков    Институт лингвистических исследований РАН

Instructions for contributors

Language contributors are linguists with expertise in the target language whose task is to elicit the necessary translations from native speakers and to provide them in the form accessible to "coders". Contributor’s main goal is to provide adequate translations of stimulus sentences into the target language and to identify nominal causal markers employed (if any).

Main adequateness criteria are semantic precision (the same meaning as in the stimulus sentence) and naturalness. Equivalents that fail to meet these criteria do not make their way into the database (but can be mentioned in comments).

Glosses should be provided in a separate line (no alignment needed). Grammatical labels must be in the upper case (ERG, PST, etc.).

If the sentence obtained is semantically different from the questionnaire stimulus, contributors are expected to comment on this and to provide a back translation.

Alternative translational equivalents are welcome. If a given stimulus sentence has more than one equivalent, one of the equivalents must be marked as “main” (other equivalents will be marked “alternate”) based on their naturalness / frequency.

Instructions for coders

A coder is a representative of the NoCaCoDa database who interacts with language contributors and is responsible for preparing spreadsheets that comply with the database’s substantive and technical conventions. The coder’s interaction with language contributors should result in two spreadsheets per language:

1) languagename_data.csv (e.g., “French_data.csv”): entries in this spreadsheet correspond to individual translations of the questionnaire sentences into the target language.

2) languagename_markers.csv (e.g., “French_markers.csv”): entries in this spreadsheet correspond to all distinct nominal causal markers found in theprevious spreadsheet.

Preparing the spreadsheet with translations (languagename_data.csv)

The following fields must be present in the spreadsheet:

Preparing the spreadsheet with nominal causal markers (languagename_markers.csv)

The following fields must be present in the spreadsheet: