|
AGFL Grammar
for the
Russian Language |
|
The Rus4IR Grammar consists of two levels: a level of context-free rules,
extended with set-valued features (also described by Context-Free rules) which
express agreement between constituents. For complete coverage of the topic see
the official AGFL
site.
Being a language with comparatively free word order, Russian needs an
intricate system of rules for producing a good analysis, more complicated than
for English. Furthermore, the Rus4IR Grammar is intended for Information
Retrieval purposes, hence has been made robust to unknown or misspelled words
and ill-formed sentences. The grammar consists of a number of modules designed
to represent various structures of the language:
- The root grammar, which depicts the basics of morphology and syntax of Russian, and
includes (uses) other modules. One can use either the grammar designed
for morphological analysis of a random word, or the version for morphosyntactical analysis of given text
(with local syntax). For switching between the modes the root grammar file should be edited in order to state which language
unit is to be used as a segment ("oneword" for morphological and "phrase" for morphosyntactical analysis).
- general group structure for the Russian language;
- a module defining the structure of
nounphrases;
- a module defining the structure of
verbphrases;
- modules describing the morphology of verbs, nouns, pronouns and adjectives;
- a module defining the so-called global affixes;
- a module defining standard prefixes for Russian;
- modules describing regular inflections for the primary parts of speech (verbs, nouns and adjectives);
- a module defining irregular forms for secondary parts of
speech (numerals, pronouns);
You can download a zip-file containing the current grammar
files here
back to Main Page
back to AGFL Main
Page