Dependency Parsing
Dependency Parsing is the process to analyze the grammatical structure in a sentence and find out related words as well as the type of the relationship between them.
medtext utilizes the Universal Dependency Graph (UDG) to describe the grammatical relationships in a sentence. UDG is a directed graph, which represents all universal dependency information in a sentence. The vertices in a UDG represent the information such as the word, part-of-speech and the word lemma. The edges in a UDG represent the typed dependencies from the governor to its dependent and are labeled with the corresponding dependency type. UDG effectively represents the syntactic head of each word in a sentence and the dependency relation between words.
spaCy: See
Text preprocessing>spaCyStanza: See
Text preprocessing>StanzaBllip: medtext obtains the universal dependencies by applying the Stanford dependency converter with the
CCProcessedandUniversaloption.
Usage:
medtext-tree2dep [options] -i FILE -o FILE
Options:
-i FILE Inpput file
-o FILE Output file
--overwrite Overwrite the existing file
from medtext_parse.models.tree2dep import BioCPtb2DepConverter
converter = BioCPtb2DepConverter()