alwaysaditi's picture
End of training
dc78b20 verified
a novel use of statistical parsing to extract information from text since 1995, a few statistical parsing algorithms have demonstrated a breakthrough in parsing accuracy, as measured against the upenn treebank as a gold standard. in this paper we report adapting a lexicalized, probabilistic context-free parser to information extraction and evaluate this new technique on muc-7 template elements and template relations. our rule-based methods employ a number of linguistic rules to capture relation patterns. we take the view that relation extraction is just a form of probabilistic parsing where parse trees are augmented to identify all relations. we integrate various tasks such as part-of-speech tagging, named entity recognition, template element extraction and relation extraction, in a single model. we combine entity recognition, parsing, and relation extraction into a jointly-trained single statistical parsing model that achieves improved performance on all the subtasks. part of the contribution of the current work is to suggest that joint decoding can be effective even when joint training is not possible because jointly-labeled data is unavailable.