Semantic Relation Extraction
- extracting semantics/semantic relations from unstructured text data
An Approach to Lexical-Syntactic Pattern Discovery
- pick one Semantic Relation at a time
- find lexico-syntactic patterns that express that SR
- create a corpus with positive and negative examples
- learn constraints that discover the SR
- evaluate performance: precision and recall
Lexical-Syntactic Patterns Expressing Meronymy
Variety of meronymy expressions
|
Explicit Part-Whole Constructions |
|
|
Implicit Part-Whole Constructions |
|
|
Manual Inspection of Pattern Types |
|
Get pattern statistics and select the most frequent ones:
NP1 of NP2NP1’s NP2NP1 have NP2
Meronymy Learning Algorithm
- input: positive and negative meronymic examples of concept pairs
- output: semantic constraints on concepts
algorithm:
- Generalize the training examples:
- example:
{part#sense; whole#sense; target} - generalized example:
{part#sense, classpart#sense; whole#sense, classwhole#sense; target} - e.g.
{aria#1, entity#1; opera#1, abstraction#6; Yes} - types of examples:
- positive:
{Xclass#sense; Yclass#sense; Yes} - negative:
{Xclass#sense; Yclass#sense; No} - ambiguous:
{Xclass#sense; Yclass#sense; Yes/No}
- positive:
- example:
- Learn constraints for the unambiguous examples (positive and negative)
- Specialize the ambiguous examples:
- based on the IS-A information provided by WordNet
- initially, each semantic class represented the root of one of the noun hierarchies in WordNet
- Go down in the hierarchy and specialize the semantic classes that correspond to the part, respectively whole
- repeat the specialization until no more ambiguous examples in the training corpus