Weakly Supervised RE
The idea here is to either:
- start out with a set of hand-crafted rules and automatically find new ones from the unlabeled text data, through and iterative process (bootstrapping)
- start out with a sed of seed tuples, describing entities with a specific relation (e.g. seed={(ORG:IBM, LOC:Armonk), (ORG:Microsoft, LOC:Redmond)} states entities having the relation “based in”)
|
|
Snowball is a fairly old example of an algorithm which does this:
|
Example
- seed tuple:
<Mark Twain, Elmira> - grep/google for the environments of the seed tuple:
Mark Twain is buried in Elmira, NYX is buried in Y
The grave of Mark Twain is in ElmiraThe grave of X is in Y
Elmira is Mark Twain’s final resting placeY is X’s final resting place
- use those patterns to grep for new tuples
- iterate
---cognitive-computing---machine-intelligence/ai---subfields/natural-language-processing-(nlp)---computational-linguistics/information-retrieval-(ir)---information-extraction-(ie)/entity-relation-extraction-(re)/weakly-supervised-re-(bootstrapping)/weakly-supervised-relation-extraction.png)