Saturday, April 14, 2007

WikiOntology Idea

I'm envisioning a web-deliverable system for the PropBank and FrameNet data that allows users to modify or amend relations between the arguments, with the goal of getting as many of these binary subpredicates into place as possible. Since there are thousands of them, some sort of wiki technology might be useful, with users able to download the up-to-date dataset. Basically, there would be a UI that showed the arguments for each n-ary predicate or frame and let users amend the relationship types between them. Users would be able to use URIs from other domains or create new ones in the project's domain.
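To make that a bit more concrete, here is a rough Python sketch of the data the UI might sit on top of. Everything in it is an assumption for illustration: the `Frame` class, the `open_pairs` and `amend` methods, the example.org base URI, and the sample argument names are all made up and not part of PropBank or FrameNet.

```python
from dataclasses import dataclass, field
from itertools import combinations
from typing import Dict, List, Tuple

# Hypothetical project domain; the real one would be whatever hosts the wiki.
BASE = "http://example.org/wikiontology/"


@dataclass
class Frame:
    """An n-ary predicate or frame plus the binary relations users have added."""
    name: str
    arguments: List[str]
    # Maps a directed argument pair (a, b) to the URI of the relation type.
    relations: Dict[Tuple[str, str], str] = field(default_factory=dict)

    def open_pairs(self) -> List[Tuple[str, str]]:
        """Argument pairs with no relation yet in either direction."""
        return [
            (a, b)
            for a, b in combinations(self.arguments, 2)
            if (a, b) not in self.relations and (b, a) not in self.relations
        ]

    def amend(self, a: str, b: str, relation: str) -> str:
        """Set or overwrite the relation from argument a to argument b."""
        if a not in self.arguments or b not in self.arguments:
            raise ValueError("both arguments must belong to the frame")
        # A full URI from another domain is kept as is; a bare name is
        # minted as a new URI under the project's own domain.
        uri = relation if "://" in relation else BASE + relation
        self.relations[(a, b)] = uri
        return uri


frame = Frame("went", ["the boy", "the store", "his bike"])
frame.amend("the boy", "his bike", "owns")
print(frame.open_pairs())
```

The list from `open_pairs` is what the UI would show as still needing input, which fits the goal of filling in as many binary subpredicates as possible.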

Given the sentence "The boy went to the store on his bike.", we might obtain from some parsers a predicate resembling "went(the boy, to the store on his bike)". The idea is to use user input to build a dataset that matches that sentence to a set of predicates that also includes "utilized(the boy, his bike, [to go] to the store)" and "owns(the boy, [a] bike)". Some sort of XML output would capture this, and a UI allowing pairwise connections between noun arguments would let users add all the knowledge contained in a sentence. Ideally, these predicates would be further simplified until only simple nouns were interrelated. The overall goal is to use wiki-collaboration to create a dataset for SRL algorithms.
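As a rough idea of what the XML output could look like, here is a small Python sketch using the standard library's xml.etree.ElementTree. The element and attribute names (`sentence`, `text`, `predicate`, `arg`, `n`) and the `sentence_to_xml` function are my own placeholders, not an existing schema.

```python
import xml.etree.ElementTree as ET


def sentence_to_xml(text, predicates):
    """Serialize a sentence and its predicates to an XML string.

    `predicates` is a list of (name, [arguments]) pairs. The layout is a
    made-up placeholder, not an established format.
    """
    root = ET.Element("sentence")
    ET.SubElement(root, "text").text = text
    for name, args in predicates:
        pred = ET.SubElement(root, "predicate", name=name)
        for position, arg in enumerate(args):
            # Keep the argument order explicit so n-ary predicates survive a round trip.
            ET.SubElement(pred, "arg", n=str(position)).text = arg
    return ET.tostring(root, encoding="unicode")


xml_out = sentence_to_xml(
    "The boy went to the store on his bike.",
    [
        ("went", ["the boy", "to the store on his bike"]),
        ("utilized", ["the boy", "his bike", "[to go] to the store"]),
        ("owns", ["the boy", "[a] bike"]),
    ],
)
print(xml_out)
```

The first predicate stands in for what a parser might produce, and the other two are the kind users would contribute through the UI.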
