Sunday, April 15, 2007

Website Design

For this website, which is meant for collaborative generation of a linguistic corpus and ontology, I'm thinking about a drag-and-drop UI where users work with sentences and use gestures to enter the knowledge they obtain from reading them.

I downloaded Phalanger for PHP integration into Visual Studio and .NET (they also have one for Mono). I'm looking at Prototype and MooTools for the JavaScript library. I also found an extension to MediaWiki that adds web services (SOAP, WSDL) to the wiki, and I'll probably code some web services into it.

I'm thinking that the site will show articles with entities (nouns) as draggable objects. Users can manipulate these objects in the environment to record, sentence by sentence, the knowledge they obtain from reading. This will include anaphora resolution and semantic relations between entities. Because entire articles are processed, context will be obtainable from the dataset. Hopefully the interface will be intuitive enough to encourage users to build an ever-improving downloadable resource comparable to the Penn Treebank and Redwoods.
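
As a rough sketch of what those gestures might record behind the scenes, here is a minimal data model in Python. Everything in it is an assumption made for illustration: the names Entity, ArticleAnnotations, resolve_anaphor, relate and antecedent_of are hypothetical, and the real site would more likely store this through PHP or the wiki's web services.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Entity:
    """A noun mention in an article, located by sentence and token index (hypothetical)."""
    sentence: int
    token: int
    text: str


@dataclass
class ArticleAnnotations:
    """Knowledge contributed by users for one article (hypothetical container)."""
    # (anaphor, antecedent) pairs, e.g. a pronoun and the noun it refers to
    coreferences: list = field(default_factory=list)
    # (subject, relation label, object) triples
    relations: list = field(default_factory=list)

    def resolve_anaphor(self, anaphor, antecedent):
        """Record that `anaphor` refers back to `antecedent`."""
        self.coreferences.append((anaphor, antecedent))

    def relate(self, subject, label, obj):
        """Record a labelled semantic relation between two entities."""
        self.relations.append((subject, label, obj))

    def antecedent_of(self, entity):
        """Follow coreference links until reaching an entity with no antecedent."""
        links = dict(self.coreferences)
        seen = set()
        while entity in links and entity not in seen:
            seen.add(entity)
            entity = links[entity]
        return entity


# Example article text: "Mary bought a car. She drives it daily."
mary = Entity(0, 0, "Mary")
car = Entity(0, 3, "car")
she = Entity(1, 0, "She")
it = Entity(1, 2, "it")

notes = ArticleAnnotations()
notes.resolve_anaphor(she, mary)  # user drops "She" onto "Mary"
notes.resolve_anaphor(it, car)    # user drops "it" onto "car"
# The relation is stored against the resolved nouns, not the pronouns.
notes.relate(notes.antecedent_of(she), "drives", notes.antecedent_of(it))
```

Storing relations against resolved antecedents rather than pronouns is one possible design choice; it keeps the triples readable outside the article, while the coreference pairs preserve what the user actually linked.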
