Status Presentation - WebKnox: Approaches for Automatically Building a Knowledge Base from the Web (Statusvortrag)
- Datum
- 06.09.2011
- Zeit
- 14:00 - 15:00
- Sprecher
- Dipl.-Medieninf. David Urbansky
- Zugehörigkeit
- Institut für Systemarchitektur, Lehrstuhl Rechnernetze
- Sprache
- en
- Hauptthema
- Informatik
- Andere Themen
- Informatik
- Beschreibung
- Recent studies have shown that more than half of the queries on search engines are about entities such as people, products, or places. Today's search engines, however, do not excel in answering those queries with entity-centric results, but rather with documents that are about those entities. For this reason, the user has the burden of clicking through different search results to gather information about the entity in which he is interested. Having structured information about entities is also of great use in the context on the Web of Data where information is stored in a machine-readable manner and thus users' search intents can be answered more precisely. In order to provide users with aggregated information about these entities of interest a large collection of entities has to be built. This collection must be updated continuously because new entities (for example, products such as mobile phones) are released almost on a daily basis. Building and maintaining such a knowledge base manually requires substantial effort and does not scale well when entities from many different domains are targeted. Today only a few aggregators exist that extract entity names from web pages, enrich them with facts, and publish them as Linked Data. Two such aggregators are DBpedia and Freebase. These systems rely, however, on very few sources (mostly Wikipedia), on manually curated data, or on direct user input. The goal of WebKnox is to extract entity names from different domains from the Web with as little manual effort as possible. Each entity is then enriched with more information, such as facts, questions/answers, and multimedia objects to provide a good overview of what each entity resembles. The contributions of WebKnox are extraction and assessment techniques to automatically create a large database of entities from the World Wide Web. The results can be used in multiple practical applications, including question answering, resolving entity-centric search questions, and improving named entity recognition Betreuer: Prof. Dr. rer. nat. habil. Dr. h. c. Alexander Schill Fachreferent: Prof. Dr.-Ing. Michael Schroeder
Letztmalig verändert: 06.09.2011, 09:35:34
Veranstaltungsort
TUD Andreas-Pfitzmann-Bau (Informatik) (INF 1004 (Ratssaal))Nöthnitzer Straße4601069Dresden
- Homepage
- https://navigator.tu-dresden.de/etplan/apb/00
Veranstalter
TUD InformatikNöthnitzer Straße4601069Dresden
- Telefon
- +49 (0) 351 463-38465
- Fax
- +49 (0) 351 463-38221
- Homepage
- http://www.inf.tu-dresden.de
Legende
- Ausgründung/Transfer
- Bauing., Architektur
- Biologie
- Chemie
- Elektro- u. Informationstechnik
- für Schüler:innen
- Gesellschaft, Philos., Erzieh.
- Informatik
- Jura
- Maschinenwesen
- Materialien
- Mathematik
- Medizin
- Physik
- Psychologie
- Sprache, Literatur und Kultur
- Umwelt
- Verkehr
- Weiterbildung
- Willkommen
- Wirtschaft