KB Research enrichment infrastructure
The KB Research enrichment infrastructure consists of an enrichment database and a number of services
, some of which can be used by external applications.
Some results of our current enrichment processes, i.e. links to related information in Dutch historical newspapers and radio broadcasts, are demonstrated by means of a very basic experimental portal at http://www.kbresearch.nl/xportal
. These enrichments are shown in the left upper corner.
Enrichments have the same the same identifier as the objects they enrich and can also be requested via an URL, for example:http://tomcat.kbresearch.nl/links/ir?id=http://resolver.kb.nl/resolve?urn=ddd:010135660:mpeg21:a0026:ocr
There are two basic types of resources that can be enriched:
Non-information resources (NIR)
These are resources that can have more than one description at different locations like things, concepts, persons etc. Only the description, not the resource itself, can be requested via an URL.
Information resources (IR)
These are digital content objects like newspaper articles, book pages etc. They can be requested and presented by means of a URL.
There are many different types of enrichments possible for the information resources. In some cases these enrichments refer to only fragment of the content. Some examples:
Related web pages, videos etc.(see video example )
Links to resource descriptions for named entities in text objects (see video example )
Geographical coordinates of a location or an event in the text
Place and street name where an event took place
Extracted features from a text like sentiment
Different types of enrichments require different data fields. The current overview is given at enrichment_fields.html
. This list may change over time, when new types of enrichments are introduced. The algorithms used in enrichments are listed at references.html