Scrape Tatort episode descriptions from http://www.tatort.de/ and turn it into RDF.

Marcus Rohrmoser 9944353a0d homepage url 6 months ago
bin 1e1f051386 ignore duplicate „903 Kopfgeld“. fixes #2 3 years ago
htdocs a935e98de0 detab 4 years ago
.gitignore 1801d5aec9 workaround missing episode ‚857 Scheinwelten‘ (on website) 3 years ago
LICENSE.txt 7ca86b219e typo 4 years ago
README.md 39bdc340c4 use text() context to find production company. fixes #1 4 years ago
doap.rdf 9944353a0d homepage url 6 months ago

README.md

Scrape Tatort episode descriptions from the website and turn it into RDF.

Usage

Just run

$ sh bin/run.sh

and find 3 rdf files in htdocs.

License

see LICENSE.txt

Preliminaries

No fancy script language, no database, no json.

Just xsltproc rapper and roqet

OS X

$ brew install raptor rasqal

Debian

$ sudo apt-get install make xsltproc raptor2-utils rasqal-utils

Example SPARQL queries

Movies per kommissar

$ roqet http://tatort.rdf.mro.name/examples/kommissare.rq --results csv

Actor appearances with character name

Expect loooong execution time (several minutes).

$ roqet http://tatort.rdf.mro.name/examples/actor.rq --results csv