svlandeg
|
694fea597a
|
dumping all entryC entries + (inefficient) reading back in
|
2019-04-23 18:36:50 +02:00 |
|
svlandeg
|
8e70a564f1
|
custom reader and writer for _EntryC fields (first stab at it - not complete)
|
2019-04-23 16:33:40 +02:00 |
|
svlandeg
|
004e5e7d1c
|
little fixes
|
2019-04-19 14:24:02 +02:00 |
|
svlandeg
|
9a8197185b
|
fix alias capitalization
|
2019-04-18 22:37:50 +02:00 |
|
svlandeg
|
9f308eb5dc
|
fixes for prior prob and linking wikidata IDs with wikipedia titles
|
2019-04-18 16:14:25 +02:00 |
|
svlandeg
|
10ee8dfea2
|
poc with few entities and collecting aliases from the WP links
|
2019-04-18 14:12:17 +02:00 |
|
svlandeg
|
6763e025e1
|
parse wp dump for links to determine prior probabilities
|
2019-04-15 11:41:57 +02:00 |
|
svlandeg
|
3163331b1e
|
wikipedia dump parser and mediawiki format regex cleanup
|
2019-04-14 21:52:01 +02:00 |
|
svlandeg
|
b31a390a9a
|
reading types, claims and sitelinks
|
2019-04-11 21:42:44 +02:00 |
|
svlandeg
|
6e997be4b4
|
reading wikidata descriptions and aliases
|
2019-04-11 21:08:22 +02:00 |
|