Ines Montani
|
4dcfafde02
|
Add line breaks
|
2016-11-24 14:57:37 +01:00 |
|
Ines Montani
|
de747e39e7
|
Reformat language data
|
2016-11-24 13:51:32 +01:00 |
|
Ines Montani
|
dad2c6cae9
|
Strip trailing whitespace
|
2016-11-20 16:45:51 +01:00 |
|
Matthew Honnibal
|
f0917b6808
|
Fix Issue #376: and/or was tagged as a noun.
|
2016-11-04 15:21:28 +01:00 |
|
Matthew Honnibal
|
737816e86e
|
Fix #368: Tokenizer handled pattern 'unicode close quote, period' incorrectly.
|
2016-11-04 15:16:20 +01:00 |
|
Matthew Honnibal
|
41a90a7fbb
|
Add tokenizer exception for 'Ph.D.', to fix 592.
|
2016-11-03 00:03:34 +01:00 |
|
Matthew Honnibal
|
e7414cd064
|
Try to fix weird install glitch.
|
2016-10-23 19:46:28 +02:00 |
|
Matthew Honnibal
|
622b0a9674
|
Tweak download script
|
2016-10-19 00:52:16 +02:00 |
|
Matthew Honnibal
|
edc45c19d6
|
Update download script
|
2016-10-19 00:41:14 +02:00 |
|
Matthew Honnibal
|
8c8f5c62c6
|
Add LANG attribute to English and German
|
2016-10-18 18:52:48 +02:00 |
|
Matthew Honnibal
|
ea23b64cc8
|
Refactor training, with new spacy.train module. Defaults still a little awkward.
|
2016-10-09 12:24:24 +02:00 |
|
Matthew Honnibal
|
7db956133e
|
Move tokenizer data for German into spacy.de.language_data
|
2016-09-25 15:37:33 +02:00 |
|
Matthew Honnibal
|
95aaea0d3f
|
Refactor so that the tokenizer data is read from Python data, rather than from disk
|
2016-09-25 14:49:53 +02:00 |
|
Matthew Honnibal
|
d7e9acdcdf
|
Add English language data, so that the tokenizer doesn't require the data download
|
2016-09-25 14:49:00 +02:00 |
|
Matthew Honnibal
|
fd65cf6cbb
|
Finish refactoring data loading
|
2016-09-24 20:26:17 +02:00 |
|
Henning Peters
|
470cdf5bf9
|
remove deprecated LOCAL_DATA_DIR
|
2016-04-05 11:25:54 +02:00 |
|
Henning Peters
|
a7d7ea3afa
|
first idea for supporting multiple langs in download script
|
2016-03-24 11:19:43 +01:00 |
|
Henning Peters
|
9cc4f8d5b3
|
avoid shadowing __name__
|
2016-02-15 01:33:39 +01:00 |
|
Matthew Honnibal
|
445164d5b4
|
* Restore the LOCAL_DATA_DIR global in spacy/en/__init__.py, although this is now deprecated
|
2016-01-19 02:54:56 +01:00 |
|
Henning Peters
|
5551052840
|
fix py2/3 issue
|
2016-01-16 12:44:53 +01:00 |
|
Henning Peters
|
235f094534
|
untangle data_path/via
|
2016-01-16 12:23:45 +01:00 |
|
Henning Peters
|
211913d689
|
add about.py, adapt setup.py
|
2016-01-15 18:57:01 +01:00 |
|
Henning Peters
|
780cb847c9
|
add default_model to about
|
2016-01-15 18:07:15 +01:00 |
|
Henning Peters
|
788f734513
|
refactored data_dir->via, add zip_safe, add spacy.load()
|
2016-01-15 18:01:02 +01:00 |
|
Henning Peters
|
9b75d872b0
|
fix model download
|
2016-01-14 12:02:56 +01:00 |
|
Matthew Honnibal
|
187960606f
|
* Fix pickle problems
|
2015-12-28 16:54:03 +01:00 |
|
Henning Peters
|
32d655b6e1
|
bump version
|
2015-12-28 09:34:39 +01:00 |
|
Matthew Honnibal
|
8b61d45ed0
|
* Fix merge conflicts for headers branch
|
2015-12-27 17:46:25 +01:00 |
|
Henning Peters
|
0e321a7105
|
get mingw32 to work
|
2015-12-22 23:25:38 +01:00 |
|
Henning Peters
|
8359bd4d93
|
strip data/ from package, friendlier Language invocation, make data_dir backward/forward-compatible
|
2015-12-18 09:52:55 +01:00 |
|
Henning Peters
|
970278a3d6
|
no need to link data dir anymore
|
2015-12-18 09:49:45 +01:00 |
|
Henning Peters
|
2d4efe40f9
|
fix sputnik call
|
2015-12-13 14:46:08 +01:00 |
|
Henning Peters
|
ac318b568c
|
new approach to dependency headers
|
2015-12-13 11:49:17 +01:00 |
|
Henning Peters
|
9027cef3bc
|
access model via sputnik
|
2015-12-07 06:01:28 +01:00 |
|
Henning Peters
|
73e5650be5
|
change index server
|
2015-11-18 18:09:46 +01:00 |
|
Henning Peters
|
50d15ea5d2
|
fix
|
2015-11-18 17:35:21 +01:00 |
|
Henning Peters
|
919a4f0b04
|
change data path, add repository
|
2015-11-18 11:40:46 +01:00 |
|
Henning Peters
|
12de895e60
|
fix version
|
2015-11-15 16:38:16 +01:00 |
|
Henning Peters
|
03d2f98cd5
|
add sputnik
|
2015-11-15 15:58:21 +01:00 |
|
Matthew Honnibal
|
3b74739c3e
|
* Download updated data
|
2015-11-08 21:24:25 +11:00 |
|
Matthew Honnibal
|
ffedff9e6c
|
* Remove the archive after download, to save disk space
|
2015-11-03 18:54:05 +11:00 |
|
Matthew Honnibal
|
ff4fe524ee
|
* Fix exception for python 2
|
2015-10-23 01:56:13 +02:00 |
|
Matthew Honnibal
|
341a3e85cd
|
* Upd downloaded data version
|
2015-10-23 00:56:57 +02:00 |
|
Henning Peters
|
ccffd2ef53
|
fixed extract directory
|
2015-10-21 07:59:34 +02:00 |
|
Henning Peters
|
da4c9cee06
|
assert filename match
|
2015-10-20 19:33:59 +02:00 |
|
Henning Peters
|
4f703f0cb4
|
better error reporting, cleanup
|
2015-10-20 19:11:29 +02:00 |
|
Matthew Honnibal
|
9cdea6e450
|
* Import uget correctly
|
2015-10-19 08:32:41 +02:00 |
|
Henning Peters
|
bfde91fa49
|
add custom download tool (uget), replace wget with uget
|
2015-10-18 12:35:04 +02:00 |
|
Matthew Honnibal
|
e886e6a406
|
* Inc version
|
2015-10-13 13:46:17 +11:00 |
|
Matthew Honnibal
|
a3dfe2b901
|
* Increment data version
|
2015-10-09 13:26:17 +02:00 |
|