Commit Graph

5252 Commits

Author SHA1 Message Date
ines
e6f850f014 Add pip to requirements.txt and setup.py (resolves #1064) 2017-05-16 14:45:15 +02:00
ines
9003fd25e5 Fix error messages if model is required (resolves #1051)
Rename about.__docs__ to about.__docs_models__.
2017-05-13 13:14:02 +02:00
ines
24e973b17f Rename about.__docs__ to about.__docs_models__ 2017-05-13 13:09:00 +02:00
ines
6e1dbc608e Fix parse_tree test 2017-05-13 12:34:20 +02:00
ines
573f0ba867 Replace deepcopy 2017-05-13 12:34:14 +02:00
ines
bd428c0a70 Set defaults for light and flat kwargs 2017-05-13 12:34:05 +02:00
ines
c5669450a0 Fix formatting 2017-05-13 12:33:57 +02:00
Matthew Honnibal
ad590feaa8 Fix test, which imported English incorrectly 2017-05-13 11:36:19 +02:00
ines
e506811a93 Update description 2017-05-13 03:27:50 +02:00
Ines Montani
9e292e822e Merge pull request #1047 from pasupulaphani/patch-1
Update _data.json
2017-05-13 03:24:38 +02:00
Ines Montani
8d742ac8ff Merge pull request #1055 from recognai/master
Enable pruning out rare words from clusters data
2017-05-13 03:22:56 +02:00
Matthew Honnibal
12158cc06f Merge branch 'kengz-master' 2017-05-13 03:19:18 +02:00
Matthew Honnibal
b2540d2379 Merge Kengz's tree_print patch 2017-05-13 03:18:49 +02:00
oeg
cdaefae60a feature(populate_vocab): Enable pruning out rare words from clusters data 2017-05-12 16:15:19 +02:00
Phaninder Pasupula
953f638aa5 Update _data.json 2017-05-08 11:48:05 +01:00
ines
76ebd0fe5c Update CONTRIBUTING.md 2017-05-07 18:37:36 +02:00
ines
229b8c3974 Tidy up 2017-05-07 18:36:35 +02:00
ines
a793174ae9 Use setuptools.find_packages() 2017-05-03 20:11:02 +02:00
ines
fac3566aac Add descriptions to POS tagging scheme 2017-05-03 20:11:02 +02:00
ines
1570b83ee5 Add spacy.explain() note to NER annotation scheme 2017-05-03 20:11:02 +02:00
ines
219369bb7d Add detailed docs for dependency label annotations 2017-05-03 20:11:02 +02:00
ines
0de98472b3 Increment CSS version 2017-05-03 20:11:02 +02:00
ines
7631d08d67 Adjust saturation of light theme color 2017-05-03 20:11:02 +02:00
ines
06e414b3fc Don't wrap inline code 2017-05-03 20:11:02 +02:00
ines
41c6085a6c Add pos-row and dep-row mixins to global mixins 2017-05-03 20:11:02 +02:00
ines
b1f22c5a10 Fix formatting 2017-05-03 20:11:02 +02:00
Matthew Honnibal
26ac77517f Merge pull request #1039 from akYoung/akYoung-patch-1-1
Corretions for model test example
2017-05-03 18:47:54 +02:00
ines
a04b5be1b2 Add glossary for annotation scheme (closes #1034)
Can be imported as explain from spacy.glossary, or called as
spacy.explain(term)
2017-05-03 17:02:17 +02:00
akYoung
c158cdb1da Corretions for model test example
The sentences of test data in sentence entailment example should be generated with integers limited to vocab_size.
2017-05-03 22:41:23 +08:00
Ines Montani
6e1fad92a1 Update CONTRIBUTORS.md 2017-05-03 10:01:40 +02:00
ines
e2380d8789 Update README.rst 2017-05-03 10:00:04 +02:00
ines
f9384b0fbd Update alpha languages and add aside for tokenizer dependencies 2017-05-03 09:58:31 +02:00
Ines Montani
f0d7a87e18 Merge pull request #1035 from uetchy/japanese-support
Japanese support
2017-05-03 09:44:54 +02:00
Ines Montani
3ea23a3f4d Fix formatting 2017-05-03 09:44:38 +02:00
Ines Montani
d730eb0c0d Raise custom ImportError if importing janome fails 2017-05-03 09:43:29 +02:00
Ines Montani
949ad6594b Add newline 2017-05-03 09:38:43 +02:00
Ines Montani
d12ca587ea Add newline 2017-05-03 09:38:29 +02:00
Ines Montani
8676cd0135 Add newline 2017-05-03 09:38:07 +02:00
Yasuaki Uechi
0e7a9b9fac Add Japanese to 'Alpha support’ section 2017-05-03 13:56:45 +09:00
Yasuaki Uechi
c8f83aeb87 Add basic japanese support 2017-05-03 13:56:21 +09:00
Ines Montani
f26a3b5a50 Merge pull request #1025 from Ferdous-Al-Imran/master 2017-04-27 14:36:37 +02:00
Ines Montani
fb96f88b59 Update info on CoNLL format and include link 2017-04-27 14:36:08 +02:00
Matthew Honnibal
31ec9e1371 Merge branch 'master' of https://github.com/explosion/spaCy 2017-04-27 13:21:39 +02:00
Matthew Honnibal
2da16adcc2 Add dropout optin for parser and NER
Dropout can now be specified in the `Parser.update()` method via
the `drop` keyword argument, e.g.

    nlp.entity.update(doc, gold, drop=0.4)

This will randomly drop 40% of features, and multiply the value of the
others by 1. / 0.4. This may be useful for generalising from small data
sets.

This commit also patches the examples/training/train_new_entity_type.py
example, to use dropout and fix the output (previously it did not output
the learned entity).
2017-04-27 13:18:39 +02:00
M. Z. Ferdous (Imran)
c9f9203d5f fix typo, CONLL format
tried to google about connlu format. Saw there is conll format, not connlu.
2017-04-27 16:48:54 +06:00
ines
5aa49971f9 Add French example to models docs 2017-04-27 12:08:47 +02:00
Ines Montani
7a894c9ef0 Update README.rst 2017-04-27 11:25:30 +02:00
ines
034ec5710b Fix typo and add Norwegian to alpha languages 2017-04-27 11:24:21 +02:00
Ines Montani
2f918e3004 Update README.rst 2017-04-27 11:18:41 +02:00
Ines Montani
bc88f9865e Remove file (already covered in PR) 2017-04-27 11:17:30 +02:00