Ines Montani
730f759b4f
Merge branch 'master' into spacy.io
2019-03-28 15:26:17 +01:00
Ines Montani
7d033a7b89
Fix met a description in universe projects [ci skip]
2019-03-28 15:26:01 +01:00
Ines Montani
fe2cb642ac
Merge branch 'master' into spacy.io
2019-03-28 15:13:39 +01:00
David
74e738dd4d
adds textpipe to universe ( #3500 ) [ci skip]
...
* Adds textpipe to universe
* signed contributor agreement
* Adjust formatting, code style and use "standalone" category
2019-03-28 15:13:19 +01:00
Ines Montani
04a9fb1a02
Merge branch 'master' into spacy.io
2019-03-28 13:34:46 +01:00
Samuel Kane
06a1846379
fix(util): fix decaying function output ( #3495 )
...
* fix(util): fix decaying function output
* fix(util): better test and adhere to code standards
* fix(util): correct variable name, pytestify test, update website text
2019-03-28 13:24:47 +01:00
Duygu Altinok
5a7bc6b39d
Fix/irreg adverbs extension ( #3499 )
...
* extended list of irreg adverbs
* added test to exceptions
* fixed typo
2019-03-28 13:23:33 +01:00
Bharat Raghunathan
1db3e47509
DOC: Update tokenizer docs to include default value for batch_size in pipe ( #3492 )
2019-03-28 12:48:02 +01:00
Ines Montani
2ed16d82bf
Fix social image
2019-03-26 18:27:40 +01:00
Matthew Honnibal
f77bf2bdb1
Fix GPU training for textcat. Closes #3473
2019-03-26 13:36:11 +01:00
Sofie
a4a6bfa4e1
Merge branch 'master' into feature/el-framework
2019-03-26 11:00:02 +01:00
svlandeg
8814b9010d
entity as one field instead of both ID and name
2019-03-25 18:10:41 +01:00
Ines Montani
9e14b2b69f
Add Estonian to docs [ci skip] ( closes #3482 )
2019-03-25 18:01:54 +01:00
Wannaphong Phatthiyaphaibun
297a051992
Update Thai tag map ( #3480 )
...
* Update Thai tag map
Update Thai tag map
* Create wannaphongcom.md
2019-03-25 16:53:26 +01:00
Ines Montani
21ade53ef7
Merge branch 'master' into spacy.io
2019-03-25 13:05:00 +01:00
Ines Montani
db938ab0e3
Update favicon ( closes #3475 ) [ci skip]
2019-03-25 13:04:47 +01:00
Ines Montani
c8c1baaea8
Update binderVersion
2019-03-25 12:17:03 +01:00
Matthew Honnibal
85dcd9477e
Set version to v2.1.3
2019-03-23 16:47:57 +01:00
Matthew Honnibal
f436efd8a4
Small tweak to ensemble textcat model
2019-03-23 16:47:26 +01:00
Ines Montani
200d8bdb3c
Merge branch 'spacy.io' [ci skip]
2019-03-23 16:46:34 +01:00
Ines Montani
1e5b917d75
Fix formatting [ci skip]
2019-03-23 16:45:50 +01:00
Matthew Honnibal
6c783f8045
Bug fixes and options for TextCategorizer ( #3472 )
...
* Fix code for bag-of-words feature extraction
The _ml.py module had a redundant copy of a function to extract unigram
bag-of-words features, except one had a bug that set values to 0.
Another function allowed extraction of bigram features. Replace all three
with a new function that supports arbitrary ngram sizes and also allows
control of which attribute is used (e.g. ORTH, LOWER, etc).
* Support 'bow' architecture for TextCategorizer
This allows efficient ngram bag-of-words models, which are better when
the classifier needs to run quickly, especially when the texts are long.
Pass architecture="bow" to use it. The extra arguments ngram_size and
attr are also available, e.g. ngram_size=2 means unigram and bigram
features will be extracted.
* Fix size limits in train_textcat example
* Explain architectures better in docs
2019-03-23 16:44:44 +01:00
Ines Montani
5944cf10c7
Add blog post to v2.1 page
2019-03-23 16:34:23 +01:00
Ines Montani
ffebdad08d
Add cheat sheet to spaCy 101
2019-03-23 16:32:55 +01:00
Ines Montani
06bf130890
💫 Add better and serializable sentencizer ( #3471 )
...
* Add better serializable sentencizer component
* Replace default factory
* Add tests
* Tidy up
* Pass test
* Update docs
2019-03-23 15:45:02 +01:00
Matthew Honnibal
d9a07a7f6e
💫 Fix class mismap on parser deserializing ( closes #3433 ) ( #3470 )
...
v2.1 introduced a regression when deserializing the parser after
parser.add_label() had been called. The code around the class mapping is
pretty confusing currently, as it was written to accommodate backwards
model compatibility. It needs to be revised when the models are next
retrained.
Closes #3433
2019-03-23 13:46:25 +01:00
Matthew Honnibal
444a3abfe5
Add xfail test for #3433 . Improve test for add label.
2019-03-23 12:36:00 +01:00
Ines Montani
6b6e9b638e
Fix test for #3468
2019-03-23 11:24:29 +01:00
Ines Montani
fbec72b4c3
Slightly modify test for #3468
...
Check for Token.is_sent_start first (which is serialized/deserialized correctly)
2019-03-23 11:22:44 +01:00
Ines Montani
02d9378d8c
Add xfailing test for #3468
2019-03-23 11:19:11 +01:00
Ines Montani
ed91592726
Merge branch 'master' into spacy.io
2019-03-22 19:02:26 +01:00
Ines Montani
dcd6e06c47
Improve landing example [ci skip]
2019-03-22 19:02:15 +01:00
Ines Montani
c2bb39dcb4
Merge branch 'master' into spacy.io
2019-03-22 18:50:16 +01:00
Ines Montani
a841324034
Update landing example [ci skip]
2019-03-22 18:50:00 +01:00
Ines Montani
a9ad735241
Merge branch 'master' into spacy.io
2019-03-22 18:36:28 +01:00
Ines Montani
b532386a60
Fix typo [ci skip]
2019-03-22 18:36:17 +01:00
Ines Montani
7b5496027b
Merge branch 'master' into spacy.io
2019-03-22 18:21:16 +01:00
Ines Montani
d8533f0149
Update Binder [ci skip]
2019-03-22 18:16:46 +01:00
svlandeg
46f4eb5db3
error and warning messages
2019-03-22 16:55:05 +01:00
svlandeg
9de9900510
adding future import unicode literals to .py files
2019-03-22 16:18:04 +01:00
svlandeg
b4cd5d5ee9
property annotations for fields with only a getter
2019-03-22 16:10:49 +01:00
Matthew Honnibal
4c5f265884
Fix train loop for train_textcat example
2019-03-22 16:10:11 +01:00
Ines Montani
680eafab94
Merge branch 'master' into spacy.io
2019-03-22 15:17:51 +01:00
Christos Aridas
9cee3f702a
Add missing space in landing page ( #3462 ) [ci skip]
2019-03-22 15:17:35 +01:00
Ines Montani
5073ce63fd
Merge branch 'spacy.io' [ci skip]
2019-03-22 15:17:11 +01:00
svlandeg
9751312aff
specify unicode strings for python 2.7
2019-03-22 14:15:18 +01:00
svlandeg
5318ce88fa
'entity_linker' instead of 'el'
2019-03-22 13:55:10 +01:00
svlandeg
ec3e860b44
Merge remote-tracking branch 'upstream/master' into feature/el-framework
2019-03-22 13:47:08 +01:00
Ines Montani
c9bd0e5a96
Set version to 2.1.2
2019-03-22 13:44:47 +01:00
svlandeg
12d4caf341
Merge remote-tracking branch 'upstream/master' into feature/el-framework
2019-03-22 13:44:36 +01:00