Alex Villarreal
bd35bf7f09
Guidance to handle binary files in git in Windows ( #2526 )
...
Adds guidance on what to do if users encounter the error described in [1634](https://github.com/explosion/spaCy/issues/1634 ), which probably only happens in Windows environments.
2018-07-09 18:31:37 +02:00
ines
fd6207426a
Merge branch 'master' into develop
2018-07-09 18:05:10 +02:00
Duygu Altinok
00b9a58558
German lemmatizer additions ( #2529 )
...
* lemma of was-> was
* added new pairs issue @2486
* added article tests
2018-07-09 11:10:15 +02:00
Ole Henrik Skogstrøm
c21efea9bb
Add sent property to token ( #2521 )
...
* Add sent property to token
* Refactored and cleaned up copy paste errors.
2018-07-06 15:54:15 +02:00
ines
38e07ade4c
Add test for custom tokenizer serialization ( resolves #2494 )
2018-07-06 12:40:51 +02:00
ines
c2581f9172
Tidy up tokenizer test
2018-07-06 12:40:28 +02:00
Matthew Honnibal
43dcaa473e
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
2018-07-06 12:36:42 +02:00
Matthew Honnibal
6c8d627733
Fix tokenizer deserialization
2018-07-06 12:36:33 +02:00
ines
c001d46153
Tidy up
2018-07-06 12:33:42 +02:00
Matthew Honnibal
63f5651f8d
Fix tokenizer serialization
2018-07-06 12:32:11 +02:00
Matthew Honnibal
e1569fda4e
Fix compile error in matcher
2018-07-06 12:29:23 +02:00
Matthew Honnibal
f5b2076700
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
2018-07-06 12:23:14 +02:00
Matthew Honnibal
1a2f61725c
Fix tokenizer serialization
2018-07-06 12:23:04 +02:00
ines
9e09477b2f
Remove unused import
2018-07-06 12:18:17 +02:00
ines
26f04a6ac3
Fix Matcher tests and add test for any token with operator
2018-07-06 12:17:50 +02:00
Matthew Honnibal
f5703b7a91
Clean up unused stuff in matcher
2018-07-06 12:16:44 +02:00
Matthew Honnibal
08c362d541
Suppress compiler warning about unreachable code
2018-07-06 11:31:22 +02:00
Matthew Honnibal
8ae1bec8bf
Fix init_model
2018-07-05 14:02:06 +02:00
Matthew Honnibal
7b09a4ca49
Fix lemmatization
2018-07-05 13:56:02 +02:00
Matthew Honnibal
ec41ceb383
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
2018-07-05 13:49:42 +02:00
Matthew Honnibal
4eb3405df7
Fix lemmatizer ordering, re Issue #1387
2018-07-05 13:49:29 +02:00
ines
f575b01595
Update language and license meta [ci skip]
2018-07-04 15:09:36 +02:00
ines
83e88553d4
Add Python 3.7 to setup.py ( resolves #2505 )
2018-07-04 14:53:51 +02:00
ines
63666af328
Merge branch 'master' into develop
2018-07-04 14:52:25 +02:00
ines
8feb7cfe2d
Remove model dependency from French lemmatizer tests
2018-07-04 14:46:45 +02:00
kleinay
a82c3153ad
fix issue #2452 - displacy arrow direction is always forward ( #2506 ) ( closes #2452 )
...
<!--- Provide a general summary of your changes in the title. -->
Referring #2452 , fixing displacy arrow directions to match the input.
## Description
The fix is simply replacing `direction is 'left'` with `direction == 'left'` to include the case `direction` is a `str` and not a `unicode`.
### Types of change
bug fix
## Checklist
<!--- Before you submit the PR, go over this checklist and make sure you can
tick off all the boxes. [] -> [x] -->
- [ ] I have submitted the spaCy Contributor Agreement.
- [ ] I ran the tests, and all new and existing tests passed.
- [x] My changes don't require a change to the documentation, or if they do, I've added all required information.
2018-07-04 14:12:08 +02:00
Matthew Honnibal
a0db2e0077
Build pytest into spaCy pex, for ease of testing
2018-07-04 13:22:58 +02:00
Bùi Trung Chí
9af46b4f1b
Fix loading tokenizer with custom prefix search ( #2495 )
...
* Add contributor agreement
* Fix loading tokenizer with cutom prefix search
2018-07-04 12:56:07 +02:00
Matthew Honnibal
dee8bdb900
Fix init-model for npz vectors
2018-07-04 02:29:48 +02:00
Matthew Honnibal
59d655e8d0
Fix model init from jsonl
2018-07-04 01:30:40 +02:00
Matthew Honnibal
1e38bea6e9
Save vectors init
2018-07-03 23:55:04 +02:00
Matthew Honnibal
6692833887
Fix init_model
2018-07-03 23:24:11 +02:00
Matthew Honnibal
4a38a26cb5
Fix init_model
2018-07-03 22:57:11 +02:00
Matthew Honnibal
019d09e3c3
Fix init model
2018-07-03 22:16:44 +02:00
Matthew Honnibal
2543f8c93a
Support .npz vectors in init-model command
2018-07-03 21:42:16 +02:00
Matthew Honnibal
86aad11939
Fix init_model arg
2018-07-03 17:00:42 +02:00
Matthew Honnibal
eff42d36e3
Fix init model command
2018-07-03 16:32:23 +02:00
Matthew Honnibal
97487122ea
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
2018-07-03 15:44:37 +02:00
Matthew Honnibal
6a89faf12e
Add support for jsonl-formatted lexical attributes to init-model command.
2018-07-03 12:22:56 +02:00
Matthew Honnibal
a85620a731
Note CoreNLP tokenizer correction on website
2018-07-02 11:35:31 +02:00
Matthew Honnibal
3c3020fccc
Update Makefile
2018-06-29 21:21:30 +02:00
Matthew Honnibal
2ec2192000
Revert #1389 : Don't overrule rules when lemma exception is present
2018-06-29 19:43:02 +02:00
Matthew Honnibal
01ace9734d
Make pipeline work on empty docs
2018-06-29 19:21:38 +02:00
Matthew Honnibal
d0f9f13543
Update Makefile
2018-06-29 19:01:44 +02:00
Matthew Honnibal
a1b05048d0
Fix tagger when doc is empty
2018-06-29 16:05:40 +02:00
Matthew Honnibal
3786942ff1
Fix tagger when docs are empty
2018-06-29 15:13:45 +02:00
ines
526be40823
Add test for 46d8a66
2018-06-29 14:33:12 +02:00
ines
f08c871adf
Fix typo in Language.from_disk
2018-06-29 14:32:16 +02:00
Matthew Honnibal
46d8a66fef
Fix tokenizer serialization if token_match is None
2018-06-29 14:24:46 +02:00
Matthew Honnibal
e0860bcfb3
Fix bug when docs are empty
2018-06-29 13:56:29 +02:00