ines
|
1da29a7146
|
Use new Lemmatizer data and remove file import
Since there's currently only an English lemmatizer, the global
Lemmatizer imports from spacy.en. This is unideal and still needs to be
fixed.
|
2017-03-12 13:58:22 +01:00 |
|
ines
|
c89e30d1a3
|
Add test for English time exceptions ("1a.m." etc.)
|
2017-03-12 13:58:22 +01:00 |
|
ines
|
66c1f194f9
|
Use consistent unicode declarations
|
2017-03-12 13:07:28 +01:00 |
|
Matthew Honnibal
|
ea2592879f
|
Merge branch 'master' of https://github.com/explosion/spaCy
|
2017-03-11 11:13:37 -06:00 |
|
ines
|
10e29189ac
|
Adjust URL testcases and xfail problems (instead of comment)
|
2017-03-10 14:22:50 +01:00 |
|
Matthew Honnibal
|
ea53647362
|
Merge branch 'develop'
|
2017-03-10 02:49:39 -06:00 |
|
Dan Rapp
|
123d3f2d38
|
Fix error in test case parameterization
|
2017-03-09 12:18:21 -07:00 |
|
Dan Rapp
|
b9307dfcd7
|
Merge branch 'master' into rappdw/tokenizer_exceptions_url_fix
|
2017-03-09 11:42:14 -07:00 |
|
Dan Rapp
|
3b1df3808d
|
Issue #840 - URL pattenr too broad
|
2017-03-09 11:39:39 -07:00 |
|
Matthew Honnibal
|
5b0b968d13
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-03-08 15:03:10 +01:00 |
|
Matthew Honnibal
|
0ac3d27689
|
Fix handling of trailing whitespace
Fix off-by-one error that meant trailing spaces were being dropped.
Closes #792
|
2017-03-08 15:01:40 +01:00 |
|
ines
|
c2e3e651b8
|
Re-add regression test for #859
|
2017-03-08 14:36:09 +01:00 |
|
Matthew Honnibal
|
16670d3251
|
Xfail the vocab pickling for now
|
2017-03-07 21:43:28 +01:00 |
|
Matthew Honnibal
|
a89c3500f6
|
Fixes to hacky vocab pickling
|
2017-03-07 20:58:55 +01:00 |
|
Matthew Honnibal
|
3edb8ae207
|
Whitespace
|
2017-03-07 17:16:26 +01:00 |
|
Matthew Honnibal
|
5de7e712b7
|
Add support for pickling StringStore.
|
2017-03-07 17:15:18 +01:00 |
|
Matthew Honnibal
|
4e75e74247
|
Update regression test for variable-length pattern problem in the matcher.
|
2017-03-07 16:08:32 +01:00 |
|
Matthew Honnibal
|
6d67213b80
|
Add test for 850: Matcher fails on zero-or-more.
|
2017-03-07 15:55:28 +01:00 |
|
Aniruddha Adhikary
|
696215a3fb
|
add tests for Bengali
|
2017-03-05 11:25:12 +06:00 |
|
ines
|
8dff040032
|
Revert "Add regression test for #859"
This reverts commit c4f16c66d1 .
|
2017-03-01 21:56:20 +01:00 |
|
ines
|
c4f16c66d1
|
Add regression test for #859
|
2017-03-01 16:07:27 +01:00 |
|
Matthew Honnibal
|
34bcc8706d
|
Merge branch 'french-tokenizer-exceptions'
|
2017-02-27 11:21:21 +01:00 |
|
Matthew Honnibal
|
0aaa546435
|
Fix test after updating the French tokenizer stuff
|
2017-02-27 11:20:47 +01:00 |
|
ines
|
376c5813a7
|
Remove print statements from test
|
2017-02-24 18:26:32 +01:00 |
|
ines
|
7c1260e98c
|
Add regression test
|
2017-02-24 18:22:49 +01:00 |
|
ines
|
51eb190ef4
|
Remove print statements from test
|
2017-02-24 17:41:12 +01:00 |
|
Matthew Honnibal
|
db5ada3995
|
Merge branch 'master' of https://github.com/explosion/spaCy
|
2017-02-24 14:28:12 +01:00 |
|
Matthew Honnibal
|
8f94897d07
|
Add 1 operator to matcher, and make sure open patterns are closed at end of document. Closes Issue #766
|
2017-02-24 14:27:02 +01:00 |
|
ines
|
67991b6e5f
|
Add more test cases to #775 regression test to cover #847
|
2017-02-18 14:10:44 +01:00 |
|
ines
|
44de3c7642
|
Reformat test and use text_file fixture
|
2017-02-16 23:49:19 +01:00 |
|
ines
|
3dd22e9c88
|
Mark vectors test as xfail (temporary)
|
2017-02-16 23:28:51 +01:00 |
|
ines
|
85d249d451
|
Revert "Revert "Merge pull request #836 from raphael0202/load_vectors (closes #834)""
This reverts commit ea05f78660 .
|
2017-02-16 23:26:25 +01:00 |
|
ines
|
ea05f78660
|
Revert "Merge pull request #836 from raphael0202/load_vectors (closes #834)"
This reverts commit 7d8c9eee7f , reversing
changes made to f6b69babcc .
|
2017-02-16 15:27:12 +01:00 |
|
Raphaël Bournhonesque
|
06a71d22df
|
Fix test failure by using unicode literals
|
2017-02-16 14:48:00 +01:00 |
|
Raphaël Bournhonesque
|
3ba109622c
|
Add regression test with non ' ' space character as token
|
2017-02-16 12:23:27 +01:00 |
|
ines
|
21f09d10d7
|
Revert "Revert "Merge pull request #818 from raphael0202/tokenizer_exceptions""
This reverts commit f02a2f9322 .
|
2017-02-10 13:17:05 +01:00 |
|
ines
|
f02a2f9322
|
Revert "Merge pull request #818 from raphael0202/tokenizer_exceptions"
This reverts commit b95afdf39c , reversing
changes made to b0ccf32378 .
|
2017-02-09 17:07:21 +01:00 |
|
Raphaël Bournhonesque
|
309da78bf0
|
Merge branch 'master' into tokenizer_exceptions
|
2017-02-09 16:32:12 +01:00 |
|
Raphaël Bournhonesque
|
4ce0bbc6b6
|
Update unit tests
|
2017-02-09 16:30:43 +01:00 |
|
ines
|
654fe447b1
|
Add Swedish tokenizer tests (see #807)
|
2017-02-05 11:47:07 +01:00 |
|
Michael Wallin
|
35100c8bdd
|
[issue 805] Add regression test and the required fixture
|
2017-02-04 16:21:34 +02:00 |
|
Michael Wallin
|
1a1952afa5
|
[finnish] Add initial tests for tokenizer
|
2017-02-04 13:54:10 +02:00 |
|
Ines Montani
|
afc6365388
|
Update regression test for #801 to match current expected behaviour
|
2017-02-02 16:23:05 +01:00 |
|
Ines Montani
|
13a4ab37e0
|
Add regression test for #801
|
2017-02-02 15:33:52 +01:00 |
|
Raphaël Bournhonesque
|
85f951ca99
|
Add tokenizer exceptions for French
|
2017-02-02 08:36:16 +01:00 |
|
Ines Montani
|
e4875834fe
|
Fix formatting
|
2017-01-31 15:19:33 +01:00 |
|
Ines Montani
|
c304834e45
|
Add missing import
|
2017-01-31 15:18:30 +01:00 |
|
Ines Montani
|
e6465b9ca3
|
Parametrize test cases and mark as xfail
|
2017-01-31 15:14:42 +01:00 |
|
latkins
|
e4c84321a5
|
Added regression test for Issue #792.
|
2017-01-31 13:47:42 +00:00 |
|
Ines Montani
|
19501f3340
|
Add regression test for #775
|
2017-01-25 13:16:52 +01:00 |
|