Stefan Bunk
2bf19d4735
Fix error in pipeline loading documentation
...
The cell for the `vocab` parameter is not displayed, making it seem as if the explanation belongs to the previous param.
2017-02-10 12:06:55 +01:00
ines
f02a2f9322
Revert "Merge pull request #818 from raphael0202/tokenizer_exceptions"
...
This reverts commit b95afdf39c
, reversing
changes made to b0ccf32378
.
2017-02-09 17:07:21 +01:00
Ines Montani
b95afdf39c
Merge pull request #818 from raphael0202/tokenizer_exceptions
...
Add tokenizer exceptions for French
2017-02-09 16:41:21 +01:00
Raphaël Bournhonesque
309da78bf0
Merge branch 'master' into tokenizer_exceptions
2017-02-09 16:32:12 +01:00
Raphaël Bournhonesque
4ce0bbc6b6
Update unit tests
2017-02-09 16:30:43 +01:00
Raphaël Bournhonesque
5d706ab95d
Merge tokenizer exceptions from PR #802
2017-02-09 16:30:28 +01:00
Ines Montani
b0ccf32378
Update CONTRIBUTING.md
2017-02-09 16:27:31 +01:00
ines
1b8719bf9a
Adjust formatting and increment version
2017-02-08 21:33:22 +01:00
Ines Montani
c63bf3fc94
Merge pull request #814 from wehlutyk/website-nav-hysteresis
...
Make the website nav header's hysteresis a bit more robust
2017-02-08 21:30:34 +01:00
Sébastien Lerique
e1f87858ad
Make the website nav header's hysteresis a bit more robust
...
In particular, this prevents the nav header from reappearing all the
time while scrolling down on Firefox.
2017-02-08 15:08:33 +01:00
Ines Montani
8ac741c217
Merge pull request #811 from knub/patch-1
...
Fix error in matching documentation
2017-02-07 16:57:02 +01:00
Stefan Bunk
e972b2fa87
Fix error in matching documentation
...
LOWER and IS_PUNCT are members of `spacy` and not of the `Matcher` class.
2017-02-07 16:52:01 +01:00
ines
654fe447b1
Add Swedish tokenizer tests (see #807 )
2017-02-05 11:47:07 +01:00
ines
6715615d55
Add missing EXC variable and combine tokenizer exceptions
2017-02-05 11:42:52 +01:00
Ines Montani
30a52d576b
Merge pull request #807 from magnusburton/master
...
Added swedish lemma rules and more verb contractions
2017-02-05 11:34:19 +01:00
Matthew Honnibal
9aaa2c5633
Fix entity recognition example ( closes #803 )
2017-02-05 11:23:12 +01:00
Magnus Burton
19c0ce745a
Added swedish lemma rules
2017-02-04 17:53:32 +01:00
Ines Montani
cf529f4774
Merge pull request #806 from wallinm1/fix/swedish-tokenizer-exceptions
...
Fix issue #805
2017-02-04 17:40:40 +01:00
Michael Wallin
d25556bf80
[issue 805] Fix issue
2017-02-04 16:22:21 +02:00
Michael Wallin
35100c8bdd
[issue 805] Add regression test and the required fixture
2017-02-04 16:21:34 +02:00
ines
a44da8fb34
Update language models and alpha support overview
2017-02-04 13:49:05 +01:00
Ines Montani
708cd37a2e
Update README.rst
2017-02-04 13:42:46 +01:00
Ines Montani
ff91be6d17
Update CONTRIBUTORS.md
2017-02-04 13:41:21 +01:00
ines
0ab353b0ca
Add line breaks to Finnish stop words for better readability
2017-02-04 13:40:25 +01:00
Ines Montani
3431e7b86f
Merge pull request #804 from wallinm1/finnish-alpha-support
...
Alpha support for Finnish
2017-02-04 13:37:08 +01:00
Michael Wallin
55b1e5e682
[finnish] Add contributor file
2017-02-04 13:54:10 +02:00
Michael Wallin
1a1952afa5
[finnish] Add initial tests for tokenizer
2017-02-04 13:54:10 +02:00
Michael Wallin
f9bb25d1cf
[finnish] Reformat and correct stop words
2017-02-04 13:54:10 +02:00
Michael Wallin
73f66ec570
Add preliminary support for Finnish
2017-02-04 13:54:10 +02:00
Ines Montani
932aaba7de
Update CONTRIBUTORS.md
2017-02-03 10:55:42 +01:00
Ines Montani
65d6202107
Merge pull request #802 from Tpt/fr-tokenizer
...
Adds more French tokenizer exceptions
2017-02-03 10:52:20 +01:00
Tpt
75a74857bb
Adds more French tokenizer exceptions
2017-02-03 13:45:18 +04:00
Ines Montani
afc6365388
Update regression test for #801 to match current expected behaviour
2017-02-02 16:23:05 +01:00
Ines Montani
012f4820cb
Keep infixes of punctuation + hyphens as one token (see #801 )
2017-02-02 16:22:40 +01:00
Ines Montani
1219a5f513
Add = to tokenizer prefixes
2017-02-02 16:21:11 +01:00
Ines Montani
ff04748eb6
Add missing emoticon
2017-02-02 16:21:00 +01:00
Ines Montani
13a4ab37e0
Add regression test for #801
2017-02-02 15:33:52 +01:00
Raphaël Bournhonesque
85f951ca99
Add tokenizer exceptions for French
2017-02-02 08:36:16 +01:00
Matthew Honnibal
16ce7409e4
Merge branch 'master' of https://github.com/explosion/spaCy
2017-01-31 13:27:34 -06:00
Matthew Honnibal
80aa4e114b
Fix x keras deep learning example
2017-01-31 13:27:13 -06:00
Ines Montani
ad0e4e4532
Merge pull request #794 from ematvey/count_by_doc_update
...
Small `Doc.count_by` documentation update
2017-01-31 20:11:47 +01:00
Matvey Ezhov
32a22291bc
Small Doc.count_by
documentation update
...
Current example doesn't work
2017-01-31 19:18:45 +03:00
Ines Montani
e4875834fe
Fix formatting
2017-01-31 15:19:33 +01:00
Ines Montani
c304834e45
Add missing import
2017-01-31 15:18:30 +01:00
Ines Montani
626ac282fe
Merge pull request #793 from latkins/master
...
Added regression test for Issue #792 .
2017-01-31 15:16:23 +01:00
Ines Montani
e6465b9ca3
Parametrize test cases and mark as xfail
2017-01-31 15:14:42 +01:00
latkins
e4c84321a5
Added regression test for Issue #792 .
2017-01-31 13:47:42 +00:00
Matthew Honnibal
6c665b81df
Fix redundant == TAG in from_array conditional
2017-01-31 00:46:21 +11:00
Matthew Honnibal
3ea0df6ba7
Merge pull request #782 from raphael0202/dep_version
...
Specify version number for ujson and plac
2017-01-29 05:32:45 +11:00
Raphaël Bournhonesque
0c2e5539ce
Specify version number for ujson and plac
...
The required version was specified for plac in requirements.txt but not in setup.py, which could cause a conflicting version error.
Similarly, set the version of ujson in requirements.txt to be the same as in setup.py
2017-01-28 18:38:14 +01:00