Matthew Honnibal
|
6c15cf4193
|
Merge branch 'master' of ssh://github.com/explosion/spaCy
|
2016-10-19 00:16:28 +02:00 |
|
Matthew Honnibal
|
ae29b9bdfd
|
Fix travis and README conflicts
|
2016-10-19 00:16:11 +02:00 |
|
Matthew Honnibal
|
818dc83e26
|
Fix encoding error in setup.py
|
2016-10-19 00:05:53 +02:00 |
|
Matthew Honnibal
|
2bbb050500
|
Fix default of serializer_freqs
|
2016-10-18 19:55:41 +02:00 |
|
Matthew Honnibal
|
1b651db9c5
|
Fix parser creation in Language class.
|
2016-10-18 19:36:44 +02:00 |
|
Matthew Honnibal
|
45a6f9b9c7
|
Fix loading of tagger.
|
2016-10-18 19:33:04 +02:00 |
|
Matthew Honnibal
|
76c815f40d
|
Fix spacy.load
|
2016-10-18 19:23:31 +02:00 |
|
Matthew Honnibal
|
8c8f5c62c6
|
Add LANG attribute to English and German
|
2016-10-18 18:52:48 +02:00 |
|
Matthew Honnibal
|
05e2a589a4
|
Fix None label in matcher
|
2016-10-18 18:05:21 +02:00 |
|
Matthew Honnibal
|
c3a8a1cf51
|
Update serializer test.
|
2016-10-18 16:18:46 +02:00 |
|
Matthew Honnibal
|
7d5212f131
|
Refactor defaults
|
2016-10-18 16:18:25 +02:00 |
|
Matthew Honnibal
|
a45a9d5092
|
Remove stray .tensor attribute from Lexeme
|
2016-10-18 01:16:32 +02:00 |
|
Matthew Honnibal
|
9258db788a
|
Revert "Have the matcher return character offsets, to handle the match better."
This reverts commit 049c937540 .
|
2016-10-17 16:49:51 +02:00 |
|
Matthew Honnibal
|
7d446e5094
|
Revert "Update matcher test, to reflect character offset return instead of token offset."
This reverts commit f8d3e3bcfe .
|
2016-10-17 16:49:49 +02:00 |
|
Matthew Honnibal
|
4bf2c53c13
|
Revert "Hack on matcher tests, for new implementation."
This reverts commit dbe60644ab .
|
2016-10-17 16:49:48 +02:00 |
|
Matthew Honnibal
|
2fd97c71cc
|
Revert "Don't try to pickle matcher."
This reverts commit 97bd0c9d00 .
|
2016-10-17 16:49:43 +02:00 |
|
Matthew Honnibal
|
97bd0c9d00
|
Don't try to pickle matcher.
|
2016-10-17 16:38:40 +02:00 |
|
Matthew Honnibal
|
dbe60644ab
|
Hack on matcher tests, for new implementation.
|
2016-10-17 16:12:22 +02:00 |
|
Matthew Honnibal
|
f8d3e3bcfe
|
Update matcher test, to reflect character offset return instead of token offset.
|
2016-10-17 16:00:10 +02:00 |
|
Matthew Honnibal
|
049c937540
|
Have the matcher return character offsets, to handle the match better.
|
2016-10-17 15:58:57 +02:00 |
|
Matthew Honnibal
|
9b60186266
|
Fix doc class
|
2016-10-17 15:23:47 +02:00 |
|
Matthew Honnibal
|
6cbdc94959
|
Lots of updates to Matcher, to make entity handling sane.
|
2016-10-17 15:23:31 +02:00 |
|
Matthew Honnibal
|
7fd98fc91c
|
Remove deprecation shim around str/bytes in Token.
|
2016-10-17 14:02:47 +02:00 |
|
Matthew Honnibal
|
b67697a97b
|
Improve API for doc.merge() and span.merge(), to use keyword arguments.
|
2016-10-17 14:02:13 +02:00 |
|
Matthew Honnibal
|
fbb7f3f15c
|
Add user_data attribute to Doc object.
|
2016-10-17 11:43:22 +02:00 |
|
Matthew Honnibal
|
c1abc8f6ed
|
Fix deprecation stuff in Token: Remove the shim for the str/unicode semantics, and raise for has_repvec and repvec
|
2016-10-17 11:18:41 +02:00 |
|
Matthew Honnibal
|
4ba9eadf3d
|
Merge branch 'v1.0.0-rc1' of ssh://github.com/explosion/spaCy into v1.0.0-rc1
|
2016-10-17 02:45:44 +02:00 |
|
Matthew Honnibal
|
09ab447a18
|
Remove tensor property from token.
|
2016-10-17 02:45:09 +02:00 |
|
Matthew Honnibal
|
5d10e2005c
|
Defer some attributes to Doc, via getters_for_tokens attribute.
|
2016-10-17 02:44:49 +02:00 |
|
Matthew Honnibal
|
8829984efb
|
Remove tensor attribute from Span and Token.
|
2016-10-17 02:44:04 +02:00 |
|
Matthew Honnibal
|
d15a88c66a
|
Defer some attributes to Doc via getters_for_spans
|
2016-10-17 02:43:35 +02:00 |
|
Matthew Honnibal
|
62230dd13a
|
Add getters_for_spans and getters_for_tokens attributes to Doc. Fix docstring
|
2016-10-17 02:42:51 +02:00 |
|
Matthew Honnibal
|
ae11ea8240
|
Add getters_for_tokens and getters_for_spans attributes to Doc object.
|
2016-10-17 02:42:05 +02:00 |
|
Matthew Honnibal
|
be48a7b4f3
|
Fix conftest for website tests.
|
2016-10-17 01:54:26 +02:00 |
|
Matthew Honnibal
|
8951bf6989
|
Update matcher tests
|
2016-10-17 01:53:24 +02:00 |
|
Matthew Honnibal
|
0cf4aff470
|
Set default path in EN/DE tests.
|
2016-10-17 01:52:49 +02:00 |
|
Matthew Honnibal
|
cd71b6b0a9
|
Remove test of parser pickle
|
2016-10-17 01:52:10 +02:00 |
|
Matthew Honnibal
|
5bc101006e
|
Add cfg field to Tagger
|
2016-10-17 01:03:41 +02:00 |
|
Matthew Honnibal
|
517f090cbf
|
Use GoldParse in tagger.update
|
2016-10-17 00:55:15 +02:00 |
|
Matthew Honnibal
|
59038f7efa
|
Restore support for prior data format -- specifically, the labels field of the config.
|
2016-10-17 00:53:26 +02:00 |
|
Matthew Honnibal
|
c36e8676aa
|
Move old examples
|
2016-10-16 21:56:32 +02:00 |
|
Ines Montani
|
96bbcaf730
|
Update Gitter links
|
2016-10-16 21:53:24 +02:00 |
|
Ines Montani
|
7d5d2c81b2
|
Merge pull request #526 from crawfordcomeaux/badge-alts-and-gitter
Added Gitter badge, alt text for all badges to README.rst
|
2016-10-16 21:51:27 +02:00 |
|
Matthew Honnibal
|
7887ab3b36
|
Fix default use of feature_templates in parser
|
2016-10-16 21:41:56 +02:00 |
|
Matthew Honnibal
|
3fba897e0f
|
Update train_parser example
|
2016-10-16 21:41:14 +02:00 |
|
Matthew Honnibal
|
f787cd29fe
|
Refactor the pipeline classes to make them more consistent, and remove the redundant blank() constructor.
|
2016-10-16 21:34:57 +02:00 |
|
kengz
|
fb92e2d061
|
activate parse_tree test, use from_array, test for root correctness
|
2016-10-16 15:12:08 -04:00 |
|
kengz
|
17b7832419
|
mark test as needing models
|
2016-10-16 14:39:07 -04:00 |
|
kengz
|
f046e0d7c8
|
add parse_tree method to language, separate from __call__ for efficiency, but will use __call__ to get the doc
|
2016-10-16 14:20:23 -04:00 |
|
Matthew Honnibal
|
311a985fe0
|
Add input error handling in Doc
|
2016-10-16 18:16:42 +02:00 |
|