ines
|
9d85cda8e4
|
Fix models error message and use about.__docs_models__ (see #1051)
|
2017-05-13 13:05:47 +02:00 |
|
ines
|
6b942763f0
|
Tidy up imports
|
2017-05-13 13:04:40 +02:00 |
|
ines
|
b9dea345e5
|
Remove old import
|
2017-05-13 12:32:11 +02:00 |
|
ines
|
293ee359c5
|
Fix formatting
|
2017-05-13 12:32:06 +02:00 |
|
Matthew Honnibal
|
ee1d35bdb0
|
Fix merge conflict
|
2017-05-13 03:20:19 +02:00 |
|
Matthew Honnibal
|
b2540d2379
|
Merge Kengz's tree_print patch
|
2017-05-13 03:18:49 +02:00 |
|
Matthew Honnibal
|
4efb391994
|
Fix serializer
|
2017-05-09 18:45:18 +02:00 |
|
Matthew Honnibal
|
1166b0c491
|
Implement Doc.to_bytes and Doc.from_bytes methods
|
2017-05-09 18:11:34 +02:00 |
|
Matthew Honnibal
|
9e167b7bb6
|
Strip serializer from code
|
2017-05-09 17:28:50 +02:00 |
|
ines
|
0739ae7b76
|
Tidy up and fix formatting and imports
|
2017-04-15 13:05:15 +02:00 |
|
ines
|
e71a1f4bd0
|
Fix download commands in error messages (see #946)
|
2017-04-01 10:20:57 +02:00 |
|
Matthew Honnibal
|
51882ee2b8
|
Fix check for setting ent_id in merge
|
2017-03-31 19:32:01 +02:00 |
|
Matthew Honnibal
|
9720103428
|
Improve attribute handlign in doc.merge(). Still unsatisfying
|
2017-03-31 13:59:58 +02:00 |
|
Matthew Honnibal
|
0fefdfcbda
|
Merge pull request #935 from ericzhao28/master
Add option to use label=ent_type in doc.merge arguments (Bug fix for issue #862)
|
2017-03-30 02:51:24 +02:00 |
|
Eric Zhao
|
aafdf6ffb8
|
Add option to use label karg to determine ent_type in doc.merge
|
2017-03-28 23:35:03 -07:00 |
|
Roman Inflianskas
|
66e1109b53
|
Add support for Universal Dependencies v2.0
|
2017-03-03 13:17:34 +01:00 |
|
Matvey Ezhov
|
32a22291bc
|
Small Doc.count_by documentation update
Current example doesn't work
|
2017-01-31 19:18:45 +03:00 |
|
Matthew Honnibal
|
6c665b81df
|
Fix redundant == TAG in from_array conditional
|
2017-01-31 00:46:21 +11:00 |
|
Matthew Honnibal
|
44e2b0100d
|
Support TAG attribute in doc.from_array
|
2017-01-10 22:47:07 +01:00 |
|
kengz
|
73a38bd4d1
|
Merge remote-tracking branch 'upstream/master'
|
2016-12-30 12:19:59 -05:00 |
|
kengz
|
da44183ae1
|
move parse_tree logic to a new tokens/printers.py file
|
2016-12-30 12:19:18 -05:00 |
|
Pokey Rule
|
3e3bda142d
|
Add noun_chunks to Span
|
2016-11-24 10:47:20 +00:00 |
|
Matthew Honnibal
|
1fb09c3dc1
|
Fix morphology tagger
|
2016-11-04 19:19:09 +01:00 |
|
Matthew Honnibal
|
f292f7f0e6
|
Fix Issue #599, by considering empty documents to be parsed and tagged. Implementation is a bit dodgy.
|
2016-11-02 23:48:43 +01:00 |
|
Matthew Honnibal
|
e7af6b937f
|
Fix syntax error while fixing doc strings
|
2016-11-01 13:27:32 +01:00 |
|
Matthew Honnibal
|
b86f8af0c1
|
Fix doc strings
|
2016-11-01 12:25:36 +01:00 |
|
Matthew Honnibal
|
4ca31b4d87
|
Fix clobbering of 'missing' named ent values after assigning ents.
|
2016-10-26 13:13:56 +02:00 |
|
Matthew Honnibal
|
15c9b59f0e
|
Fix Issue #461: O tag was being clobbered by doc.ents.__set__
|
2016-10-23 15:50:26 +02:00 |
|
Matthew Honnibal
|
2c3a67b693
|
Fix calculation of vector norm, re Issue #522. Need to consolidate the calculations into a helper function.
|
2016-10-23 14:49:31 +02:00 |
|
Matthew Honnibal
|
3588a18fb8
|
Fix hook names in doc
|
2016-10-19 21:15:16 +02:00 |
|
Matthew Honnibal
|
5d5742b773
|
Add sentiment field to doc, rename getters_for_tokens and getters_for_spans, add user_hooks field to Doc.
|
2016-10-19 20:54:22 +02:00 |
|
Matthew Honnibal
|
9b60186266
|
Fix doc class
|
2016-10-17 15:23:47 +02:00 |
|
Matthew Honnibal
|
b67697a97b
|
Improve API for doc.merge() and span.merge(), to use keyword arguments.
|
2016-10-17 14:02:13 +02:00 |
|
Matthew Honnibal
|
fbb7f3f15c
|
Add user_data attribute to Doc object.
|
2016-10-17 11:43:22 +02:00 |
|
Matthew Honnibal
|
62230dd13a
|
Add getters_for_spans and getters_for_tokens attributes to Doc. Fix docstring
|
2016-10-17 02:42:51 +02:00 |
|
Matthew Honnibal
|
311a985fe0
|
Add input error handling in Doc
|
2016-10-16 18:16:42 +02:00 |
|
Matthew Honnibal
|
06322ba99d
|
Add words and spaces keyword arguments to Doc.
|
2016-10-16 18:13:03 +02:00 |
|
Matthew Honnibal
|
6736977d82
|
Revert "Changes to Doc and Token for new string store scheme"
This reverts commit 99de44d864 .
|
2016-09-30 20:11:15 +02:00 |
|
Matthew Honnibal
|
99de44d864
|
Changes to Doc and Token for new string store scheme
|
2016-09-30 20:00:21 +02:00 |
|
Matthew Honnibal
|
d3dc5718b2
|
Fix syntax error in Doc
|
2016-09-28 11:39:49 +02:00 |
|
Matthew Honnibal
|
1b520e7bab
|
Improve docstrings for Doc object
|
2016-09-28 11:15:13 +02:00 |
|
Matthew Honnibal
|
fc4a7ad794
|
Test and fix Issue #411: IndexError when .sents property is used on empty string.
|
2016-09-27 18:49:14 +02:00 |
|
Matthew Honnibal
|
15e42a1ba9
|
Allow entities to be set by Span, or by 4-tuple (with entity ID)
|
2016-09-24 01:17:43 +02:00 |
|
Matthew Honnibal
|
2735b6247b
|
Fix orths_and_spaces in Doc.__init__
|
2016-09-21 14:52:05 +02:00 |
|
Matthew Honnibal
|
cdc10e9a1c
|
* Fix Issue #375: noun phrase iteration results in index error if noun phrases are merged during the loop. Fix by accumulating the spans inside the noun_chunks property, allowing the Span index tricks to work.
|
2016-05-20 10:14:06 +02:00 |
|
Matthew Honnibal
|
5d86c30f0b
|
* Fix Issue #367: Missing has_vector property on Doc and Span objects
|
2016-05-09 12:36:14 +02:00 |
|
Matthew Honnibal
|
76021cb853
|
* Fix bug in Doc.text, introduced by a862edc
|
2016-05-04 11:02:16 +02:00 |
|
Matthew Honnibal
|
29a114e645
|
* Don't assign 0-valued tags in Doc.from_array
|
2016-05-02 16:07:50 +02:00 |
|
Matthew Honnibal
|
508fd1f6dc
|
* Refactor noun chunk iterators, so that they're simple functions. Install the iterator when the Doc is created, but allow users to write to the noun_chunk_iterator attribute. The iterator functions accept an object and yield (int start, int end, int label) triples.
|
2016-05-02 14:25:10 +02:00 |
|
Matthew Honnibal
|
872695759d
|
Merge pull request #306 from wbwseeker/german_noun_chunks
add German noun chunk functionality
|
2016-04-08 00:54:24 +10:00 |
|