Matthew Honnibal
|
394633efce
|
Make doc pickling support hooks
|
2017-10-17 19:44:09 +02:00 |
|
Matthew Honnibal
|
cdb0c426d8
|
Improve deserialization of user_data, esp. for Underscore
|
2017-10-17 19:29:20 +02:00 |
|
Matthew Honnibal
|
32a8564c79
|
Fix doc pickling
|
2017-10-17 18:20:24 +02:00 |
|
Matthew Honnibal
|
92c1eb2d6f
|
Fix Doc pickling. This also removes need for Binder class
|
2017-10-17 16:11:13 +02:00 |
|
Matthew Honnibal
|
a002264fec
|
Remove caching of Token in Doc, as caused cycle.
|
2017-10-16 19:34:21 +02:00 |
|
ines
|
e0ff145a8b
|
Merge branch 'develop' into feature/dot-underscore
|
2017-10-11 11:57:05 +02:00 |
|
Matthew Honnibal
|
3b527fa52b
|
Call morphology.assign_untagged when pushing token to Doc
|
2017-10-11 03:23:57 +02:00 |
|
Matthew Honnibal
|
e0a9b02b67
|
Merge Span._ and Span.as_doc methods
|
2017-10-09 22:00:15 -05:00 |
|
Matthew Honnibal
|
e938bce320
|
Adjust parsing transition system to allow preset sentence segments.
|
2017-10-08 23:53:34 +02:00 |
|
Matthew Honnibal
|
668a0ea640
|
Pass extensions into Underscore class
|
2017-10-07 18:56:01 +02:00 |
|
ines
|
2480f8f521
|
Add missing return in Doc.from_disk() (closes #1330)
|
2017-09-18 15:32:00 +02:00 |
|
Matthew Honnibal
|
03b5b9727a
|
Fix Doc.vector for empty doc objects
|
2017-08-22 19:52:19 +02:00 |
|
Matthew Honnibal
|
0551b7b03a
|
Fix doc.vector
|
2017-08-22 19:46:52 +02:00 |
|
Matthew Honnibal
|
8b7ac77c23
|
Allow span label to be string in Doc.char_span
|
2017-08-19 16:18:09 +02:00 |
|
Matthew Honnibal
|
80236116a6
|
Add Doc.char_span method, to get a span by character offset
|
2017-08-19 12:21:09 +02:00 |
|
Matthew Honnibal
|
a6a2159969
|
Add slot for text categories to Doc
|
2017-07-22 00:34:15 +02:00 |
|
Matthew Honnibal
|
2a3bd5ee90
|
Fix fetching of noun chunk iterator
|
2017-06-04 15:53:05 -05:00 |
|
Matthew Honnibal
|
92ae36f84e
|
Improve way noun chunks iterator is looked up
|
2017-06-04 21:53:39 +02:00 |
|
Matthew Honnibal
|
675f448313
|
Fix vector linkage on Doc
|
2017-06-04 14:25:30 -05:00 |
|
ines
|
459a1e8470
|
Fix whitespace
|
2017-06-03 11:31:18 +02:00 |
|
ines
|
5109bba910
|
Port over fix from #1070
|
2017-06-03 11:31:11 +02:00 |
|
Matthew Honnibal
|
498ad85309
|
Try using tensor for vector/similarity methdos
|
2017-05-30 23:35:17 +02:00 |
|
Matthew Honnibal
|
4ddff020c3
|
Fix compile error
|
2017-05-28 23:30:40 +02:00 |
|
Matthew Honnibal
|
6d3caeadd2
|
Fix type check for long
|
2017-05-28 23:22:45 +02:00 |
|
Matthew Honnibal
|
7996d21717
|
Fixes for new StringStore
|
2017-05-28 11:09:27 -05:00 |
|
Matthew Honnibal
|
fe11564b8e
|
Finish stringstore change. Also xfail vectors tests
|
2017-05-28 15:10:22 +02:00 |
|
Matthew Honnibal
|
84e66ca6d4
|
WIP on stringstore change. 27 failures
|
2017-05-28 14:06:40 +02:00 |
|
ines
|
66088851dc
|
Add Doc.to_disk() and Doc.from_disk() methods
|
2017-05-24 11:58:17 +02:00 |
|
Matthew Honnibal
|
d44b1eafc4
|
Fix conflict artefacts
|
2017-05-23 18:47:11 +02:00 |
|
Matthew Honnibal
|
d68dd1f251
|
Add SENT_START attribute, for custom sentence boundary detection
|
2017-05-23 18:37:58 +02:00 |
|
ines
|
23f9a3ccc8
|
Update docstrings and API docs for Doc
|
2017-05-19 18:47:39 +02:00 |
|
ines
|
8455cb1327
|
Update docstring for Doc.__getitem__
|
2017-05-19 00:30:51 +02:00 |
|
ines
|
b687ad109d
|
Update docstrings and API docs for Doc class
|
2017-05-18 23:59:44 +02:00 |
|
ines
|
b87066ff10
|
Update docstrings and API docs for Doc class
|
2017-05-18 22:17:41 +02:00 |
|
ines
|
9d85cda8e4
|
Fix models error message and use about.__docs_models__ (see #1051)
|
2017-05-13 13:05:47 +02:00 |
|
ines
|
6b942763f0
|
Tidy up imports
|
2017-05-13 13:04:40 +02:00 |
|
ines
|
b9dea345e5
|
Remove old import
|
2017-05-13 12:32:11 +02:00 |
|
ines
|
293ee359c5
|
Fix formatting
|
2017-05-13 12:32:06 +02:00 |
|
Matthew Honnibal
|
ee1d35bdb0
|
Fix merge conflict
|
2017-05-13 03:20:19 +02:00 |
|
Matthew Honnibal
|
b2540d2379
|
Merge Kengz's tree_print patch
|
2017-05-13 03:18:49 +02:00 |
|
Matthew Honnibal
|
4efb391994
|
Fix serializer
|
2017-05-09 18:45:18 +02:00 |
|
Matthew Honnibal
|
1166b0c491
|
Implement Doc.to_bytes and Doc.from_bytes methods
|
2017-05-09 18:11:34 +02:00 |
|
Matthew Honnibal
|
9e167b7bb6
|
Strip serializer from code
|
2017-05-09 17:28:50 +02:00 |
|
ines
|
0739ae7b76
|
Tidy up and fix formatting and imports
|
2017-04-15 13:05:15 +02:00 |
|
ines
|
e71a1f4bd0
|
Fix download commands in error messages (see #946)
|
2017-04-01 10:20:57 +02:00 |
|
Matthew Honnibal
|
51882ee2b8
|
Fix check for setting ent_id in merge
|
2017-03-31 19:32:01 +02:00 |
|
Matthew Honnibal
|
9720103428
|
Improve attribute handlign in doc.merge(). Still unsatisfying
|
2017-03-31 13:59:58 +02:00 |
|
Matthew Honnibal
|
0fefdfcbda
|
Merge pull request #935 from ericzhao28/master
Add option to use label=ent_type in doc.merge arguments (Bug fix for issue #862)
|
2017-03-30 02:51:24 +02:00 |
|
Eric Zhao
|
aafdf6ffb8
|
Add option to use label karg to determine ent_type in doc.merge
|
2017-03-28 23:35:03 -07:00 |
|
Roman Inflianskas
|
66e1109b53
|
Add support for Universal Dependencies v2.0
|
2017-03-03 13:17:34 +01:00 |
|