spaCy/website/docs
Jan Jessewitsch e4dcac4a4b
Merging multiple docs into one (#5032)
* Add static method to Doc to allow merging of multiple docs.

* Add error description for the error that occurs if docs with different
vocabs (from different languages) are merged in Doc.from_docs().

* Add test for Doc.from_docs() implementation.

* Fix using numpy's concatenate in Doc.from_docs.

* Replace typing's type annotations in from_docs.

* Simply remove type annotations in from_docs.

* Add documentation for Doc.from_docs to api.

* Simplify from_docs, its test and the api doc for codebase consistency.

* Fix merging of Doc objects that end with whitespaces (Achieved by simply not setting the SPACY attribute on whitespace tokens). Remove two unnecessary imports of attributes.

* Add merging of user data from Doc objects in from_docs. Add user data test case to corresponding test. Add applicable warning messages.

* Fix incorrect setting of tokens idx by using concatenated spaces (again). Add test case to corresponding test.

* Add MORPH to attrs

* Update warnings calls

* Remove out-dated error from merge

* Rename space_delimiter to ensure_whitespace

Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
2020-07-03 11:32:42 +02:00
..
api Merging multiple docs into one (#5032) 2020-07-03 11:32:42 +02:00
images Start updating website for v3 [ci skip] 2020-07-01 21:26:39 +02:00
models Divide models into core and starters [ci skip] 2019-12-21 14:10:22 +01:00
usage Update matcher usage examples [ci skip] 2020-07-02 15:39:45 +02:00
index.md 💫 Update website (#3285) 2019-02-17 19:31:19 +01:00
styleguide.md 💫 v2.1.0 launch updates (only merge on launch!) (#3414) 2019-03-18 16:07:26 +01:00