spaCy

mirror of https://github.com/explosion/spaCy.git synced 2025-08-04 04:10:20 +03:00

Author	SHA1	Message	Date
Matthew Honnibal	66646ead26	Update travis	2017-09-04 18:14:15 +02:00
Matthew Honnibal	9d65d67985	Preserve model compatibility in parser, for now	2017-09-04 16:46:22 +02:00
Matthew Honnibal	d5fbf27335	Fix test	2017-09-04 16:45:11 +02:00
Matthew Honnibal	7fdafcc4c4	Fix config loading in tagger	2017-09-04 16:38:49 +02:00
Matthew Honnibal	058372d120	Merge branch 'develop' of https://github.com/explosion/spaCy into develop	2017-09-04 16:27:53 +02:00
Matthew Honnibal	16e25ce3b5	Merge branch 'develop' of https://github.com/explosion/spaCy into develop	2017-09-04 09:26:53 -05:00
Matthew Honnibal	9f512e657a	Fix drop_layer calculation	2017-09-04 09:26:38 -05:00
Matthew Honnibal	e8a26ebfab	Add efficiency note to new get_lca_matrix() method	2017-09-04 15:43:52 +02:00
Matthew Honnibal	cb4839033c	Fix loader for EN tests	2017-09-04 15:19:18 +02:00
Matthew Honnibal	382ce566eb	Fix deserialization bug	2017-09-04 15:19:01 +02:00
Matthew Honnibal	bfddf50081	Fix #1296 : Incorrect lemmatization of base form verbs	2017-09-04 15:18:41 +02:00
Matthew Honnibal	b29e6bff46	Improve lemmatization rule for am\|VBP	2017-09-04 15:18:10 +02:00
Matthew Honnibal	644d6c9e1a	Improve lemmatization tests, re #1296	2017-09-04 15:17:44 +02:00
Eric Zhao	d61c117081	Lowest common ancestor matrix for spans and docs Added functionality for spans and docs to get lowest common ancestor matrix by simply calling: doc.get_lca_matrix() or doc[:3].get_lca_matrix(). Corresponding unit tests were also added under spacy/tests/doc and spacy/tests/spans. Designed to address: https://github.com/explosion/spaCy/issues/969.	2017-09-03 12:22:19 -07:00
Matthew Honnibal	3cf3fa1704	Merge branch 'develop' of https://github.com/explosion/spaCy into develop	2017-09-02 12:46:11 -05:00
Matthew Honnibal	e920885676	Fix pickle during train	2017-09-02 12:46:01 -05:00
Matthew Honnibal	c0eaba8b28	Fix low-data textcat	2017-09-02 15:17:32 +02:00
Matthew Honnibal	9e378bdac5	Fix textcat serialization	2017-09-02 15:17:20 +02:00
Matthew Honnibal	e3ea6ee02b	Increment version	2017-09-02 15:17:01 +02:00
Matthew Honnibal	a3b69bcb3d	Add low_data mode in textcat	2017-09-02 14:56:30 +02:00
Matthew Honnibal	ead78c7b9b	Merge branch 'develop' of https://github.com/explosion/spaCy into develop	2017-09-02 12:55:25 +02:00
Matthew Honnibal	5e6a9e7dcc	Add rule-based SBD	2017-09-02 12:53:38 +02:00
Matthew Honnibal	a824cf8f9a	Adjust text classification model	2017-09-02 11:41:00 +02:00
Matthew Honnibal	9bffcaa73d	Update test to make it slightly more direct The `nlp` container should be unnecessary here. If so, we can test the tokenizer class just a little more directly.	2017-09-01 21:16:56 +02:00
Matthew Honnibal	ac040b99bb	Add support for pre-trained vectors in text classifier	2017-09-01 16:39:55 +02:00
Matthew Honnibal	7742a6d559	Add GloVe vectors reader	2017-09-01 16:39:22 +02:00
Matthew Honnibal	789e1a3980	Use 13 parser features, not 8	2017-08-31 14:13:00 -05:00
Matthew Honnibal	30e35d9666	Fix syntax error	2017-08-30 17:35:39 -05:00
Matthew Honnibal	4ceebde523	Fix gradient bug in parser	2017-08-30 17:32:56 -05:00
Vimos Tan	a6d9fb5bb6	fix issue #1292	2017-08-30 14:49:14 +08:00
Paul O'Leary McCann	8b3e1f7b5b	Handle out-of-vocab words Wasn't handling words out of the tokenizer dictionary vocabulary properly. This adds a fix and test for that. -POLM	2017-08-29 23:58:42 +09:00
ines	173089a45a	Add more validation for model meta	2017-08-29 11:21:46 +02:00
Matthew Honnibal	2e28982e28	Merge pull request #1288 from geovedi/indonesian Indonesian language support	2017-08-26 21:31:13 +02:00
ines	7e04b7f89c	Fix info text on pipeline in package cli	2017-08-26 18:30:59 +02:00
ines	40afa13a8a	Increment version	2017-08-26 18:30:49 +02:00
Matthew Honnibal	876f38c548	Merge pull request #1279 from oroszgy/model_cli_v2 Added vector loading to model cli	2017-08-26 15:57:50 +02:00
Matthew Honnibal	cfc055734e	Split % in units, for compatibility with corpus	2017-08-25 20:03:37 -05:00
Matthew Honnibal	4bb6bc3f9e	Add support for sent_start to GoldParse	2017-08-25 20:03:14 -05:00
Matthew Honnibal	44589fb38c	Fix Break oracle	2017-08-25 19:50:55 -05:00
Matthew Honnibal	6d4e8e14ca	Merge branch 'develop' of https://github.com/explosion/spaCy into develop	2017-08-25 12:37:16 -05:00
Matthew Honnibal	4ce5531389	Use layer norm instead of batch norm	2017-08-25 12:37:10 -05:00
Matthew Honnibal	20dd66ddc2	Constrain sentence boundaries to IS_PUNCT and IS_SPACE tokens	2017-08-25 19:35:47 +02:00
Jim Geovedi	58d8078971	Merge remote-tracking branch 'upstream/develop' into indonesian	2017-08-25 09:21:49 +08:00
Matthew Honnibal	6ceb0f0518	Allow Lexeme.rank to be set	2017-08-24 21:43:00 +02:00
Jeffrey Gerard	884ba168a8	Capture more noun chunks	2017-08-23 21:18:53 -07:00
Matthew Honnibal	44a1fa80d3	Merge branch 'develop' of https://github.com/explosion/spaCy into develop	2017-08-23 13:02:16 +02:00
ines	bb1abbeba5	Only link model if download was successfull	2017-08-23 12:36:31 +02:00
Matthew Honnibal	bb2541ffd3	Fix PROB attr for OOV words	2017-08-23 12:11:52 +02:00
Matthew Honnibal	1c5c256e58	Fix fine_tune when optimizer is None	2017-08-23 10:51:33 +02:00
Matthew Honnibal	9c580ad28a	Merge branch 'develop' of https://github.com/explosion/spaCy into develop	2017-08-22 17:02:04 -05:00

... 44 45 46 47 48 ...

8727 Commits