Matthew Honnibal
|
ec63f4fe7b
|
Add option to control how missing entities are handled when getting NER tags
|
2017-07-29 21:58:37 +02:00 |
|
Matthew Honnibal
|
aff325b7e0
|
Increment version
|
2017-07-25 19:41:20 +02:00 |
|
Matthew Honnibal
|
6780132821
|
Fix tagger loading
|
2017-07-25 19:41:11 +02:00 |
|
Matthew Honnibal
|
fd20a4af55
|
Increment version
|
2017-07-25 18:58:34 +02:00 |
|
Matthew Honnibal
|
523b0df2c9
|
Update text classification model
|
2017-07-25 18:57:59 +02:00 |
|
Matthew Honnibal
|
7c7fac9337
|
Add spacy.blank() loading function
|
2017-07-25 18:56:37 +02:00 |
|
Matthew Honnibal
|
5771bd1ff8
|
Increment version
|
2017-07-23 14:18:38 +02:00 |
|
Matthew Honnibal
|
c4a81a47a4
|
Fix deserialization
|
2017-07-23 14:11:07 +02:00 |
|
Matthew Honnibal
|
2df563ad24
|
Remove optimization for textcat that caused loading problem
|
2017-07-23 14:10:51 +02:00 |
|
Matthew Honnibal
|
4fe77bced2
|
Add cfg attr to pipeline components
|
2017-07-23 00:52:47 +02:00 |
|
Matthew Honnibal
|
d8aa721664
|
Compute Language.meta with a property
|
2017-07-23 00:50:18 +02:00 |
|
Matthew Honnibal
|
a88a7deffe
|
Fix save/load of textcat config
|
2017-07-23 00:33:43 +02:00 |
|
Matthew Honnibal
|
9bae0ddc50
|
Fix minibatching
|
2017-07-22 20:14:49 +02:00 |
|
Matthew Honnibal
|
ded0df5e2f
|
Expose hyper-param as keyword arg
|
2017-07-22 20:14:37 +02:00 |
|
Matthew Honnibal
|
f5de8deeec
|
Increment version
|
2017-07-22 20:04:53 +02:00 |
|
Matthew Honnibal
|
b55714d5d1
|
Make gold_tuples arg optional in begin_training
|
2017-07-22 20:04:43 +02:00 |
|
Matthew Honnibal
|
ed6c85fa3c
|
Fix loading of text categories in GoldParse
|
2017-07-22 20:04:03 +02:00 |
|
Matthew Honnibal
|
6ffec9dfea
|
Update _ml, for textcat model
|
2017-07-22 20:03:40 +02:00 |
|
Matthew Honnibal
|
d6a5c2c85a
|
Add test for NER
|
2017-07-22 01:48:58 +02:00 |
|
Matthew Honnibal
|
28244df4da
|
Add test for beam parsing
|
2017-07-22 01:48:35 +02:00 |
|
Matthew Honnibal
|
c86445bdfd
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-07-22 01:14:28 +02:00 |
|
Matthew Honnibal
|
b3a749610e
|
Fix name of TextCategorizer
|
2017-07-22 01:14:07 +02:00 |
|
Matthew Honnibal
|
2424493970
|
Remove unnecessary import of Mock
|
2017-07-22 01:13:54 +02:00 |
|
Matthew Honnibal
|
baa3d81c35
|
Add text categorizer to Language
|
2017-07-22 01:13:36 +02:00 |
|
Matthew Honnibal
|
a6a2159969
|
Add slot for text categories to Doc
|
2017-07-22 00:34:15 +02:00 |
|
Matthew Honnibal
|
374ab3ecfb
|
Increment alpha version
|
2017-07-22 00:32:49 +02:00 |
|
Matthew Honnibal
|
289f23df51
|
Test beam parsing
|
2017-07-20 15:03:10 +02:00 |
|
Matthew Honnibal
|
3da1063b36
|
Add beam decoding to parser, to allow NER uncertainties
|
2017-07-20 15:02:55 +02:00 |
|
Matthew Honnibal
|
0ca5832427
|
Improve negative example handling in NER oracle
|
2017-07-20 00:18:49 +02:00 |
|
Matthew Honnibal
|
a231b56d40
|
Add text-classification hook to pipeline
|
2017-07-20 00:18:15 +02:00 |
|
Matthew Honnibal
|
7ea50182a5
|
Add support for text-classification labels to GoldParse
|
2017-07-20 00:17:47 +02:00 |
|
Matthew Honnibal
|
727481377e
|
Add text-classifier thinc models
|
2017-07-20 00:17:17 +02:00 |
|
Matthew Honnibal
|
f014138c11
|
Fix parser tests
|
2017-07-20 00:16:52 +02:00 |
|
Ines Montani
|
c91642efd5
|
Port over changes from #1168
|
2017-07-01 11:43:54 +02:00 |
|
Jim Regan
|
d81ceb0cd5
|
Merge branch 'develop' into polish
|
2017-06-26 22:42:27 +01:00 |
|
Jim O'Regan
|
2f84c73585
|
a start
|
2017-06-26 22:40:04 +01:00 |
|
Jim O'Regan
|
28d7f0a672
|
reference
|
2017-06-26 22:38:28 +01:00 |
|
Matthew Honnibal
|
91e52543ef
|
Merge pull request #1118 from Gregory-Howard/patch-2
Update _tokenizer_exceptions_list (adding cities)
|
2017-06-20 11:16:07 +02:00 |
|
Matthew Honnibal
|
8ea785e01a
|
Merge pull request #1119 from oroszgy/patch-3
Fixed conllu converter
|
2017-06-20 11:14:41 +02:00 |
|
Tpt
|
7745b3ae04
|
Adds noun chunks to French syntax iterators
|
2017-06-12 15:29:58 +02:00 |
|
Tpt
|
57e8254f63
|
Adds function to extract French noun chunks
|
2017-06-12 15:20:49 +02:00 |
|
György Orosz
|
62dbf9025c
|
Fixed conllu converter
|
2017-06-09 22:53:56 +02:00 |
|
Grégory Howard
|
cd974b32b7
|
Update _tokenizer_exceptions_list (adding cities)
|
2017-06-09 17:58:18 +02:00 |
|
ines
|
34a2eecb17
|
Add simple "naughty strings" test (see #1107)
|
2017-06-06 17:43:51 +02:00 |
|
ines
|
045574a936
|
Update package name and increment version
|
2017-06-05 20:41:30 +02:00 |
|
Matthew Honnibal
|
1f5874a927
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-06-05 20:20:00 +02:00 |
|
ines
|
03db56f48c
|
Detect spaCy version and add package title
Package title allows customised package names (like spacy-nightly)
|
2017-06-05 20:11:02 +02:00 |
|
Matthew Honnibal
|
c0d90f52f7
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-06-05 19:20:13 +02:00 |
|
ines
|
cc9c5dc7a3
|
Fix noun chunks test
|
2017-06-05 16:39:04 +02:00 |
|
Matthew Honnibal
|
836bfa2d0f
|
Add factory for experimental SimilarityHook component
|
2017-06-05 15:40:22 +02:00 |
|
Matthew Honnibal
|
d59fa32df1
|
Add experimental SimilarityHook component
|
2017-06-05 15:40:03 +02:00 |
|
Matthew Honnibal
|
5489b49203
|
Remove print statement
|
2017-06-05 13:20:41 +02:00 |
|
Matthew Honnibal
|
fc4204a12a
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-06-05 13:13:23 +02:00 |
|
Matthew Honnibal
|
2479cde446
|
Support disable keyword in Language.__init__
|
2017-06-05 13:13:07 +02:00 |
|
ines
|
ea167e14db
|
Fix model package loading from link
|
2017-06-05 13:10:49 +02:00 |
|
ines
|
dd6dc4c120
|
Update spacy.load() helper functions
|
2017-06-05 13:02:31 +02:00 |
|
Matthew Honnibal
|
b4cdd05466
|
Add vectors.pyx in setup
|
2017-06-05 12:45:29 +02:00 |
|
Matthew Honnibal
|
280d419529
|
Add pickle method for vectors
|
2017-06-05 12:36:04 +02:00 |
|
Matthew Honnibal
|
30369d580f
|
Start testing Vectors class
|
2017-06-05 12:32:49 +02:00 |
|
Matthew Honnibal
|
eb7cbb62c2
|
Flesh out Vectors class
|
2017-06-05 12:32:08 +02:00 |
|
ines
|
51d7414e94
|
Make sure sents are a list
|
2017-06-05 12:30:13 +02:00 |
|
Matthew Honnibal
|
ebb6c49cd5
|
Make alignment case-insensitive for gold
|
2017-06-04 20:26:42 -05:00 |
|
Matthew Honnibal
|
fc4dd62e84
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-06-04 20:19:05 -05:00 |
|
Matthew Honnibal
|
8f8f90b46b
|
Disable labeller if not parsing
|
2017-06-04 20:18:54 -05:00 |
|
Matthew Honnibal
|
c52fde40f4
|
Improve train CLI
|
2017-06-04 20:18:37 -05:00 |
|
Matthew Honnibal
|
a053b1218e
|
Fix item counting during training
|
2017-06-04 20:18:20 -05:00 |
|
Matthew Honnibal
|
b3b5521625
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-06-04 20:17:18 -05:00 |
|
Matthew Honnibal
|
9bc4a26213
|
Add option of data augmentation noise
|
2017-06-04 20:16:57 -05:00 |
|
Matthew Honnibal
|
7b2ede783d
|
Add SP tag to tag map if missing
|
2017-06-04 20:16:30 -05:00 |
|
ines
|
a0f4592f0a
|
Update tests
|
2017-06-05 02:26:13 +02:00 |
|
ines
|
3e105bcd36
|
Update tests
|
2017-06-05 02:09:27 +02:00 |
|
Matthew Honnibal
|
516798e9fc
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-06-05 01:35:21 +02:00 |
|
Matthew Honnibal
|
193bf913c0
|
Set is_tagged=True after tagging
|
2017-06-05 01:35:07 +02:00 |
|
ines
|
078232932c
|
Fix tokenizer fixture scope
|
2017-06-05 01:06:34 +02:00 |
|
Matthew Honnibal
|
58be0e1f6f
|
Update tests
|
2017-06-04 16:35:06 -05:00 |
|
Matthew Honnibal
|
b78cc318c3
|
Fix loading of morphology exceptions
|
2017-06-04 16:34:32 -05:00 |
|
Matthew Honnibal
|
bb98d45a63
|
Fix tests
|
2017-06-04 16:00:44 -05:00 |
|
Matthew Honnibal
|
55d0621532
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-06-04 15:53:25 -05:00 |
|
Matthew Honnibal
|
5b9f116aca
|
Update tests
|
2017-06-04 15:53:17 -05:00 |
|
Matthew Honnibal
|
2a3bd5ee90
|
Fix fetching of noun chunk iterator
|
2017-06-04 15:53:05 -05:00 |
|
Matthew Honnibal
|
3680c51b8f
|
Avoid clobbering preset POS tags
|
2017-06-04 15:52:42 -05:00 |
|
Matthew Honnibal
|
939e8ed567
|
Add lookup properties for components in Language
|
2017-06-04 15:52:09 -05:00 |
|
Matthew Honnibal
|
e28f90b672
|
Fix syntax iterators
|
2017-06-04 15:51:50 -05:00 |
|
ines
|
8a29308d0b
|
Remove unused imports
|
2017-06-04 22:39:29 +02:00 |
|
Ines Montani
|
112c5787eb
|
Merge pull request #1101 from oroszgy/hu_tokenizer_fix
More robust Hungarian tokenizer.
|
2017-06-04 22:37:51 +02:00 |
|
ines
|
96867a24ae
|
Fix typo
|
2017-06-04 22:36:40 +02:00 |
|
ines
|
f432bb4b48
|
Fix fixture scopes
|
2017-06-04 22:34:31 +02:00 |
|
Matthew Honnibal
|
6d0356e6cc
|
Whitespace
|
2017-06-04 14:55:24 -05:00 |
|
Matthew Honnibal
|
8a683a4494
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-06-04 21:53:56 +02:00 |
|
Matthew Honnibal
|
92ae36f84e
|
Improve way noun chunks iterator is looked up
|
2017-06-04 21:53:39 +02:00 |
|
ines
|
9254a3dd78
|
Import and add Spanish syntax iterators
|
2017-06-04 21:42:15 +02:00 |
|
ines
|
7db1a0e83e
|
Make sure printed values are always strings
|
2017-06-04 21:27:20 +02:00 |
|
Matthew Honnibal
|
51e1541ddb
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-06-04 14:26:29 -05:00 |
|
Matthew Honnibal
|
add9a33782
|
Return False for vocab.has_vector
|
2017-06-04 14:26:14 -05:00 |
|
Matthew Honnibal
|
675f448313
|
Fix vector linkage on Doc
|
2017-06-04 14:25:30 -05:00 |
|
Matthew Honnibal
|
f4662e9218
|
Fix vector linkage for token
|
2017-06-04 14:19:58 -05:00 |
|
ines
|
070e026ed9
|
Ensure path on read_json
|
2017-06-04 20:44:37 +02:00 |
|
ines
|
e1e73936b1
|
Raise correct error
|
2017-06-04 20:44:27 +02:00 |
|
ines
|
848e47669e
|
Fix typo
|
2017-06-04 20:44:15 +02:00 |
|
ines
|
c4614c02a2
|
Fix dev resources URL
|
2017-06-04 15:45:50 +02:00 |
|