Commit Graph

215 Commits

Author SHA1 Message Date
akYoung
c158cdb1da Corretions for model test example
The sentences of test data in sentence entailment example should be generated with integers limited to vocab_size.
2017-05-03 22:41:23 +08:00
Matthew Honnibal
2da16adcc2 Add dropout optin for parser and NER
Dropout can now be specified in the `Parser.update()` method via
the `drop` keyword argument, e.g.

    nlp.entity.update(doc, gold, drop=0.4)

This will randomly drop 40% of features, and multiply the value of the
others by 1. / 0.4. This may be useful for generalising from small data
sets.

This commit also patches the examples/training/train_new_entity_type.py
example, to use dropout and fix the output (previously it did not output
the learned entity).
2017-04-27 13:18:39 +02:00
Matthew Honnibal
0605b95f2e Merge branch 'master' of https://github.com/explosion/spaCy 2017-04-18 13:48:00 +02:00
Matthew Honnibal
2f84626417 Fix train_new_entity_type example 2017-04-18 13:47:36 +02:00
Ines Montani
e7ae3b7cc2 Fix formatting and typo (closes #967) 2017-04-16 23:56:12 +02:00
Ines Montani
734b0a4e4a Update train_new_entity_type.py 2017-04-16 23:42:16 +02:00
ines
264af6cd17 Add documentation 2017-04-16 20:37:46 +02:00
ines
c7adca58a9 Tidy up example and only save/test if output_directory is not None 2017-04-16 16:55:01 +02:00
Matthew Honnibal
40e3024241 Move standalone NER training script into examples directory 2017-04-15 16:13:42 +02:00
Matthew Honnibal
b9c26aae11 Remove neptune refs from new train example 2017-04-15 16:13:17 +02:00
Matthew Honnibal
c729d72fc6 Add new example for training new entity types 2017-04-15 16:11:06 +02:00
Matthew Honnibal
a7626bd7fd Tmp commit to example 2017-04-15 15:43:14 +02:00
Matthew Honnibal
97b83c74dc WIP on training example 2017-04-14 23:54:27 +02:00
Kumaran Rajendhiran
3f55d6afae Update README 2017-04-05 16:59:52 +05:30
Kumaran Rajendhiran
47d7137c83 Set max_length to 100 for demo and evaluate 2017-04-05 16:48:35 +05:30
Kumaran Rajendhiran
10e8dcdfdb Remove not needed parameters from function 2017-04-05 16:20:47 +05:30
Matthew Honnibal
07726cf0a6 Add example of standalone NER training 2017-03-19 15:01:38 +01:00
Matthew Honnibal
f028f8ad28 Remove unfinished examples 2017-02-18 11:04:41 +01:00
Matthew Honnibal
c031c677cc Remove unused model_dir option
As noted in #845, the `model_dir` argument was not being used. I've removed it for now, although it would be good to have this option restored and working.
2017-02-18 10:38:22 +01:00
Matthew Honnibal
16ce7409e4 Merge branch 'master' of https://github.com/explosion/spaCy 2017-01-31 13:27:34 -06:00
Matthew Honnibal
80aa4e114b Fix x keras deep learning example 2017-01-31 13:27:13 -06:00
Matthew Honnibal
ab70f6e18d Update NER training example 2017-01-27 12:27:10 +01:00
Ines Montani
853130bcf8 Update installation instructions (see #727) 2017-01-14 22:12:42 +01:00
Matthew Honnibal
5a319060b9 Merge branch 'master' of https://github.com/explosion/spaCy 2016-12-20 16:26:57 -06:00
Matthew Honnibal
7793e2ad82 Fix use of dropout in sentiment analysis LSTM example 2016-12-20 16:26:38 -06:00
Christos Savvopoulos
c19b83f6ae use model_dir inside of load_model 2016-12-12 20:23:24 +00:00
Christos Savvopoulos
93cf4af701 actually commit load_ner.py 2016-12-12 20:13:33 +00:00
Christos Savvopoulos
ad54a929f8 train_ner should save vocab; add load_ner example 2016-12-12 20:09:49 +00:00
Matthew Honnibal
d0c999e0ad Add config.py for paddle example 2016-11-20 23:24:51 +01:00
Matthew Honnibal
d75fe7c19a Update paddle example 2016-11-20 21:45:08 +01:00
Matthew Honnibal
1ef541ddff Add train.sh for paddle 2016-11-20 21:44:33 +01:00
Matthew Honnibal
001abe2b9d Update config.py 2016-11-20 03:45:51 +01:00
Matthew Honnibal
409a18bd42 Add paddle sentiment example 2016-11-20 03:35:23 +01:00
Matthew Honnibal
e7eac08819 Work on paddle example 2016-11-20 03:29:36 +01:00
Matthew Honnibal
1ed40682a3 Set vectors in chainer example 2016-11-19 18:42:58 -06:00
Matthew Honnibal
b701a08249 Fix embedding in chainer sentiment example 2016-11-19 19:05:37 +01:00
Matthew Honnibal
8a2de46fcb Fix GPU usage in chainer example 2016-11-19 10:58:00 -06:00
Matthew Honnibal
4c84aae571 Merge branch 'master' of https://github.com/explosion/spaCy 2016-11-19 02:41:17 -06:00
Matthew Honnibal
3195c52741 Add WIP Chainer sentiment analysis code. 2016-11-19 09:27:59 +01:00
Matthew Honnibal
ff5ab75f5e Add partial embedding updates to Parikh model, fix dropout, other corrections. 2016-11-18 06:32:12 -06:00
Matthew Honnibal
718e66a7b9 Minibatch the forward pass. THe output argmax is incorrect... 2016-11-16 06:15:28 -06:00
Matthew Honnibal
8f053fd943 Add flag to toggle GPU to DyNet code 2016-11-16 05:51:00 -06:00
Matthew Honnibal
3a31c3a961 Merge branch 'master' of https://github.com/explosion/spaCy 2016-11-16 05:49:42 -06:00
Kyle P. Johnson
d105771a07 Add setup directions for data dir
This script's data needs are not intuitive. I have added a note explaining that (a) it expects pos/neg polarity data, (b) the structure of the data dir (train/test), and (c) a standard resource for such polarity data.
2016-11-13 10:08:16 -08:00
Kyle P. Johnson
c8d3694e2d Ch lex.repvec to lex.vector
For preventing the AttributeError: `File "spacy/lexeme.pyx", line 159, in spacy.lexeme.Lexeme.repvec.__get__ (spacy/lexeme.cpp:5016)
AttributeError: lex.repvec has been renamed to lex.vector`
2016-11-13 09:54:42 -08:00
Matthew Honnibal
389e8b700e Fix conflict 2016-11-13 08:52:20 -06:00
Matthew Honnibal
12a7b05360 Merge branch 'master' of https://github.com/explosion/spaCy 2016-11-13 08:49:07 -06:00
Matthew Honnibal
ef76c28d70 Update dynet example to use minibatching 2016-11-13 08:48:43 -06:00
Matthew Honnibal
fb8acc1dfb Merge pull request #628 from chenb67/master
Remove theano dependency from parikh model + small bug fix
2016-11-14 01:28:22 +11:00
Chen Buskilla
738f38e8d6 remove theano dependency, using keras backend functions 2016-11-13 15:06:01 +02:00
Chen Buskilla
a592075720 fix parikh entailment test methods bug with settings 2016-11-13 14:53:55 +02:00
Matthew Honnibal
ae681aa555 Work on DyNet example 2016-11-13 13:45:21 +01:00
Matthew Honnibal
89df91846c Fix entailment example, and add a flag for BiRNN encoding. 2016-11-12 11:43:37 -06:00
Paul Spiegelhalter
edf77a9dae added import of build_model 2016-11-11 15:13:12 -08:00
Paul Spiegelhalter
0d7031a8f1 syntax error on two functions 2016-11-11 15:12:03 -08:00
Matthew Honnibal
ca996fc01a Add BiRNN for entailment
Hastily add bidirectional RNN to entailment example
2016-11-12 01:15:01 +11:00
Matthew Honnibal
1ef62f39ef Update README.md 2016-11-01 13:30:10 +11:00
Matthew Honnibal
967412fb85 Minor edit 2016-11-01 13:22:36 +11:00
Ines Montani
589fc73910 Update README.md 2016-11-01 03:19:15 +01:00
Matthew Honnibal
18aab4f71e Merge branch 'master' of ssh://github.com/explosion/spaCy 2016-11-01 03:05:49 +01:00
Matthew Honnibal
6cf989ad26 Make the README more concise 2016-11-01 13:05:17 +11:00
Matthew Honnibal
45ebab4677 Rename inventory count example 2016-11-01 02:30:22 +01:00
Ines Montani
274cc0f08f Update README.md 2016-11-01 02:13:54 +01:00
Matthew Honnibal
0b7af54219 Rename entailment example 2016-11-01 01:52:11 +01:00
Matthew Honnibal
1b9c6240a7 Rename entailment example 2016-11-01 01:51:54 +01:00
Matthew Honnibal
58f7be93ee Draft readme for NLI example 2016-11-01 01:46:55 +01:00
Ines Montani
6b30475725 Add README.md to examples 2016-11-01 01:14:04 +01:00
Matthew Honnibal
de32b6e5b8 Add code for Keras NLI example 2016-10-31 23:54:28 +01:00
kendricktan
ba8841234a Fixed training examples
Changes:
1. train_ner won't crash if no data directory is not found
2. Fixed train_tagger expected spacy.gold.GoldParse, got list
2016-10-24 16:09:23 +10:00
kendricktan
9877f3298f updated training examples to v1.1.2 2016-10-24 11:53:33 +10:00
Matthew Honnibal
105aaadc07 Make deep_learning_keras example use sentences 2016-10-23 23:17:41 +02:00
Matthew Honnibal
1ae3bde58f Fix deep learning example code 2016-10-20 21:32:26 +02:00
kendricktan
f77b3dc677 Fixed train_parser examples when model_dir isn't None 2016-10-20 23:40:51 +10:00
kendricktan
d817d57219 Fixed train_ner examples when model_dir isn't None 2016-10-20 21:09:07 +10:00
Matthew Honnibal
213027a1a1 Fix deep learning example 2016-10-20 04:39:54 +02:00
Matthew Honnibal
5378949326 Fix example 2016-10-20 03:42:34 +02:00
Matthew Honnibal
d17546681c Fix deep learning tutorial 2016-10-20 03:21:56 +02:00
Matthew Honnibal
4c27958990 Fix bugs in deep_learning_keras example. 2016-10-20 02:49:14 +02:00
Matthew Honnibal
ca89fd0919 Update Keras deep learning tutorial 2016-10-19 19:37:09 +02:00
Matthew Honnibal
f60cefc048 Add first draft of spaCy+keras integration example. 2016-10-19 14:43:13 +02:00
Matthew Honnibal
c36e8676aa Move old examples 2016-10-16 21:56:32 +02:00
Matthew Honnibal
3fba897e0f Update train_parser example 2016-10-16 21:41:14 +02:00
Matthew Honnibal
f787cd29fe Refactor the pipeline classes to make them more consistent, and remove the redundant blank() constructor. 2016-10-16 21:34:57 +02:00
Matthew Honnibal
4e9727b474 Use new words keyword argument in Doc. 2016-10-16 18:16:25 +02:00
Matthew Honnibal
2508117553 Make train_parser example a bit simpler. 2016-10-16 17:58:37 +02:00
Matthew Honnibal
4574fe87c6 Add example for training parser 2016-10-16 17:05:55 +02:00
Matthew Honnibal
01b42c531f Update train_tagger script 2016-10-16 16:10:23 +02:00
Matthew Honnibal
e5151056cf Fix NER training example 2016-10-16 11:41:20 +02:00
Henning Peters
470cdf5bf9 remove deprecated LOCAL_DATA_DIR 2016-04-05 11:25:54 +02:00
Matthew Honnibal
9cd21ad5b5 Merge pull request #284 from olegzd/olegzd/example/inventoryCount
Added reloadable English() example for inventory counting
2016-03-25 09:48:47 +11:00
Matthew Honnibal
eaccbcda0f Fix bug in pos_tag.py script 2016-03-16 06:04:14 +11:00
Gus Hecht
feefe64ab2 added batch_size as keyword argument
There's probably a better default value....
2016-03-10 14:16:34 -08:00
Oleg Zdornyy
a774131671 Added reloadable English() example for inv. count 2016-03-09 19:35:55 -08:00
Henning Peters
37a7020904 move displacy to its own subdomain 2016-02-19 14:03:52 +01:00
Matthew Honnibal
dc61056183 * Fix parallel_parse script 2016-02-07 02:56:16 +01:00
Matthew Honnibal
18eaa44835 * Add parallel_parse example 2016-02-07 02:53:44 +01:00
Matthew Honnibal
9b303e158e * Add example file to show answer to Issue #252 2016-02-07 01:13:40 +01:00
Matthew Honnibal
bf5a7cc598 * Update train_pos_tagger example 2016-02-02 22:30:00 +01:00
Matthew Honnibal
a9fc35d3bf * Fix sentence iteration bug in pos_tag example 2015-12-05 20:25:12 +01:00
Matthew Honnibal
8c170dc3d7 * Add example multi-processing enabled POS tagging script 2015-12-04 21:43:03 +01:00
Matthew Honnibal
9d95c26179 * Add simple deep feed-forward neural network text classification example. 2015-10-19 23:44:49 +11:00
Matthew Honnibal
0bc48e82d0 * Add _handler to resolve Issue #123 2015-10-15 02:44:23 +11:00
Matthew Honnibal
5af4b62fe7 * Filter out phrases that consist of common, lower-case words. 2015-10-09 12:47:43 +11:00
Matthew Honnibal
4bbc8f45c6 * Fix multi word matcher 2015-10-09 02:02:37 +11:00
Matthew Honnibal
5890682ed1 * Fix multi_word_matches script 2015-10-08 13:59:32 +11:00
Matthew Honnibal
63bd17135f * Whitespace 2015-10-06 10:37:07 +11:00
Matthew Honnibal
e7c31f7eae * Tweak information extraction example 2015-10-06 10:35:49 +11:00
Matthew Honnibal
e4ba8a4b5a * Add multi word matching code 2015-10-06 09:06:52 +11:00
Matthew Honnibal
262c215b55 examples/information_extraction.py
* Add very simple information extraction snippet.
2015-10-01 22:27:57 +10:00
Matthew Honnibal
a6e2a44283 * Add clarifying comment 2015-09-27 18:17:41 +10:00
Matthew Honnibal
8c3ec4c140 * Add comments clarifying output of matcher example 2015-09-27 18:13:45 +10:00
Matthew Honnibal
c17e2f2f20 * Add examples for Matcher, to answer Issue #105. TODO: Integrate into docs properly. 2015-09-27 18:08:00 +10:00
Matthew Honnibal
d4a013ccab * Add a simple example POS tagger script 2015-09-22 15:56:14 +10:00
Matthew Honnibal
692a8d3e3c * Begin rewriting twitter_filter examples 2015-08-22 22:12:26 +02:00
Matthew Honnibal
5ee645d742 * Add Displacy mixin. Needs to pull the data from the web 2015-08-14 18:59:08 +02:00