mirror of https://github.com/explosion/spaCy.git synced 2025-04-02 16:24:13 +03:00
Commit Graph

8789 Commits

Author SHA1 Message Date
Ines Montani
7806deceb4 Fix typo (closes ) [ci skip] 2018-10-01 10:49:29 +02:00
John Stewart
9faea3ff10 Update Keras Example for (Parikh et al, 2016) implementation ()
* bug fixes in keras example

* created contributor agreement

* baseline for Parikh model

* initial version of parikh 2016 implemented

* tested asymmetric models

* fixed grievous error in normalization

* use standard SNLI test file

* begin to rework parikh example

* initial version of running example

* start to document the new version

* start to document the new version

* Update Decompositional Attention.ipynb

* fixed calls to similarity

* updated the README

* import sys package duh

* simplified indexing on mapping word to IDs

* stupid python indent error

* added code from https://github.com/tensorflow/tensorflow/issues/3388 for tf bug workaround
2018-10-01 10:28:45 +02:00
Ioannis Daras
405a826436 Correct error in spacy universe docs concerning spacy-lookup () 2018-10-01 10:24:50 +02:00
Filipe Caixeta
6c498f9ff4 Update Portuguese Language ()
* Add words to portuguese language _num_words

* Add words to portuguese language _num_words

* Portuguese - Add/remove stopwords, fix tokenizer, add currency symbols

* Extended punctuation and norm_exceptions in the Portuguese language
2018-09-29 09:51:45 +02:00
Matthew Honnibal
05b6103a0c Try to fix version pin for msgpack-numpy 2018-09-28 14:07:00 +02:00
Matthew Honnibal
6430b1fe64 Restore encoding arg on msgpack-numpy 2018-09-27 15:58:21 +02:00
Matthew Honnibal
276aa83d1a Require older msgpack-numpy 2018-09-27 15:34:24 +02:00
Matthew Honnibal
2ac69facc6 Fix Python 2 test failure 2018-09-27 15:34:16 +02:00
Matthew Honnibal
72778375fb Merge branch 'master' of https://github.com/explosion/spaCy 2018-09-27 13:54:49 +02:00
Matthew Honnibal
96fe314d8d Fix bug when too many entity types. Fixes 2018-09-27 13:54:34 +02:00
Suraj Rajan
bbdc6456c6 Set up dependency tree pattern matching skeleton () 2018-09-27 13:27:18 +02:00
Matthew Honnibal
8809dc4514 Remove deprecated encoding argument to msgpack 2018-09-27 12:56:23 +02:00
Matthew Honnibal
bae6b3e2b3 Merge branch 'master' of https://github.com/explosion/spaCy 2018-09-27 12:50:31 +02:00
Ines Montani
71cdbeada7 Revert "Also include lowercase norm exceptions"
This reverts commit 70f4e8adf3.
2018-09-27 12:29:25 +02:00
Charles-Axel Dein
014dd47c70 Add jupyter=True to displacy.render in documentation () 2018-09-27 12:28:04 +02:00
Przemysław Hojnacki
966b583d5e agreement of contributor, may I introduce a tiny pl language contribution ()
* Contributors agreement

* Contributors agreement

* Contributors agreement
2018-09-27 12:25:22 +02:00
Charles-Axel Dein
94ad3c55f1 Add charlax's contributor agreement () 2018-09-27 12:24:42 +02:00
darindf
8227566805 Fix error ()
* Fix error
ValueError: cannot resize an array that references or is referenced
by another array in this way.  Use the resize function

* added spaCy Contributor Agreement
2018-09-26 21:31:03 +02:00
Ines Montani
5e0dfb34fa Merge branch 'master' of https://github.com/explosion/spaCy 2018-09-26 11:13:58 +02:00
Ines Montani
70f4e8adf3 Also include lowercase norm exceptions 2018-09-25 12:22:02 +02:00
Keshan
9a016d17c2 Adding basic support for Sinhala language. ()
* adding Sinhala language package, stop words, examples and lex_attrs.

* Adding contributor agreement

* Updating contributor agreement
2018-09-25 12:18:25 +02:00
Pranshu Jethmalani
9fd27d777e Fix typo () [ci skip]
Fixed typo on line 6 "regcognizer --> recognizer"
2018-09-25 12:12:40 +02:00
Ines Montani
3c4e3ade30 Fix typo (closes ) 2018-09-21 10:45:11 +02:00
mauryaland
68b3c544d5 Adding French hyphenated first name () 2018-09-21 10:38:13 +02:00
John Stewart
2d15859d2a Fixed spaCy+Keras example ()
* bug fixes in keras example

* created contributor agreement
2018-09-15 13:06:39 +02:00
Andrew Ongko
81564cc4e8 Update Indonesian model ()
* adding e-KTP in tokenizer exceptions list

* add exception token

* removing lines with containing space as it won't matter since we use .split() method in the end, added new tokens in exception

* add tokenizer exceptions list

* combining base_norms with norm_exceptions

* adding norm_exception

* fix double key in lemmatizer

* remove unused import on punctuation.py

* reformat stop_words to reduce number of lines, improve readability

* updating tokenizer exception

* implement is_currency for lang/id

* adding orth_first_upper in tokenizer_exceptions

* update the norm_exception list

* remove bunch of abbreviations

* adding contributors file
2018-09-14 12:30:32 +02:00
Filipe Caixeta
fe515085f3 Add words to portuguese language _num_words ()
* Add words to portuguese language _num_words

* Add words to portuguese language _num_words
2018-09-14 12:30:16 +02:00
Ines Montani
5001d31be6 Don't set stop word in example (closes ) [ci skip] 2018-09-12 15:36:51 +02:00
Ines Montani
4e89cfaae1 Fix dependency scheme docs (closes ) [ci skip] 2018-09-12 15:32:26 +02:00
Ines Montani
0729d1edca Fix formatting 2018-09-12 15:32:08 +02:00
Ines Montani
907df53904 Add multi-threading note to Language.pipe (resolves ) [ci skip] 2018-09-12 15:03:30 +02:00
Ines Montani
885691a7ab Describe converters more explicitly (see ) 2018-09-12 14:53:03 +02:00
Grivaz
aeba99ab0d Introduces a bulk merge function, in order to solve issue ()
* Fix comment

* Introduce bulk merge to increase performance on many span merges

* Sign contributor agreement

* Implement pull request suggestions
2018-09-10 16:41:42 +02:00
tyburam
476472d181 Lex _attrs for polish language ()
* Signed spaCy contributor agreement

* Added polish version of english lex_attrs
2018-09-10 11:53:57 +02:00
Sainath Adapa
77139bc03c Basic support for Telugu language () 2018-09-10 11:53:18 +02:00
Maxim Kupfer
97e2874225 added contributor agreement for mbkupfer () 2018-09-10 11:32:03 +02:00
Maxim Kupfer
cebe50b5b8 Remove ')' for clarity ()
Sorry, don't mean to be nitpicky, I just noticed this when going through the CLI and thought it was a quick fix. That said, if this was intentional, then please let me know.
2018-09-10 11:31:49 +02:00
Piotr Żelasko
bdb2165bd1 Less norm computations in token similarity ()
* Less norm computations in token similarity

* Contributor agreement
2018-09-05 21:50:23 +02:00
Aniruddha Adhikary
4530ddcc51 update bengali token rules for hyphen and digits () 2018-09-05 21:49:00 +02:00
Nathaniel J. Smith
26849874ad When calling getoption() in conftest.py, pass a default option ()
* When calling getoption() in conftest.py, pass a default option

This is necessary to allow testing an installed spacy by running:

  pytest --pyargs spacy

* Add contributor agreement
2018-09-03 09:57:52 +02:00
Ines Montani
e9022f7b33 Remove docstrings for deprecated arguments (see ) 2018-08-26 14:23:13 +02:00
Ines Montani
559f4139e3 Add FAC to spacy.explain (resolves ) 2018-08-26 14:13:50 +02:00
Steve Sharp
ca747f58a4 Update _install.jade ()
Typo fix: "models" -> "model"
2018-08-22 13:16:04 +02:00
Arya Prabhudesai
db2c2b286c Create aryaprabhudesai.md () 2018-08-20 18:56:14 +02:00
Ines Montani
c0fa9903f4 Update model directory JS [ci skip]
Prevent the default release URL from being overwritten and add license type
2018-08-16 16:54:50 +02:00
Ines Montani
fd9d175a53 Update live code [ci skip] 2018-08-15 15:28:48 +02:00
Matthew Honnibal
13fa550b36 Merge branch 'master' of https://github.com/explosion/spaCy 2018-08-14 02:32:01 +02:00
Ioannis Daras
fe94e696d3 Optimize Greek language support () 2018-08-14 02:31:32 +02:00
Wojciech Łukasiewicz
3953e967a0 Use correct variable name in the examples ()
* correct naming

* add contributor agreement
2018-08-13 22:21:24 +02:00
Matthew Honnibal
85000ea13b Increment version to 2.0.13.dev2 2018-08-10 00:43:55 +02:00