ines
3a321e79ac
Merge branch 'master' into develop
2018-07-10 13:49:08 +02:00
ines
71bfc92913
Exclude models for non-stable versions [ci skip]
2018-07-10 13:44:55 +02:00
ines
b5200962c0
Adjust formatting [ci skip]
2018-07-09 18:35:46 +02:00
Alex Villarreal
bd35bf7f09
Guidance to handle binary files in git in Windows ( #2526 )
...
Adds guidance on what to do if users encounter the error described in [1634](https://github.com/explosion/spaCy/issues/1634 ), which probably only happens in Windows environments.
2018-07-09 18:31:37 +02:00
ines
f575b01595
Update language and license meta [ci skip]
2018-07-04 15:09:36 +02:00
ines
63666af328
Merge branch 'master' into develop
2018-07-04 14:52:25 +02:00
Matthew Honnibal
a85620a731
Note CoreNLP tokenizer correction on website
2018-07-02 11:35:31 +02:00
ines
06c6dc6fbc
Update Juniper [ci skip]
2018-06-28 11:48:17 +02:00
Nipun Sadvilkar
741ba80bd5
Train model command n_iteration 20 -> 30 ( #2454 )
...
In source code `train.py` default Number of iterations is 30
2018-06-18 11:57:08 +02:00
ines
53a2bc8c8d
Only scroll sidebar item into view if needed [ci skip]
2018-06-12 10:58:50 +02:00
ines
65713a6593
Increment versions [ci skip]
2018-06-12 10:49:50 +02:00
Ines Montani
968f6f0bda
💫 Document Cython API ( #2433 )
...
## Description
This PR adds the most relevant documentation of spaCy's Cython API.
(Todo for when we publish this: rewrite `/api/#section-cython` and `/api/#cython` to `/api/cython#conventions`.)
### Types of change
docs
## Checklist
<!--- Before you submit the PR, go over this checklist and make sure you can
tick off all the boxes. [] -> [x] -->
- [x] I have submitted the spaCy Contributor Agreement.
- [x] I ran the tests, and all new and existing tests passed.
- [x] My changes don't require a change to the documentation, or if they do, I've added all required information.
2018-06-11 17:47:46 +02:00
GolanLevy
72d7e80f94
adding a missing apostrophe ( #2436 )
2018-06-11 17:47:24 +02:00
ines
778e5f4da3
Merge branch 'master' into develop
2018-06-11 00:38:04 +02:00
himkt
57311d5d47
replace janome with mecab in the documentation and the test ( #2415 )
...
* Add links to Reddit data (see #2401 )
* replace janome with mecab in the documentation and the test
* add the assignment
2018-06-11 00:33:13 +02:00
ines
effb55d591
Adjust formatting [ci skip]
2018-06-11 00:29:13 +02:00
Nathan Breit
ba6d2cf393
Add EpiTator to Universe ( #2429 )
2018-06-11 00:24:13 +02:00
himkt
1a568f2e08
fix wrong documentations ( #2423 )
2018-06-11 00:21:06 +02:00
Bohdan Moskalevskyi
d66292f767
fix UD data file extensions ( #2425 )
...
* fix UD data files extension
* add contributor agreement for msklvsk
2018-06-08 14:26:11 +02:00
ines
a0017e4909
Merge branch 'master' into develop
2018-05-30 14:10:47 +02:00
ines
0baaf836cf
Update formatting [ci skip]
2018-05-30 13:32:49 +02:00
ines
3913e18201
Add self-attentive-parser to universe (see #59 )
2018-05-30 13:31:28 +02:00
ines
4a62486340
Merge branch 'master' into develop
2018-05-30 13:01:01 +02:00
ines
605c663a4c
Fix HTML merger examples (see #2390 )
2018-05-30 12:22:32 +02:00
ines
d0b16aa014
Update list of languages
2018-05-26 18:56:26 +02:00
Samuel Pouyt
5f988b8e9c
Update _custom.jade ( #2372 )
...
It seems based on the doc and trying out that the `en` or `[lang]` is missing from the `spacy model-init`
2018-05-26 18:17:12 +02:00
ines
d84a830d79
Merge branch 'master' of https://github.com/explosion/spaCy
2018-05-26 17:57:05 +02:00
ines
fb923b31ea
Fix bad HTML example (see #2376 ) and turn it into section on matcher + components
...
Avoid problems caused by merging while matching (e.g. index errors). Creating a Matcher component also better reflects the recommended best practices.
2018-05-26 17:57:02 +02:00
Shantam Raj
592834183a
corrected spelling ( #2359 )
...
changed **interpretted** to **interpreted**
2018-05-24 13:29:52 +02:00
ines
8adb967e0c
Fix from source quickstart instructions for Windows
...
See: https://stackoverflow.com/a/50478036/6400719
2018-05-24 12:42:16 +02:00
Shantam Raj
1a4682dd0b
Update _training.jade ( #2340 )
...
* Update _training.jade
Correcting grammar. Replacing "The" with "To".
* Create armsp.md
* Update armsp.md
2018-05-21 11:09:33 +02:00
ines
ff1082d8e4
Add version tag in CLI docs [ci skip]
2018-05-21 01:17:49 +02:00
Ines Montani
d4cc736b7c
💫 Improve model downloads: check for existing install, customise pip and use requests library again ( #2346 )
...
* Go back to using requests instead of urllib (closes #2320 )
Fewer dependencies are good, but this one was simply causing too many other problems around SSL verification and Python 2/3 compatibility. requests is a popular enough package that it's okay for spaCy to depend on it – and this will hopefully make model downloads less flakey.
* Only download model if not installed (see #1456 )
Use #egg=model==version to allow pip to check for existing installations. The download is only started if no installation matching the package/version is found. Fixes a long-standing inconvenience.
* Pass additional options to pip when installing model (resolves #1456 )
Treat all additional arguments passed to the download command as pip options to allow user to customise the command. For example:
python -m spacy download en --user
* Add CLI option to enable installing model package dependencies
* Revert "Add CLI option to enable installing model package dependencies"
This reverts commit 9336ffe695
.
* Update documentation
2018-05-20 20:26:56 +02:00
vishnumenon
ae3719ece5
Fix the code for FACILITIY entities ( #2324 )
...
* Fix the code for FACILITIY entities
As far as I can tell, the default models all use "FAC" rather than "FACILITY"
* Added my Contributor Agreement
* Rename vishnumenon to vishnumenon.md
2018-05-12 15:19:17 +02:00
ines
ac25bc4016
Add docs section on sentence segmentation [ci skip]
2018-05-07 21:25:20 +02:00
ines
14148cd147
Fix formatting and wording
2018-05-07 21:24:35 +02:00
ines
f803da609f
Add scattertext [ci skip]
2018-05-07 19:10:23 +02:00
ines
c9547b7b8b
Update Juniper (see #2293 )
2018-05-03 15:36:02 +02:00
Alex Villarreal
647f2544c5
Fix code sample for span.set_extension ( #2286 )
2018-05-03 00:39:22 +02:00
Alex Villarreal
13d562e1a4
Fix code sample for Doc.set_extension ( #2282 )
...
* Fix code sample for `set_extension`
The previous sample code for `set_extension` fails the assertion at the end, because `city_getter` it checked if the whole document text matches any of the city names. Now it checks if any of the city names is contained in the document text.
* Contributor agreement
2018-05-02 10:16:05 +02:00
Shirish Kadam
d98a90440f
Added Adam project to spaCy Universe ( #2275 )
...
* Added 5hirish to contributors
* Added Adam Qas Project to spaCy Universe
* Remove $ from code example
2018-04-30 22:25:01 +02:00
ines
56e7faf16b
Fix spacing
2018-04-30 22:24:40 +02:00
ines
6efb4cdf88
Use Juniper and tidy up
2018-04-30 18:48:35 +02:00
ines
45bb8d75a5
Fix overflow issues on small screens [ci skip]
2018-04-29 03:17:36 +02:00
Ines Montani
49cee4af92
💫 Interactive code examples, spaCy Universe and various docs improvements ( #2274 )
...
* Integrate Python kernel via Binder
* Add live model test for languages with examples
* Update docs and code examples
* Adjust margin (if not bootstrapped)
* Add binder version to global config
* Update terminal and executable code mixins
* Pass attributes through infobox and section
* Hide v-cloak
* Fix example
* Take out model comparison for now
* Add meta text for compat
* Remove chart.js dependency
* Tidy up and simplify JS and port big components over to Vue
* Remove chartjs example
* Add Twitter icon
* Add purple stylesheet option
* Add utility for hand cursor (special cases only)
* Add transition classes
* Add small option for section
* Add thumb object for small round thumbnail images
* Allow unset code block language via "none" value
(workaround to still allow unset language to default to DEFAULT_SYNTAX)
* Pass through attributes
* Add syntax highlighting definitions for Julia, R and Docker
* Add website icon
* Remove user survey from navigation
* Don't hide GitHub icon on small screens
* Make top navigation scrollable on small screens
* Remove old resources page and references to it
* Add Universe
* Add helper functions for better page URL and title
* Update site description
* Increment versions
* Update preview images
* Update mentions of resources
* Fix image
* Fix social images
* Fix problem with cover sizing and floats
* Add divider and move badges into heading
* Add docstrings
* Reference converting section
* Add section on converting word vectors
* Move converting section to custom section and fix formatting
* Remove old fastText example
* Move extensions content to own section
Keep weird ID to not break permalinks for now (we don't want to rewrite URLs if not absolutely necessary)
* Use better component example and add factories section
* Add note on larger model
* Use better example for non-vector
* Remove similarity in context section
Only works via small models with tensors so has always been kind of confusing
* Add note on init-model command
* Fix lightning tour examples and make excutable if possible
* Add spacy train CLI section to train
* Fix formatting and add video
* Fix formatting
* Fix textcat example description (resolves #2246 )
* Add dummy file to try resolve conflict
* Delete dummy file
* Tidy up [ci skip]
* Ensure sufficient height of loading container
* Add loading animation to universe
* Update Thebelab build and use better startup message
* Fix asset versioning
* Fix typo [ci skip]
* Add note on project idea label
2018-04-29 02:06:46 +02:00
ines
a512fa60ef
Remove upcoming option from docs for now
2018-04-28 23:32:18 +02:00
ines
6fb6371670
Add collapse_phrases option to displacy ( closes #2266 )
2018-04-28 23:06:50 +02:00
Matt Upson
87cc6b3599
Add missing comma to NN example in docs ( #2255 )
...
Also add a completed contributor agreement.
2018-04-28 14:56:00 +02:00
ines
4a3bea00c7
Update resources [ci skip]
2018-04-26 22:10:34 +02:00
Pradeep Kumar Tippa
df389e5b74
spacy-101 vocab doc giving valid variable names ( #2236 )
2018-04-18 14:54:26 -07:00
ines
ce63f8997b
Update init-model docs
2018-04-10 21:42:54 +02:00
ines
0e847d7fe5
Fix typo
2018-04-09 14:51:14 +02:00
ines
de137fba84
Add TensorBoard examples to examples overview [ci skip]
2018-04-03 16:01:52 +02:00
ines
6d87b28f15
Add Vietnamese to language overview [ci skip]
2018-04-03 16:01:36 +02:00
ines
9615ed5ed7
Update emoji/hashtag matcher example ( resolves #2156 ) [ci skip]
2018-03-28 18:41:28 +02:00
ines
ce6071ca89
Remove ftfy dependency and update docs
2018-03-28 12:09:42 +02:00
ines
5ecc60cf3b
Add book to resources [ci skip]
2018-03-24 17:12:56 +01:00
ines
53680642af
Port over docs changes [ci skip]
2018-03-24 17:12:48 +01:00
Matthew Honnibal
f9f46e5a07
Revert matcher fixes from GregDubbin
2018-02-18 10:59:28 +01:00
ines
612c79a4f5
Update first matcher example and match_id ( resolves #1989 )
2018-02-17 11:57:38 +01:00
ines
ca56fb53d1
Add user survey to navigation [ci skip]
2018-02-15 12:14:30 +01:00
ines
cab5b775e7
Document ENT_TYPE matcher attribute [ci skip]
2018-02-15 12:14:19 +01:00
Pradeep Kumar Tippa
416cd021ce
Added TAG from spacy symbols which used below
2018-02-09 19:16:59 +05:30
Pradeep Kumar Tippa
01cc9cd9c0
assert statement syntax fix in doc
2018-02-09 19:16:25 +05:30
Pradeep Kumar Tippa
a78062e466
Merge remote-tracking branch 'upstream/master' into web-doc-patches
2018-02-09 19:13:19 +05:30
ines
ab33e274f5
Add more details on symlink error & Windows solution ( resolves #1941 ) [ci skip]
2018-02-09 10:43:33 +01:00
ines
8eaa934382
Merge branch 'master' of https://github.com/explosion/spaCy
2018-02-09 10:23:36 +01:00
ines
e9f67be04d
Fix regex flag matcher example ( resolves #1950 )
2018-02-09 10:23:33 +01:00
ines
fc4ae04c55
Document LENGTH attribute in matcher
2018-02-09 10:23:03 +01:00
Pradeep Kumar Tippa
8a7467b26e
Merge remote-tracking branch 'upstream/master' into web-doc-patches
2018-02-09 13:54:26 +05:30
Orion Montoya
24af6375db
update link to Honnibal and Johnson 2015
...
aclweb.org is throwing a gateway timeout on the link as `https`+`aclweb.org`, but is fine with `https`+`www.aclweb.org` (also with `http`+`aclweb.org`, but let's keep it in `https`, shall we?
2018-02-08 10:49:09 -08:00
Pradeep Kumar Tippa
03113d6779
Fixing navigating parse tree doc under dependency parse
2018-02-08 19:34:15 +05:30
ines
a3b965b29d
Remove UPPER from Matcher attributes docs ( resolves #1949 )
2018-02-08 11:29:27 +01:00
ines
696ae87b47
Fix whitespace
2018-02-08 11:28:54 +01:00
ines
26bc75134d
Fix typo
2018-02-08 11:28:44 +01:00
Pradeep Kumar Tippa
da9d687e75
Fixing typo from taining to training
2018-02-07 16:49:25 +05:30
Pradeep Kumar Tippa
ed7d268e93
Fixing vocab doc
...
Replacing "like" with "love", coffee suffix should be "fee" but not "ffe"
2018-02-07 14:55:12 +05:30
ines
f377c483e4
Add note on manual entity order in displaCy [ci skip]
2018-02-07 01:08:42 +01:00
ines
58eb178667
Update Doc.char_span docs [ci skip]
2018-02-07 01:08:30 +01:00
sayf eddine hammemi
86e7727855
Fix typo in the word build.
2018-02-04 20:48:45 +01:00
ines
901bc0e85f
Add Persian to list of languages [ci skip]
2018-02-01 04:47:34 +01:00
Hassan Shamim
a0b912c528
fix broken link to test suite models
2018-01-30 15:01:01 -08:00
greg
daefed0a34
Correct documentation of '+' and '*' ops
2018-01-22 15:55:44 -05:00
ines
67ba73351d
Fix typo and use better serialization example ( resolves #1851 ) [ci skip]
2018-01-16 18:42:03 +01:00
ines
7943a8e90c
Add spacy-lookup by @mpuig [ci skip]
2018-01-16 00:28:46 +01:00
ines
5684206154
Add LanguageCrunch by @artpar [ci skip]
2018-01-15 16:14:26 +01:00
Mateusz Tatusko
dda0e58c11
Update _pos-tags.jade
...
really small changes to English tags description, but might help some people while working on projects
1) -PRB- should be -RRB- instead
2) space gets tagged as _SP, and not SP
2018-01-15 12:01:51 +09:00
ines
0536e91564
Add note on Tagger.tag_names vs. Tagger.labels (see #1666 ) [ci skip]
2018-01-14 14:37:19 +01:00
ines
bbee48080d
Clarify hyperparameters and alias usage in spacy train ( resolves #1838 ) [ci skip]
2018-01-14 14:32:50 +01:00
ines
4daba3abda
Add regex section to rule-based matching docs (see #1567 , #1833 ) [ci skip]
2018-01-14 14:22:13 +01:00
Ines Montani
36f426fe0a
Merge pull request #1808 from fucking-signup/master
...
Fix issue #1769
2018-01-12 21:12:02 +00:00
ines
cfac5b955f
Fix aligment issues with newsletter signup form
2018-01-12 22:06:44 +01:00
ines
65babd9e2e
Fix typo, formatting and operator descriptions ( resolves #1820 )
2018-01-12 22:06:27 +01:00
Matthew Honnibal
a2a06dce24
Merge pull request #1792 from explosion/feature-improve-model-download
...
💫 Improve model downloading and linking
2018-01-11 20:02:08 +01:00
Ines Montani
11676b47f2
Merge pull request #1828 from wrathagom/patch-1
...
Small Grammar Fix to _basics.jade
2018-01-11 17:27:23 +00:00
pbnsilva
4cfd848bc3
Fixes typo in PhraseMatcher API docs
2018-01-11 17:35:59 +01:00
Caleb M. Keller
e68f6bf890
Small Grammar Fix to _basics.jade
...
Fixed an incorrect word order.
2018-01-11 09:26:47 -05:00
Matthew Honnibal
7ca49c2061
Merge branch 'master' into feature-improve-model-download
2018-01-10 18:21:55 +01:00
Kit
db6e4ba72e
Update code example according to new changes
2018-01-08 03:45:56 +01:00
ines
ef210c73dd
Update cli.download and cli.validate docs
2018-01-03 21:34:03 +01:00
ines
cc9df10e69
Document util.set_lang_class (see #1737 )
2018-01-03 20:13:25 +01:00
Ines Montani
874f174ab1
Merge pull request #1790 from nirdesh37/patch-1
...
Update goldparse.jade
2018-01-03 18:37:07 +00:00
ines
1fa6ba8130
Fix Doc.from_array example to make it work (see #1527 )
2018-01-03 16:59:38 +01:00
ines
49635350f0
Add .from_disk() to pipeline component init example ( resolves #1728 )
2018-01-03 16:50:24 +01:00
ines
95063ba26b
Update tests documentation ( resolves #1781 )
2018-01-03 16:42:26 +01:00
nirdesh37
67fdceed6a
Update goldparse.jade
2018-01-03 17:25:21 +05:30
Martin Andrews
e4355dade2
Documentation example fix : token.head needs '==' rather than 'is'
...
(similar change to #1689 , it seems).
2017-12-18 18:12:10 +08:00
Kristofer Berggren
1cb8c997fb
Fix typo Span -> Token on Token API page
...
Change Span.vector_norm to Token.vector_norm.
2017-12-17 20:32:19 +08:00
Ines Montani
4befd8bd44
Merge pull request #1724 from mpuels/patch-7
...
doc: Fix minor mistakes
2017-12-17 12:09:17 +00:00
ines
21482b391b
Fix head
2017-12-16 13:48:19 +01:00
mpuels
b3df2a2ffd
doc: Fix minor mistakes
2017-12-14 20:55:59 +01:00
mpuels
3f7bedadee
doc: Fix minor mistakes
2017-12-13 11:37:24 +01:00
ines
24e80c51b8
Document init-model command
2017-12-07 10:14:37 +01:00
mpuels
e3af19a076
doc: Replace 'is not' with '!=' in code example
...
The function `dependency_labels_to_root(token)` defined in section *Get syntactic dependencies* does not terminate. Here is a complete example:
import spacy
nlp = spacy.load('en')
doc = nlp("Apple and banana are similar. Pasta and hippo aren't.")
def dependency_labels_to_root(token):
"""Walk up the syntactic tree, collecting the arc labels."""
dep_labels = []
while token.head is not token:
dep_labels.append(token.dep)
token = token.head
return dep_labels
dep_labels = dependency_labels_to_root(doc[1])
dep_labels
Replacing `is not` with `!=` solves the issue:
import spacy
nlp = spacy.load('en')
doc = nlp("Apple and banana are similar. Pasta and hippo aren't.")
def dependency_labels_to_root(token):
"""Walk up the syntactic tree, collecting the arc labels."""
dep_labels = []
while token.head != token:
dep_labels.append(token.dep)
token = token.head
return dep_labels
dep_labels = dependency_labels_to_root(doc[1])
dep_labels
The output is
['cc', 'nsubj']
2017-12-06 20:08:42 +01:00
mpuels
82e575ebfb
doc: Fix assert statement in Lightning Tour
...
Python 3 throws an error message on the original assert statement. Also, according to the Python documentation regarding the assert statement (https://docs.python.org/3/reference/simple_stmts.html#the-assert-statement ), `assert` takes at least one argument and at most two. In the two-argument form the second argument is meant as an error message to be displayed when the assertion fails. I don't think this is intended in this case.
2017-12-06 16:40:51 +01:00
mpuels
662601f01c
doc: Add missing *-operator to nlp.disable_pipes()
...
I'm using SpaCy version 2.0.3. If I don't use the *-operator in the example, Python throws an error message. With the operator it works fine. Also according to the documentation of the function `nlp.disable_pipes()`, it expects one or more strings as arguments and not one argument being a list of strings.
2017-12-06 15:26:43 +01:00
ines
b078e276e6
Document offsets_from_biluo_tags
2017-12-06 13:40:51 +01:00
ines
fb663f9b7d
Add Russian to list of languages
2017-12-06 13:40:32 +01:00
ines
58a19518cf
Merge branch 'master' of https://github.com/explosion/spaCy
2017-12-05 13:17:58 +01:00
ines
7ade336ab7
Add "Unknown locale" issue to troubleshooting guide (see #1684 , #1641 , #1517 )
2017-12-05 13:17:55 +01:00
Mark Dodwell
9d4c185860
Fix link to CLEAR Style dependency labels PDF
2017-12-04 23:28:06 -08:00
ines
40638b7cdf
Update resources
2017-12-02 04:16:03 +01:00
ines
9ea8a7cf0c
Add spacy_cld to extensions
2017-12-01 23:21:33 +01:00
ines
8d3f29322f
Add spacy_hunspell to resources (see #315 )
2017-11-29 09:33:22 +01:00
atomobianco
f6a82da907
Corrected char index instead of token index
...
Changed the index used to add the label because `displacy.render` apparently uses char index
2017-11-26 23:55:25 +01:00
ines
bda6e2a816
Add training example to lightning tour
2017-11-26 18:04:18 +01:00
ines
89f8b1fba0
Update example documents
2017-11-26 18:04:04 +01:00
ines
65d66b81f1
Fix typo
2017-11-26 18:03:44 +01:00
ines
e4ee666be5
Fix biluo_tags_from_offsets example and docs
2017-11-26 16:37:32 +01:00
ines
434030e0d0
Fix requirements.txt example (see #1638 )
2017-11-26 15:53:19 +01:00
Matthew Honnibal
6bc9917a0e
Another small fix to component docs
2017-11-23 11:47:20 +01:00
markulrich
c9b63c0dfc
Use correct local parameter in example MyComponent (and added markulrich.md contributor file)
2017-11-22 15:59:08 -08:00
ines
4f7e64e371
Update resources
2017-11-18 02:53:00 +01:00
ines
c3051e95f7
Add note on attribute extension defaults ( resolves #1587 )
2017-11-17 19:14:29 +01:00
ines
954f8cc6d1
Update syntax theme (should move the modifications out to an extension sometime)
2017-11-17 19:13:53 +01:00
Raphaël Bournhonesque
a0793fd4cc
Fix typo
2017-11-17 17:57:55 +01:00
Martino Mensio
ce1aade41e
small typo on docs
2017-11-17 16:20:22 +01:00
pavillet
ad2935f0c3
Update _spacy.jade
...
Doc example gives 'object is not subscriptable' error.
Correcting as an attribuet
2017-11-17 00:02:20 +01:00
ines
40c4e8fc09
Remove "optional" from dev_data arg and add more info (see #1578 )
2017-11-14 20:26:05 +01:00
KMLDS
d5b20ac3b6
Update span.jade
2017-11-13 19:27:20 -05:00
ines
bc79274706
Fix typo
2017-11-13 17:00:03 +01:00
ines
7a7b01feb1
Update links
2017-11-13 08:30:06 +01:00
ines
b3e502a076
Add videos section to resources
2017-11-13 08:29:57 +01:00
ines
f2b6b98b75
Fix typo in code example ( resolves #1556 )
2017-11-13 08:29:16 +01:00
ines
ceb2c596f1
Update conda details
2017-11-11 13:07:00 +01:00
ines
4a97def06a
Update features
2017-11-10 19:05:10 +01:00
ines
dea5636d6c
Fix broken links
2017-11-10 13:06:38 +01:00
Wahib Faizi
0da56f8ef8
Fix typo. Add missing '='.
2017-11-10 14:51:24 +03:00
ines
4c5d2c80d5
Re-add python -m to commands, too brittle :( (see #1536 )
2017-11-10 02:30:55 +01:00
ines
ee5697a1cd
Fix training tips
2017-11-10 00:19:42 +01:00
ines
6ae0ebfa3a
Update training tips
2017-11-10 00:17:10 +01:00
ines
b20779bac4
Update resources
2017-11-09 23:05:37 +01:00
ines
ed84688935
Remove old link
2017-11-09 15:34:12 +01:00
Ines Montani
e5b9ccdb5c
Merge pull request #1526 from mcsalgado/fix-typos
...
fix typos
2017-11-09 15:33:55 +01:00
Victor Salgado
fe1d969d5f
fix typos
2017-11-09 10:55:13 -02:00
Mathias Deschamps
25b26f0d64
Fix similarity visual
...
Doc was showing similarity when dissimilar
2017-11-09 11:08:26 +01:00
ines
98767122a7
Fix typos
2017-11-09 04:13:03 +01:00
ines
e87eb11beb
Update package.json
2017-11-09 04:12:57 +01:00
ines
33b84f4c39
Change clear_vectors to reset_vectors ( resolves #1516 )
2017-11-08 18:11:23 +01:00
ines
97a5892347
Document Vectors.resize() and update v2 incompatibilities ( resolves #1514 )
2017-11-08 17:11:11 +01:00
ines
c0a7a32bf8
Add en.stop_words change to v2 docs ( resolves #1512 )
2017-11-08 16:30:46 +01:00
ines
9b09b6b0cd
Fix formatting
2017-11-08 16:30:23 +01:00
ines
f0bdfb4471
Fix vector listing for core sm models in list overview (see #1513 )
2017-11-08 16:24:27 +01:00
ines
94cd3d51db
Update v2 docs and model info
...
Take out speed tables until we fix our benchmark tests on CPU and GPU
2017-11-08 11:43:00 +01:00
ines
14f97cfd20
Add note on stream processing to migration guide (see #1508 )
2017-11-08 01:53:36 +01:00
ines
5d1162cf21
Improve nlp.update / training loop overview (see #1507 )
2017-11-08 01:17:42 +01:00
ines
2229aba71c
Update website
2017-11-08 01:06:30 +01:00
ines
1768703e1c
Update website for v2.0
2017-11-07 14:48:17 +01:00
ines
e4a05385d6
Update docs
2017-11-07 12:33:43 +01:00
ines
a4662a31a9
Move model package templates to cli.package and update docs
2017-11-07 12:15:35 +01:00
ines
a09c096d3c
Get docs ready for v2.0.0
2017-11-07 12:00:43 +01:00
ines
173b1551af
Update examples
2017-11-07 01:22:30 +01:00
ines
c37837cad1
Update training docs
2017-11-07 01:06:31 +01:00
ines
c7bda87b17
Update model docs and add tips section
2017-11-07 01:05:37 +01:00
ines
a1261e8632
Fix formatting
2017-11-07 01:05:30 +01:00
ines
912c1b1821
Document "simple training style"
2017-11-07 00:23:19 +01:00
ines
ad6438ccdf
Update aside labels and under construction mixin
2017-11-07 00:23:00 +01:00
ines
8fb48b9b91
Update and document new util functions
2017-11-07 00:22:43 +01:00
ines
6447b8e396
Update v2 details
2017-11-06 21:15:36 +01:00
ines
008d7408cf
Make vectors vs. tensors more explicit in 101 (see #1498 )
2017-11-06 20:16:38 +01:00
ines
71852d3f25
Fix code mixins
2017-11-06 20:16:19 +01:00
ines
3b0699c9fe
Update benchmarks and data table style
2017-11-06 19:36:02 +01:00
ines
ddff7dc474
Update GPU install docs
2017-11-06 19:35:36 +01:00
ines
64d0f97c67
Update benchmarks and models
2017-11-06 18:19:00 +01:00
Matthew Honnibal
6fdffd7246
Merge pull request #1497 from explosion/feature/improve-optimizer-handling
...
💫 Improve optimizer handling
2017-11-06 16:41:15 +01:00
ines
972298e0c9
Update Pipe component docs and training API
2017-11-06 14:42:24 +01:00
ines
f48e1973ed
Fix accuracy table descriptions
2017-11-06 14:12:11 +01:00
ines
2d85ee6b5d
Fix broken link
2017-11-06 13:27:30 +01:00
ines
efb0a7e934
Fix broken links
2017-11-06 13:20:36 +01:00
ines
42a99eae02
Update troubleshooting guide
2017-11-06 13:17:09 +01:00
ines
2dca9e71a1
Add notes on catastrophic forgetting (see #1496 )
2017-11-06 13:17:02 +01:00
ines
e68d31bffa
Update models quickstart usage example
2017-11-06 13:06:26 +01:00
ines
2fe2c4942f
Update models directory and listing
2017-11-06 13:04:29 +01:00
ines
df1bdc7173
Add Dutch model
2017-11-06 02:44:59 +01:00
ines
333bef482f
Update pattern for Prism.js Python
2017-11-06 02:44:24 +01:00
ines
6b08aefd0c
Update formatting and styleguide
2017-11-05 23:31:31 +01:00
ines
e61a067c4b
Update v2 docs
2017-11-05 21:41:56 +01:00
ines
86d6bd7503
Fix wording
2017-11-05 19:23:50 +01:00
ines
6742657c4d
Fix website asset versioning
2017-11-05 19:23:45 +01:00
ines
2ca82d1f6e
Take out pt_core_news_sm for now
2017-11-05 18:57:04 +01:00
ines
a6ffa942bb
Update UD schemes
2017-11-05 18:46:24 +01:00
ines
3fa8900a6b
Don't include tag and label schemes in usage guide
2017-11-05 18:21:49 +01:00
ines
4810be4b44
Update POS scheme docs and add links for other schemes
2017-11-05 18:16:34 +01:00
ines
e7d0641125
Update POS row mixins
2017-11-05 18:16:16 +01:00
ines
15de2bb01d
Update and simplify other annotation scheme data
2017-11-05 16:09:48 +01:00
ines
2d59dd374b
Use collapsible sections for pos/dep scheme and update
...
Will ensure better overview as we add more schemes for more languages
2017-11-05 16:09:30 +01:00
ines
a9c77e01b4
Add accordion component (collapsible section)
2017-11-05 16:08:13 +01:00
ines
3d4dff1845
Remove comment
2017-11-05 16:07:14 +01:00
ines
b53c2010db
Add global focus style for links
2017-11-05 16:07:00 +01:00
ines
f092506578
Use hidden attribute instead of style.display
2017-11-05 16:06:50 +01:00
ines
0e8157674a
Add Portuguese and French
2017-11-04 23:07:21 +01:00
ines
d9fa3c6054
Update adding languages example
2017-11-04 15:12:39 +01:00
ines
c83fe54f0c
Update venv docs in installation instructions
2017-11-04 14:27:55 +01:00
ines
2940938bd8
Use more distinct style for checkboxes in quickstart
2017-11-04 14:24:30 +01:00
ines
4793d56a3e
Update commands for building from source
2017-11-04 14:24:14 +01:00
ines
177bf4ee39
Update GitHub topic links
2017-11-04 14:02:28 +01:00
ines
2639ecd5f8
Add docs note on custom tokenizer rules (see #1491 )
2017-11-03 23:33:18 +01:00
ines
380f2441b4
Fix script includes
2017-11-03 18:51:03 +01:00
Abhinav Sharma
c740277f9f
Minor typo [ nad => and ]
2017-11-03 16:30:44 +05:30
ines
1e16374687
Update models list to reflect spaCy v2.0.0a18
2017-11-03 11:29:34 +01:00
ines
a62b0727d8
Tidy up and always use bundle in built site for now
...
Just to be safe
2017-11-03 11:29:21 +01:00
ines
d0f88af5b6
Hide error earlier
2017-11-03 11:29:04 +01:00
ines
43512c68b2
Fix vector details in model overview
2017-11-02 20:04:13 +01:00
ines
9baab241b4
Add skeleton language data for Turkish
2017-11-02 16:32:24 +01:00
ines
31e349a62c
Update model families
2017-11-02 16:13:38 +01:00
ines
15cbc61a6e
Adjust rendering of large numbers
...
1234 -> 1.2k
12345 -> 12.3k
123456 -> 123k
1234567 -> 1.2m
2017-11-02 16:13:18 +01:00
ines
391fce09d9
Update licenses
2017-11-01 23:04:40 +01:00
ines
c6fea3e5f6
Add Romanian and Croatian skeletons (experimental)
...
Add language data templates to make it easier for others to contribute to the language support
2017-11-01 23:04:28 +01:00
ines
408f450ce0
Tidy up
2017-11-01 23:01:12 +01:00
ines
2fa53b39d5
Add dev dependency
2017-11-01 23:01:06 +01:00
ines
1976fb157f
Update licenses
2017-11-01 21:49:57 +01:00
ines
2ba4e4fc88
Fix broken links and add check_links shortcut script
2017-11-01 21:11:10 +01:00
ines
e5a4c31bb4
Adjust code line height
2017-11-01 19:49:42 +01:00
ines
5dd0d6a383
Update lightning tour
2017-11-01 19:49:36 +01:00
ines
9b4c38fe9f
Add button option to terminal component
2017-11-01 19:49:27 +01:00
ines
12954ab218
Don't document the tensorizer for now
2017-11-01 19:49:04 +01:00
ines
a7a76ea8c5
Update backwards incompatibilities
...
Also add separate section for deprecated
2017-11-01 16:31:57 +01:00
ines
4f77bb8476
Fix error handling
2017-11-01 16:29:55 +01:00
ines
5ab4e96144
Update v2 guide and split into partials
2017-11-01 14:13:36 +01:00
ines
1c7313051f
Document Token.is_sent_start
2017-11-01 14:13:22 +01:00
ines
9e429b5a8a
Update formatting of deprecation note
2017-11-01 14:13:08 +01:00
ines
0fbab8160d
Update GloVe vectors example
2017-11-01 13:14:43 +01:00
ines
a6f6bd6c98
Adjust tag spacing
2017-11-01 02:04:00 +01:00
ines
f84660986a
Update example sentences for models quickstart
2017-11-01 01:57:33 +01:00
ines
3b7ec64caa
Add PYTHONPATH to build from source quickstart
2017-11-01 01:52:45 +01:00
ines
092333afd4
Update vector details and number conversion
2017-11-01 01:47:31 +01:00
ines
5fd851a80b
Log errors
2017-11-01 01:46:50 +01:00
ines
07d02c3304
Update vectors and similarity usage guide
2017-11-01 01:25:17 +01:00
ines
0d8f4a534b
Update Vectors API docs
2017-11-01 00:56:54 +01:00
ines
9eb998443f
Update language tokenizer dependencies
2017-11-01 00:56:35 +01:00
ines
0cde065ed9
Add Irish to list of languages (see #1152 )
2017-11-01 00:56:21 +01:00
Ines Montani
3c8db3e4da
Merge pull request #1473 from explosion/refactor-javascript
...
Refactor website JS and add model comparison tool
2017-10-31 14:02:05 +01:00
ines
be5b635388
Remove "needs model" and add info about models (see #1471 )
2017-10-31 13:37:55 +01:00
ines
5af6c8b746
Update training docs
2017-10-30 20:28:00 +01:00
ines
8ad4f3f6e5
Take out JSON format include in tagger/parser
2017-10-30 19:48:35 +01:00
ines
33af6ac69a
Use even smaller examle size
...
100 was still too much, so try 20 instead
2017-10-30 19:46:45 +01:00
ines
f02b0af821
Fix path and use smaller example size
...
500 was too larger and caused laggy rendering
2017-10-30 19:44:35 +01:00
ines
18dde7869a
Update training data docs and add vocab JSONL
2017-10-30 19:40:05 +01:00
ines
57534253e6
Move CLI docs to own page
2017-10-30 19:39:26 +01:00
ines
ec657c1ddc
Update vocab docs and document Vocab.prune_vectors
2017-10-30 19:35:41 +01:00
ines
12343e23fd
Update CLI docs and document vocab command
2017-10-30 18:59:08 +01:00
ines
5598542055
Add link
2017-10-30 18:58:55 +01:00
ines
abf8aa05d3
Populate --create-meta defaults from file if available
...
If meta.json is found in directory and user chooses to overwrite it, show existing data as defaults.
2017-10-30 18:39:38 +01:00
ines
3ffbb64ab6
Unify chart options and update styleguide
2017-10-30 17:25:49 +01:00
ines
14ad92d337
Ensure fallbacks / progressive enhancement if JS disabled
2017-10-30 16:16:19 +01:00
ines
1eb1ed0c7c
Add tool for model comparison (experimental)
...
User can select two model and their meta is fetched from GitHub. Features, accuracy figures and speed benchmarks are displayed in a table, with an additional chart comparing the accuracy scores if available. Main use case: demonstrating and visualising trade-offs between larger and smaller models of the same type.
2017-10-30 14:09:43 +01:00
ines
fb2710211b
Integrate rollup into website build process
2017-10-30 14:08:26 +01:00
ines
38ef4274b6
Remove confusing icon for non-compatible models
...
ModelLoader will now output "not compatible" if no compatible version of model is found for a spaCy version
2017-10-30 14:07:42 +01:00
ines
8db3da3c3d
Refactor JS, split into modules and add nomodule option
...
rollup.js will be compiled by the rollup package and Babel on build, and will be loaded if a browser doesn't yet support JS modules
2017-10-30 14:06:25 +01:00
ines
5453821a9f
Update NER annotation scheme
...
Add note on training data sources and include coarse-grained Wikipedia scheme
2017-10-30 13:53:49 +01:00
ines
df149455f9
Don't ever wrap navigation bar contents
2017-10-30 13:16:20 +01:00
ines
74dd0ee2c2
Prevent responsive tables form scrolling vertically
2017-10-30 13:16:06 +01:00
ines
ae45446978
Remove comment
2017-10-30 13:15:46 +01:00
ines
25f6331550
Allow other style arguments on +grid-col
2017-10-30 13:15:30 +01:00
ines
08869c19fd
Merge mixins and mixins-base
...
The distinction was never clear anyways and it was progressively getting messier. So all mixins live in one file now.
2017-10-30 13:15:13 +01:00
ines
ae2ad5becc
Remove charts from model direcory and add speed benchmarks
...
With speed benchmarks, charts ended up taking up too much space – and they were mostly data porn and not particularly useful anyways. Instead, we might add a "Compare" page that fetches all models and lets the user compare two or more models in terms of accuracy, speed etc.
2017-10-29 03:58:19 +01:00
ines
47fd254ba7
Combine table scroll shadows if row has only one cell
2017-10-29 03:56:37 +01:00
ines
b11928abc2
Adjust labels, spacing and hack specificity
2017-10-29 03:56:09 +01:00
ines
af0ba014d2
Document +code-new and +code-old
2017-10-29 03:54:13 +01:00
ines
9b6828bd83
Add height option to +chart and document
2017-10-29 03:53:59 +01:00
ines
e18744823b
Add placeholders for Italian and Portuguese models
2017-10-29 01:29:39 +02:00
ines
3b1cfa3455
Add GPL license link
2017-10-29 01:18:32 +02:00
ines
5147cdc468
Fix formatting and add missing v2 label
2017-10-29 01:18:09 +02:00
ines
53bfcdba31
Make tooltips/tags and old/new code blocks more accessible (see #(see #1471 ))
...
Always add tooltip text as hidden label. Use different tooltip icons for tags and inline help icons. Add labels to old/new code blocks and add option to customise label text.
2017-10-29 01:17:49 +02:00
ines
4a4f9666b2
Improve style/accessibility of yes/no/neutral icons (see #1471 )
...
Use distinctive icons instead of only colour, add proper handling of labels (hidden or visible, but always present) with optional custom text.
2017-10-29 01:14:30 +02:00
ines
a8e10f94e4
Tidy up Lexeme and update docs
2017-10-27 21:07:50 +02:00
ines
5167a0cce2
Tidy up Vectors and docs
2017-10-27 19:45:19 +02:00
ines
544a407b93
Tidy up Doc, Token and Span and add missing docs
2017-10-27 17:07:26 +02:00
ines
6a0483b7aa
Tidy up and document Doc, Token and Span
2017-10-27 15:41:45 +02:00
ines
298c3d973c
Document Doc.get_lca_matrix
2017-10-27 14:37:53 +02:00
ines
9ff9afe889
Update spacy convert CLI docs
2017-10-27 14:37:42 +02:00
ines
52f1bf2729
Adjust GitHub embeds
2017-10-27 12:30:59 +02:00
Ines Montani
4033e70c71
Merge pull request #1461 from explosion/feature/disable-pipes
...
💫 Add Language.disable_pipes(), to temporarily edit pipeline and update code examples
2017-10-27 12:21:40 +02:00
ines
b5643d8575
Update intent parser docs and add to usage docs
2017-10-27 04:49:05 +02:00
ines
954c88f4d8
Fix formatting
2017-10-27 04:48:41 +02:00
ines
af28ca1ba0
Move example to pipeline directory
2017-10-27 02:00:01 +02:00
ines
1d69a46cd4
Update multi-processing example and add to docs
2017-10-27 01:58:55 +02:00
ines
647ef64f86
Update textcat docs
2017-10-27 00:51:29 +02:00
ines
a7b9074b4c
Update textcat training example and docs
2017-10-27 00:48:45 +02:00
ines
cc2917c9e8
Update fastText example and add to examples in docs
2017-10-26 18:47:02 +02:00
ines
daed7ff8fe
Update information extraction examples
2017-10-26 18:46:11 +02:00
ines
b90e958975
Update tagger and parser examples and add to docs
2017-10-26 16:27:42 +02:00
ines
0575e9cf20
Add parser example to docs
2017-10-26 16:12:34 +02:00
ines
281f88a59c
Update NER training examples
2017-10-26 14:44:43 +02:00
ines
8116d1a077
Add note on biluo_tags_from_offsets helper
2017-10-26 14:44:32 +02:00
ines
9bf78d5fb3
Update spacy.explain docs
2017-10-26 13:04:25 +02:00
ines
96b4214303
Add notes on pipe template inheritance in docs
2017-10-26 12:57:32 +02:00
ines
e6536d231f
Update new entity type training example in docs
2017-10-25 22:17:23 +02:00
ines
400812d9b1
Add add_label method to Pipe
2017-10-25 22:17:11 +02:00
ines
70de2dd035
Display vectors in models directory if available (see #1457 )
2017-10-25 16:15:37 +02:00
ines
1a722dac31
Merge branch 'develop' into feature/disable-pipes
2017-10-25 15:18:18 +02:00
ines
0102561f34
Update docs
2017-10-25 13:57:55 +02:00
ines
68e9de6917
Add documentation
2017-10-25 13:57:14 +02:00
ines
3484174e48
Add Language.path
2017-10-25 11:57:43 +02:00
ines
c815ff65f6
Update feature list
2017-10-24 21:49:11 +02:00
ines
d71702b827
Fix formatting
2017-10-24 20:11:04 +02:00
ines
6686e53530
Allow GitHub embeds to specify optional language
2017-10-24 16:00:56 +02:00
ines
56a47f137f
Add title description for tokenizer
2017-10-24 16:00:56 +02:00
ines
3944c1d6e7
Document lemmatizer
2017-10-24 16:00:56 +02:00
ines
c9dc88ddfc
Document current JSON format for training
2017-10-24 16:00:56 +02:00
Matthew Honnibal
ef3e5a361b
Merge pull request #1442 from explosion/feature/fix-sp
...
💫 Fix SP tag, tweak Vectors.__init__, fix Morphology
2017-10-24 10:24:07 +02:00
Matthew Honnibal
fdf25d10ba
Merge pull request #1440 from ramananbalakrishnan/develop
...
Support single value for attribute list in doc.to_array
2017-10-24 10:23:12 +02:00
ines
7701984f13
Document Span.as_doc
2017-10-23 10:38:27 +02:00
ines
db15902e84
Tidy up
2017-10-23 10:38:21 +02:00
ines
3f0a157b33
Fix typo
2017-10-23 10:38:13 +02:00
Matthew Honnibal
ebecaddb76
Make 'data_or_width' two keyword args in Vectors.__init__
...
Previously the data and width options were one argument in Vectors,
which meant you couldn't say vectors = Vectors(strings, width=300).
It's better to have two keywords.
2017-10-20 14:17:15 +02:00
ines
108f1f786e
Update symbols and document missing token attributes (see #1439 )
2017-10-20 13:08:44 +02:00
ines
4acab77a8a
Add missing symbol for LAW entities ( resolves #1427 )
2017-10-20 13:07:57 +02:00
Ramanan Balakrishnan
d44a079fe3
Update documentation on doc.to_array
2017-10-20 14:25:38 +05:30
Matthew Honnibal
61bc203f3f
Merge pull request #1438 from explosion/feature/fast-parser
...
💫 Improve runtime CPU efficiency of parser/NER
2017-10-19 02:42:21 +02:00
Matthew Honnibal
d4cfff0476
Comment out currently hard-coded hyper-params
2017-10-19 00:47:24 +02:00
Ines Montani
f0d577e460
Merge pull request #1425 from explosion/feature/hindi-tokenizer
...
💫 Basic Hindi tokenization support
2017-10-18 13:34:52 +02:00
ines
a74cba2ffa
Remove Binder from docs (now covered by Doc API)
2017-10-17 16:27:19 +02:00
ines
8ca344712d
Add Language.has_pipe method
2017-10-17 11:20:07 +02:00
ines
4cfe259266
Fix formatting
2017-10-16 20:36:41 +02:00
ines
18793efef1
Remove Russian from v2.0 docs for now
2017-10-16 20:36:36 +02:00
ines
d383612225
Add note about word vectors in example (see #1117 )
2017-10-16 20:31:58 +02:00
Matthew Honnibal
010a7309ff
Merge pull request #1402 from explosion/feature/fix-matcher-operators
...
💫 Fix Matcher variable-length operators
2017-10-16 17:53:19 +02:00
ines
63393b4e0d
Update matcher docs to reflect operator changes
2017-10-16 13:44:12 +02:00
ines
15514dc333
Add section on upgrading
2017-10-14 22:14:47 +02:00
ines
c0aceb9fbe
Add Hindi to supported languages
2017-10-14 15:16:41 +02:00
ines
a5da683578
Add Russian to alpha docs and update tokenizer dependencies
2017-10-14 12:52:41 +02:00
ines
a69f4e56e5
Remove outdated aside
2017-10-14 12:52:07 +02:00
ines
bb6ecb82e5
Ensure long file paths in code examples break if needed
2017-10-14 12:51:52 +02:00
ines
bfd9506f1d
Update extensions docs and add resources
2017-10-13 00:18:13 +02:00
ines
5f5d6897e8
Increment version
2017-10-13 00:18:02 +02:00
ines
9fd68334ab
Add validate command docs
2017-10-12 23:36:48 +02:00
Ines Montani
37aa523a8e
Merge pull request #1408 from explosion/feature/dot-underscore
...
💫 Custom attributes via Doc._, Token._ and Span._
2017-10-11 18:35:56 +02:00
ines
eac9e99086
Update docs on adding lemmatization to languages
2017-10-11 14:21:15 +02:00
ines
f4ae6763b9
Fix consistency of imports from spacy.tokens in examples
2017-10-11 02:30:40 +02:00