Commit Graph

2048 Commits

Author SHA1 Message Date
Ines Montani
71723cece1 Add note on visualizing long texts ans sentences (see #2636) [ci skip] 2018-08-08 15:28:21 +02:00
Ines Montani
6147bd3eb4 Fix link target (closes #2645) [ci skip] 2018-08-08 15:03:52 +02:00
Ines Montani
8c47da1f19 Update Language serialization docs (see #2628) [ci skip]
Add note on using from_disk and from_bytes via subclasses and add example
2018-08-07 14:17:57 +02:00
Matthew Honnibal
664cfc29bc Merge branch 'master' of https://github.com/explosion/spaCy 2018-08-07 10:49:39 +02:00
Matthew Honnibal
2278c9734e Fix spelling error #2640 2018-08-07 10:49:21 +02:00
Xiaoquan Kong
f0c9652ed1 New Feature: display more detail when Error E067 (#2639)
* Fix off-by-one error

* Add verbose option

* Update verbose option

* Update documents for verbose option
2018-08-07 10:45:29 +02:00
Ines Montani
6a4360e425 Update universe [ci skip] 2018-08-02 17:33:08 +02:00
Sami
dbc993f5b3 Updating description and code snippet spacy-lefff (#2623)
* updating description and code snippet spacy-lefff

* contributors agreement
2018-08-02 17:25:27 +02:00
Vikas Kumar Yadav
d3e21aad64 Update _benchmarks.jade (#2618) 2018-08-02 00:28:28 +02:00
Brian Phillips
8227de0099 Update language.jade (#2616) 2018-07-31 12:34:42 +02:00
Ioannis Daras
055cc0de44 Bug fix to pseudocode for tokenizer customization (#2604) 2018-07-27 11:04:12 +02:00
Andriy Mulyar
e9ef51137d Fixed typo (#2596)
Changed 'The index of the first character after the span.' to The index of the last character after the span' in description of doc.char_span
2018-07-25 22:17:15 +02:00
Ines Montani
75f3234404
💫 Refactor test suite (#2568)
## Description

Related issues: #2379 (should be fixed by separating model tests)

* **total execution time down from > 300 seconds to under 60 seconds** 🎉
* removed all model-specific tests that could only really be run manually anyway – those will now live in a separate test suite in the [`spacy-models`](https://github.com/explosion/spacy-models) repository and are already integrated into our new model training infrastructure
* changed all relative imports to absolute imports to prepare for moving the test suite from `/spacy/tests` to `/tests` (it'll now always test against the installed version)
* merged old regression tests into collections, e.g. `test_issue1001-1500.py` (about 90% of the regression tests are very short anyways)
* tidied up and rewrote existing tests wherever possible

### Todo

- [ ] move tests to `/tests` and adjust CI commands accordingly
- [x] move model test suite from internal repo to `spacy-models`
- [x] ~~investigate why `pipeline/test_textcat.py` is flakey~~
- [x] review old regression tests (leftover files) and see if they can be merged, simplified or deleted
- [ ] update documentation on how to run tests


### Types of change
enhancement, tests

## Checklist
<!--- Before you submit the PR, go over this checklist and make sure you can
tick off all the boxes. [] -> [x] -->
- [x] I have submitted the spaCy Contributor Agreement.
- [x] I ran the tests, and all new and existing tests passed.
- [ ] My changes don't require a change to the documentation, or if they do, I've added all required information.
2018-07-24 23:38:44 +02:00
kororo
b1ec827ee0 Fix typo (#2579)
Update slogan, desc and code snippet to latest version
2018-07-24 22:47:33 +02:00
ines
cd687091fb Remove nl examples from widget for now [ci skip]
Restore for next spaCy version when path to example sentences is fixed
2018-07-24 22:41:20 +02:00
ines
2d8ffb8bcd Fix formatting 2018-07-24 22:40:49 +02:00
ines
1b3da8d2ae Update website for v2.0.12 [ci skip] 2018-07-24 21:04:22 +02:00
ines
ae5ed2d698 Update docs for v2.0.12 [ci skip] 2018-07-21 15:51:44 +02:00
ines
d517dd4297 Document remove_extension methods 2018-07-21 15:51:28 +02:00
ines
153f41a5cc Use better examples for Doc extension methods 2018-07-21 15:51:11 +02:00
ines
3c30d1763c Merge branch 'master' into develop 2018-07-21 15:34:18 +02:00
kororo
2784babef9 Add ExcelCy into Universe list (#2572)
Hi guys,

This is my first spaCy extension. I am excited to able to do this. Please do let me know if there is any suggestions or modifications I need to do. Feel free to use/contribute the repo that I made.

## Description
ExcelCy is a SpaCy toolkit to help improve the data training experiences. It provides easy annotation using Excel file format. It has helper to pre-train entity annotation with phrase and regex matcher pipe.

### Types of change
Update to Universe list in website.

## Checklist
- [x] I have submitted the spaCy Contributor Agreement.
- [x] I ran the tests, and all new and existing tests passed.
- [x] My changes don't require a change to the documentation, or if they do, I've added all required information.
2018-07-19 19:28:33 +02:00
ines
80e7485630 Merge branch 'master' into develop 2018-07-18 17:28:47 +02:00
Xiang Ji
19a5ef1c58 Fix venv command examples (#2560) [ci skip]
* Fix venv command examples

The documentation refers to `venv`, which is native to Python3.
However, the command examples are as if they were still `virtualenv`,
which is a package independent of `venv`:

- It doesn't need to be installed via `pip`. In fact `pip install venv` would
return an error.
- The correct way to invoke `venv` is `python3 -m venv`, not `venv`, which would
return command not found.

See https://docs.python.org/3/library/venv.html

I suspect the documentation simply replaced all occurrences of `virtualenv` with
`venv`. However they are different modules and are used differently.

* Update comment [ci skip]
2018-07-18 10:31:24 +02:00
ines
50c367ee96 Update meta [ci skip] 2018-07-10 13:51:45 +02:00
ines
3a321e79ac Merge branch 'master' into develop 2018-07-10 13:49:08 +02:00
ines
71bfc92913 Exclude models for non-stable versions [ci skip] 2018-07-10 13:44:55 +02:00
ines
b5200962c0 Adjust formatting [ci skip] 2018-07-09 18:35:46 +02:00
Alex Villarreal
bd35bf7f09 Guidance to handle binary files in git in Windows (#2526)
Adds guidance on what to do if users encounter the error described in [1634](https://github.com/explosion/spaCy/issues/1634), which probably only happens in Windows environments.
2018-07-09 18:31:37 +02:00
ines
f575b01595 Update language and license meta [ci skip] 2018-07-04 15:09:36 +02:00
ines
63666af328 Merge branch 'master' into develop 2018-07-04 14:52:25 +02:00
Matthew Honnibal
a85620a731 Note CoreNLP tokenizer correction on website 2018-07-02 11:35:31 +02:00
ines
06c6dc6fbc Update Juniper [ci skip] 2018-06-28 11:48:17 +02:00
Nipun Sadvilkar
741ba80bd5 Train model command n_iteration 20 -> 30 (#2454)
In source code `train.py` default Number of iterations  is 30
2018-06-18 11:57:08 +02:00
ines
53a2bc8c8d Only scroll sidebar item into view if needed [ci skip] 2018-06-12 10:58:50 +02:00
ines
65713a6593 Increment versions [ci skip] 2018-06-12 10:49:50 +02:00
Ines Montani
968f6f0bda
💫 Document Cython API (#2433)
## Description

This PR adds the most relevant documentation of spaCy's Cython API.

(Todo for when we publish this: rewrite `/api/#section-cython` and `/api/#cython` to `/api/cython#conventions`.)

### Types of change
docs

## Checklist
<!--- Before you submit the PR, go over this checklist and make sure you can
tick off all the boxes. [] -> [x] -->
- [x] I have submitted the spaCy Contributor Agreement.
- [x] I ran the tests, and all new and existing tests passed.
- [x] My changes don't require a change to the documentation, or if they do, I've added all required information.
2018-06-11 17:47:46 +02:00
GolanLevy
72d7e80f94 adding a missing apostrophe (#2436) 2018-06-11 17:47:24 +02:00
ines
778e5f4da3 Merge branch 'master' into develop 2018-06-11 00:38:04 +02:00
himkt
57311d5d47 replace janome with mecab in the documentation and the test (#2415)
* Add links to Reddit data (see #2401)

* replace janome with mecab in the documentation and the test

* add the assignment
2018-06-11 00:33:13 +02:00
ines
effb55d591 Adjust formatting [ci skip] 2018-06-11 00:29:13 +02:00
Nathan Breit
ba6d2cf393 Add EpiTator to Universe (#2429) 2018-06-11 00:24:13 +02:00
himkt
1a568f2e08 fix wrong documentations (#2423) 2018-06-11 00:21:06 +02:00
Bohdan Moskalevskyi
d66292f767 fix UD data file extensions (#2425)
* fix UD data files extension

* add contributor agreement for msklvsk
2018-06-08 14:26:11 +02:00
ines
a0017e4909 Merge branch 'master' into develop 2018-05-30 14:10:47 +02:00
ines
0baaf836cf Update formatting [ci skip] 2018-05-30 13:32:49 +02:00
ines
3913e18201 Add self-attentive-parser to universe (see #59) 2018-05-30 13:31:28 +02:00
ines
4a62486340 Merge branch 'master' into develop 2018-05-30 13:01:01 +02:00
ines
605c663a4c Fix HTML merger examples (see #2390) 2018-05-30 12:22:32 +02:00
ines
d0b16aa014 Update list of languages 2018-05-26 18:56:26 +02:00
Samuel Pouyt
5f988b8e9c Update _custom.jade (#2372)
It seems based on the doc and trying out that the `en` or `[lang]` is missing from the `spacy model-init`
2018-05-26 18:17:12 +02:00
ines
d84a830d79 Merge branch 'master' of https://github.com/explosion/spaCy 2018-05-26 17:57:05 +02:00
ines
fb923b31ea Fix bad HTML example (see #2376) and turn it into section on matcher + components
Avoid problems caused by merging while matching (e.g. index errors). Creating a Matcher component also better reflects the recommended best practices.
2018-05-26 17:57:02 +02:00
Shantam Raj
592834183a corrected spelling (#2359)
changed **interpretted** to **interpreted**
2018-05-24 13:29:52 +02:00
ines
8adb967e0c Fix from source quickstart instructions for Windows
See: https://stackoverflow.com/a/50478036/6400719
2018-05-24 12:42:16 +02:00
Shantam Raj
1a4682dd0b Update _training.jade (#2340)
* Update _training.jade

Correcting grammar. Replacing "The" with "To".

* Create armsp.md

* Update armsp.md
2018-05-21 11:09:33 +02:00
ines
ff1082d8e4 Add version tag in CLI docs [ci skip] 2018-05-21 01:17:49 +02:00
Ines Montani
d4cc736b7c 💫 Improve model downloads: check for existing install, customise pip and use requests library again (#2346)
* Go back to using requests instead of urllib (closes #2320)

Fewer dependencies are good, but this one was simply causing too many other problems around SSL verification and Python 2/3 compatibility. requests is a popular enough package that it's okay for spaCy to depend on it – and this will hopefully make model downloads less flakey.

* Only download model if not installed (see #1456)

Use #egg=model==version to allow pip to check for existing installations. The download is only started if no installation matching the package/version is found. Fixes a long-standing inconvenience.

* Pass additional options to pip when installing model (resolves #1456)

Treat all additional arguments passed to the download command as pip options to allow user to customise the command. For example:

python -m spacy download en --user

* Add CLI option to enable installing model package dependencies

* Revert "Add CLI option to enable installing model package dependencies"

This reverts commit 9336ffe695.

* Update documentation
2018-05-20 20:26:56 +02:00
vishnumenon
ae3719ece5 Fix the code for FACILITIY entities (#2324)
* Fix the code for FACILITIY entities

As far as I can tell, the default models all use "FAC" rather than "FACILITY"

* Added my Contributor Agreement

* Rename vishnumenon to vishnumenon.md
2018-05-12 15:19:17 +02:00
ines
ac25bc4016 Add docs section on sentence segmentation [ci skip] 2018-05-07 21:25:20 +02:00
ines
14148cd147 Fix formatting and wording 2018-05-07 21:24:35 +02:00
ines
f803da609f Add scattertext [ci skip] 2018-05-07 19:10:23 +02:00
ines
c9547b7b8b Update Juniper (see #2293) 2018-05-03 15:36:02 +02:00
Alex Villarreal
647f2544c5 Fix code sample for span.set_extension (#2286) 2018-05-03 00:39:22 +02:00
Alex Villarreal
13d562e1a4 Fix code sample for Doc.set_extension (#2282)
* Fix code sample for `set_extension`

The previous sample code for `set_extension` fails the assertion at the end, because `city_getter` it checked if the whole document text matches any of the city names. Now it checks if any of the city names is contained in the document text.

* Contributor agreement
2018-05-02 10:16:05 +02:00
Shirish Kadam
d98a90440f Added Adam project to spaCy Universe (#2275)
* Added 5hirish to contributors

* Added Adam Qas Project to spaCy Universe

* Remove $ from code example
2018-04-30 22:25:01 +02:00
ines
56e7faf16b Fix spacing 2018-04-30 22:24:40 +02:00
ines
6efb4cdf88 Use Juniper and tidy up 2018-04-30 18:48:35 +02:00
ines
45bb8d75a5 Fix overflow issues on small screens [ci skip] 2018-04-29 03:17:36 +02:00
Ines Montani
49cee4af92
💫 Interactive code examples, spaCy Universe and various docs improvements (#2274)
* Integrate Python kernel via Binder

* Add live model test for languages with examples

* Update docs and code examples

* Adjust margin (if not bootstrapped)

* Add binder version to global config

* Update terminal and executable code mixins

* Pass attributes through infobox and section

* Hide v-cloak

* Fix example

* Take out model comparison for now

* Add meta text for compat

* Remove chart.js dependency

* Tidy up and simplify JS and port big components over to Vue

* Remove chartjs example

* Add Twitter icon

* Add purple stylesheet option

* Add utility for hand cursor (special cases only)

* Add transition classes

* Add small option for section

* Add thumb object for small round thumbnail images

* Allow unset code block language via "none" value

(workaround to still allow unset language to default to DEFAULT_SYNTAX)

* Pass through attributes

* Add syntax highlighting definitions for Julia, R and Docker

* Add website icon

* Remove user survey from navigation

* Don't hide GitHub icon on small screens

* Make top navigation scrollable on small screens

* Remove old resources page and references to it

* Add Universe

* Add helper functions for better page URL and title

* Update site description

* Increment versions

* Update preview images

* Update mentions of resources

* Fix image

* Fix social images

* Fix problem with cover sizing and floats

* Add divider and move badges into heading

* Add docstrings

* Reference converting section

* Add section on converting word vectors

* Move converting section to custom section and fix formatting

* Remove old fastText example

* Move extensions content to own section

Keep weird ID to not break permalinks for now (we don't want to rewrite URLs if not absolutely necessary)

* Use better component example and add factories section

* Add note on larger model

* Use better example for non-vector

* Remove similarity in context section

Only works via small models with tensors so has always been kind of confusing

* Add note on init-model command

* Fix lightning tour examples and make excutable if possible

* Add spacy train CLI section to train

* Fix formatting and add video

* Fix formatting

* Fix textcat example description (resolves #2246)

* Add dummy file to try resolve conflict

* Delete dummy file

* Tidy up [ci skip]

* Ensure sufficient height of loading container

* Add loading animation to universe

* Update Thebelab build and use better startup message

* Fix asset versioning

* Fix typo [ci skip]

* Add note on project idea label
2018-04-29 02:06:46 +02:00
ines
a512fa60ef Remove upcoming option from docs for now 2018-04-28 23:32:18 +02:00
ines
6fb6371670 Add collapse_phrases option to displacy (closes #2266) 2018-04-28 23:06:50 +02:00
Matt Upson
87cc6b3599 Add missing comma to NN example in docs (#2255)
Also add a completed contributor agreement.
2018-04-28 14:56:00 +02:00
ines
4a3bea00c7 Update resources [ci skip] 2018-04-26 22:10:34 +02:00
Pradeep Kumar Tippa
df389e5b74 spacy-101 vocab doc giving valid variable names (#2236) 2018-04-18 14:54:26 -07:00
ines
ce63f8997b Update init-model docs 2018-04-10 21:42:54 +02:00
ines
0e847d7fe5 Fix typo 2018-04-09 14:51:14 +02:00
ines
de137fba84 Add TensorBoard examples to examples overview [ci skip] 2018-04-03 16:01:52 +02:00
ines
6d87b28f15 Add Vietnamese to language overview [ci skip] 2018-04-03 16:01:36 +02:00
ines
9615ed5ed7 Update emoji/hashtag matcher example (resolves #2156) [ci skip] 2018-03-28 18:41:28 +02:00
ines
ce6071ca89 Remove ftfy dependency and update docs 2018-03-28 12:09:42 +02:00
ines
5ecc60cf3b Add book to resources [ci skip] 2018-03-24 17:12:56 +01:00
ines
53680642af Port over docs changes [ci skip] 2018-03-24 17:12:48 +01:00
Matthew Honnibal
f9f46e5a07 Revert matcher fixes from GregDubbin 2018-02-18 10:59:28 +01:00
ines
612c79a4f5 Update first matcher example and match_id (resolves #1989) 2018-02-17 11:57:38 +01:00
ines
ca56fb53d1 Add user survey to navigation [ci skip] 2018-02-15 12:14:30 +01:00
ines
cab5b775e7 Document ENT_TYPE matcher attribute [ci skip] 2018-02-15 12:14:19 +01:00
Pradeep Kumar Tippa
416cd021ce Added TAG from spacy symbols which used below 2018-02-09 19:16:59 +05:30
Pradeep Kumar Tippa
01cc9cd9c0 assert statement syntax fix in doc 2018-02-09 19:16:25 +05:30
Pradeep Kumar Tippa
a78062e466 Merge remote-tracking branch 'upstream/master' into web-doc-patches 2018-02-09 19:13:19 +05:30
ines
ab33e274f5 Add more details on symlink error & Windows solution (resolves #1941) [ci skip] 2018-02-09 10:43:33 +01:00
ines
8eaa934382 Merge branch 'master' of https://github.com/explosion/spaCy 2018-02-09 10:23:36 +01:00
ines
e9f67be04d Fix regex flag matcher example (resolves #1950) 2018-02-09 10:23:33 +01:00
ines
fc4ae04c55 Document LENGTH attribute in matcher 2018-02-09 10:23:03 +01:00
Pradeep Kumar Tippa
8a7467b26e Merge remote-tracking branch 'upstream/master' into web-doc-patches 2018-02-09 13:54:26 +05:30
Orion Montoya
24af6375db
update link to Honnibal and Johnson 2015
aclweb.org is throwing a gateway timeout on the link as `https`+`aclweb.org`, but is fine with `https`+`www.aclweb.org` (also with `http`+`aclweb.org`, but let's keep it in `https`, shall we?
2018-02-08 10:49:09 -08:00
Pradeep Kumar Tippa
03113d6779 Fixing navigating parse tree doc under dependency parse 2018-02-08 19:34:15 +05:30
ines
a3b965b29d Remove UPPER from Matcher attributes docs (resolves #1949) 2018-02-08 11:29:27 +01:00
ines
696ae87b47 Fix whitespace 2018-02-08 11:28:54 +01:00
ines
26bc75134d Fix typo 2018-02-08 11:28:44 +01:00
Pradeep Kumar Tippa
da9d687e75
Fixing typo from taining to training 2018-02-07 16:49:25 +05:30
Pradeep Kumar Tippa
ed7d268e93
Fixing vocab doc
Replacing "like" with "love", coffee suffix should be "fee" but not "ffe"
2018-02-07 14:55:12 +05:30
ines
f377c483e4 Add note on manual entity order in displaCy [ci skip] 2018-02-07 01:08:42 +01:00
ines
58eb178667 Update Doc.char_span docs [ci skip] 2018-02-07 01:08:30 +01:00
sayf eddine hammemi
86e7727855 Fix typo in the word build. 2018-02-04 20:48:45 +01:00
ines
901bc0e85f Add Persian to list of languages [ci skip] 2018-02-01 04:47:34 +01:00
Hassan Shamim
a0b912c528 fix broken link to test suite models 2018-01-30 15:01:01 -08:00
greg
daefed0a34 Correct documentation of '+' and '*' ops 2018-01-22 15:55:44 -05:00
ines
67ba73351d Fix typo and use better serialization example (resolves #1851) [ci skip] 2018-01-16 18:42:03 +01:00
ines
7943a8e90c Add spacy-lookup by @mpuig [ci skip] 2018-01-16 00:28:46 +01:00
ines
5684206154 Add LanguageCrunch by @artpar [ci skip] 2018-01-15 16:14:26 +01:00
Mateusz Tatusko
dda0e58c11
Update _pos-tags.jade
really small changes to English tags description, but might help some people while working on projects
1) -PRB- should be -RRB- instead 
2) space gets tagged as _SP, and not SP
2018-01-15 12:01:51 +09:00
ines
0536e91564 Add note on Tagger.tag_names vs. Tagger.labels (see #1666) [ci skip] 2018-01-14 14:37:19 +01:00
ines
bbee48080d Clarify hyperparameters and alias usage in spacy train (resolves #1838) [ci skip] 2018-01-14 14:32:50 +01:00
ines
4daba3abda Add regex section to rule-based matching docs (see #1567, #1833) [ci skip] 2018-01-14 14:22:13 +01:00
Ines Montani
36f426fe0a
Merge pull request #1808 from fucking-signup/master
Fix issue #1769
2018-01-12 21:12:02 +00:00
ines
cfac5b955f Fix aligment issues with newsletter signup form 2018-01-12 22:06:44 +01:00
ines
65babd9e2e Fix typo, formatting and operator descriptions (resolves #1820) 2018-01-12 22:06:27 +01:00
Matthew Honnibal
a2a06dce24
Merge pull request #1792 from explosion/feature-improve-model-download
💫 Improve model downloading and linking
2018-01-11 20:02:08 +01:00
Ines Montani
11676b47f2
Merge pull request #1828 from wrathagom/patch-1
Small Grammar Fix to _basics.jade
2018-01-11 17:27:23 +00:00
pbnsilva
4cfd848bc3 Fixes typo in PhraseMatcher API docs 2018-01-11 17:35:59 +01:00
Caleb M. Keller
e68f6bf890
Small Grammar Fix to _basics.jade
Fixed an incorrect word order.
2018-01-11 09:26:47 -05:00
Matthew Honnibal
7ca49c2061
Merge branch 'master' into feature-improve-model-download 2018-01-10 18:21:55 +01:00
Kit
db6e4ba72e
Update code example according to new changes 2018-01-08 03:45:56 +01:00
ines
ef210c73dd Update cli.download and cli.validate docs 2018-01-03 21:34:03 +01:00
ines
cc9df10e69 Document util.set_lang_class (see #1737) 2018-01-03 20:13:25 +01:00
Ines Montani
874f174ab1
Merge pull request #1790 from nirdesh37/patch-1
Update goldparse.jade
2018-01-03 18:37:07 +00:00
ines
1fa6ba8130 Fix Doc.from_array example to make it work (see #1527) 2018-01-03 16:59:38 +01:00
ines
49635350f0 Add .from_disk() to pipeline component init example (resolves #1728) 2018-01-03 16:50:24 +01:00
ines
95063ba26b Update tests documentation (resolves #1781) 2018-01-03 16:42:26 +01:00
nirdesh37
67fdceed6a
Update goldparse.jade 2018-01-03 17:25:21 +05:30
Martin Andrews
e4355dade2
Documentation example fix : token.head needs '==' rather than 'is'
(similar change to #1689, it seems).
2017-12-18 18:12:10 +08:00
Kristofer Berggren
1cb8c997fb
Fix typo Span -> Token on Token API page
Change Span.vector_norm to Token.vector_norm.
2017-12-17 20:32:19 +08:00
Ines Montani
4befd8bd44
Merge pull request #1724 from mpuels/patch-7
doc: Fix minor mistakes
2017-12-17 12:09:17 +00:00
ines
21482b391b Fix head 2017-12-16 13:48:19 +01:00
mpuels
b3df2a2ffd
doc: Fix minor mistakes 2017-12-14 20:55:59 +01:00
mpuels
3f7bedadee
doc: Fix minor mistakes 2017-12-13 11:37:24 +01:00
ines
24e80c51b8 Document init-model command 2017-12-07 10:14:37 +01:00
mpuels
e3af19a076
doc: Replace 'is not' with '!=' in code example
The function `dependency_labels_to_root(token)` defined in section *Get syntactic dependencies* does not terminate. Here is a complete example:

    import spacy
    
    nlp = spacy.load('en')
    doc = nlp("Apple and banana are similar. Pasta and hippo aren't.")
    
    def dependency_labels_to_root(token):
        """Walk up the syntactic tree, collecting the arc labels."""
        dep_labels = []
        while token.head is not token:
            dep_labels.append(token.dep)
            token = token.head
        return dep_labels
    
    dep_labels = dependency_labels_to_root(doc[1])
    dep_labels

Replacing `is not` with `!=` solves the issue:

    import spacy
    
    nlp = spacy.load('en')
    doc = nlp("Apple and banana are similar. Pasta and hippo aren't.")
    
    def dependency_labels_to_root(token):
        """Walk up the syntactic tree, collecting the arc labels."""
        dep_labels = []
        while token.head != token:
            dep_labels.append(token.dep)
            token = token.head
        return dep_labels
    
    dep_labels = dependency_labels_to_root(doc[1])
    dep_labels

The output is

    ['cc', 'nsubj']
2017-12-06 20:08:42 +01:00
mpuels
82e575ebfb
doc: Fix assert statement in Lightning Tour
Python 3 throws an error message on the original assert statement. Also, according to the Python documentation regarding the assert statement (https://docs.python.org/3/reference/simple_stmts.html#the-assert-statement), `assert` takes at least one argument and at most two. In the two-argument form the second argument is meant as an error message to be displayed when the assertion fails. I don't think this is intended in this case.
2017-12-06 16:40:51 +01:00
mpuels
662601f01c
doc: Add missing *-operator to nlp.disable_pipes()
I'm using SpaCy version 2.0.3. If I don't use the *-operator in the example, Python throws an error message. With the operator it works fine. Also according to the documentation of the function `nlp.disable_pipes()`, it expects one or more strings as arguments and not one argument being a list of strings.
2017-12-06 15:26:43 +01:00
ines
b078e276e6 Document offsets_from_biluo_tags 2017-12-06 13:40:51 +01:00
ines
fb663f9b7d Add Russian to list of languages 2017-12-06 13:40:32 +01:00
ines
58a19518cf Merge branch 'master' of https://github.com/explosion/spaCy 2017-12-05 13:17:58 +01:00
ines
7ade336ab7 Add "Unknown locale" issue to troubleshooting guide (see #1684, #1641, #1517) 2017-12-05 13:17:55 +01:00
Mark Dodwell
9d4c185860
Fix link to CLEAR Style dependency labels PDF 2017-12-04 23:28:06 -08:00
ines
40638b7cdf Update resources 2017-12-02 04:16:03 +01:00
ines
9ea8a7cf0c Add spacy_cld to extensions 2017-12-01 23:21:33 +01:00
ines
8d3f29322f Add spacy_hunspell to resources (see #315) 2017-11-29 09:33:22 +01:00
atomobianco
f6a82da907
Corrected char index instead of token index
Changed the index used to add the label because `displacy.render` apparently uses char index
2017-11-26 23:55:25 +01:00
ines
bda6e2a816 Add training example to lightning tour 2017-11-26 18:04:18 +01:00
ines
89f8b1fba0 Update example documents 2017-11-26 18:04:04 +01:00
ines
65d66b81f1 Fix typo 2017-11-26 18:03:44 +01:00
ines
e4ee666be5 Fix biluo_tags_from_offsets example and docs 2017-11-26 16:37:32 +01:00
ines
434030e0d0 Fix requirements.txt example (see #1638) 2017-11-26 15:53:19 +01:00
Matthew Honnibal
6bc9917a0e
Another small fix to component docs 2017-11-23 11:47:20 +01:00
markulrich
c9b63c0dfc Use correct local parameter in example MyComponent (and added markulrich.md contributor file) 2017-11-22 15:59:08 -08:00
ines
4f7e64e371 Update resources 2017-11-18 02:53:00 +01:00
ines
c3051e95f7 Add note on attribute extension defaults (resolves #1587) 2017-11-17 19:14:29 +01:00
ines
954f8cc6d1 Update syntax theme (should move the modifications out to an extension sometime) 2017-11-17 19:13:53 +01:00
Raphaël Bournhonesque
a0793fd4cc
Fix typo 2017-11-17 17:57:55 +01:00
Martino Mensio
ce1aade41e small typo on docs 2017-11-17 16:20:22 +01:00
pavillet
ad2935f0c3
Update _spacy.jade
Doc example gives 'object is not subscriptable' error.
Correcting as an attribuet
2017-11-17 00:02:20 +01:00
ines
40c4e8fc09 Remove "optional" from dev_data arg and add more info (see #1578) 2017-11-14 20:26:05 +01:00
KMLDS
d5b20ac3b6
Update span.jade 2017-11-13 19:27:20 -05:00
ines
bc79274706 Fix typo 2017-11-13 17:00:03 +01:00
ines
7a7b01feb1 Update links 2017-11-13 08:30:06 +01:00
ines
b3e502a076 Add videos section to resources 2017-11-13 08:29:57 +01:00
ines
f2b6b98b75 Fix typo in code example (resolves #1556) 2017-11-13 08:29:16 +01:00
ines
ceb2c596f1 Update conda details 2017-11-11 13:07:00 +01:00
ines
4a97def06a Update features 2017-11-10 19:05:10 +01:00
ines
dea5636d6c Fix broken links 2017-11-10 13:06:38 +01:00
Wahib Faizi
0da56f8ef8
Fix typo. Add missing '='. 2017-11-10 14:51:24 +03:00
ines
4c5d2c80d5 Re-add python -m to commands, too brittle :( (see #1536) 2017-11-10 02:30:55 +01:00
ines
ee5697a1cd Fix training tips 2017-11-10 00:19:42 +01:00
ines
6ae0ebfa3a Update training tips 2017-11-10 00:17:10 +01:00
ines
b20779bac4 Update resources 2017-11-09 23:05:37 +01:00
ines
ed84688935 Remove old link 2017-11-09 15:34:12 +01:00
Ines Montani
e5b9ccdb5c
Merge pull request #1526 from mcsalgado/fix-typos
fix typos
2017-11-09 15:33:55 +01:00
Victor Salgado
fe1d969d5f fix typos 2017-11-09 10:55:13 -02:00
Mathias Deschamps
25b26f0d64
Fix similarity visual
Doc was showing similarity when dissimilar
2017-11-09 11:08:26 +01:00
ines
98767122a7 Fix typos 2017-11-09 04:13:03 +01:00
ines
e87eb11beb Update package.json 2017-11-09 04:12:57 +01:00
ines
33b84f4c39 Change clear_vectors to reset_vectors (resolves #1516) 2017-11-08 18:11:23 +01:00
ines
97a5892347 Document Vectors.resize() and update v2 incompatibilities (resolves #1514) 2017-11-08 17:11:11 +01:00
ines
c0a7a32bf8 Add en.stop_words change to v2 docs (resolves #1512) 2017-11-08 16:30:46 +01:00
ines
9b09b6b0cd Fix formatting 2017-11-08 16:30:23 +01:00
ines
f0bdfb4471 Fix vector listing for core sm models in list overview (see #1513) 2017-11-08 16:24:27 +01:00
ines
94cd3d51db Update v2 docs and model info
Take out speed tables until we fix our benchmark tests on CPU and GPU
2017-11-08 11:43:00 +01:00
ines
14f97cfd20 Add note on stream processing to migration guide (see #1508) 2017-11-08 01:53:36 +01:00
ines
5d1162cf21 Improve nlp.update / training loop overview (see #1507) 2017-11-08 01:17:42 +01:00
ines
2229aba71c Update website 2017-11-08 01:06:30 +01:00
ines
1768703e1c Update website for v2.0 2017-11-07 14:48:17 +01:00
ines
e4a05385d6 Update docs 2017-11-07 12:33:43 +01:00
ines
a4662a31a9 Move model package templates to cli.package and update docs 2017-11-07 12:15:35 +01:00
ines
a09c096d3c Get docs ready for v2.0.0 2017-11-07 12:00:43 +01:00
ines
173b1551af Update examples 2017-11-07 01:22:30 +01:00
ines
c37837cad1 Update training docs 2017-11-07 01:06:31 +01:00
ines
c7bda87b17 Update model docs and add tips section 2017-11-07 01:05:37 +01:00
ines
a1261e8632 Fix formatting 2017-11-07 01:05:30 +01:00
ines
912c1b1821 Document "simple training style" 2017-11-07 00:23:19 +01:00
ines
ad6438ccdf Update aside labels and under construction mixin 2017-11-07 00:23:00 +01:00
ines
8fb48b9b91 Update and document new util functions 2017-11-07 00:22:43 +01:00
ines
6447b8e396 Update v2 details 2017-11-06 21:15:36 +01:00
ines
008d7408cf Make vectors vs. tensors more explicit in 101 (see #1498) 2017-11-06 20:16:38 +01:00
ines
71852d3f25 Fix code mixins 2017-11-06 20:16:19 +01:00
ines
3b0699c9fe Update benchmarks and data table style 2017-11-06 19:36:02 +01:00
ines
ddff7dc474 Update GPU install docs 2017-11-06 19:35:36 +01:00
ines
64d0f97c67 Update benchmarks and models 2017-11-06 18:19:00 +01:00
Matthew Honnibal
6fdffd7246
Merge pull request #1497 from explosion/feature/improve-optimizer-handling
💫 Improve optimizer handling
2017-11-06 16:41:15 +01:00
ines
972298e0c9 Update Pipe component docs and training API 2017-11-06 14:42:24 +01:00
ines
f48e1973ed Fix accuracy table descriptions 2017-11-06 14:12:11 +01:00
ines
2d85ee6b5d Fix broken link 2017-11-06 13:27:30 +01:00
ines
efb0a7e934 Fix broken links 2017-11-06 13:20:36 +01:00
ines
42a99eae02 Update troubleshooting guide 2017-11-06 13:17:09 +01:00
ines
2dca9e71a1 Add notes on catastrophic forgetting (see #1496) 2017-11-06 13:17:02 +01:00
ines
e68d31bffa Update models quickstart usage example 2017-11-06 13:06:26 +01:00
ines
2fe2c4942f Update models directory and listing 2017-11-06 13:04:29 +01:00
ines
df1bdc7173 Add Dutch model 2017-11-06 02:44:59 +01:00
ines
333bef482f Update pattern for Prism.js Python 2017-11-06 02:44:24 +01:00
ines
6b08aefd0c Update formatting and styleguide 2017-11-05 23:31:31 +01:00
ines
e61a067c4b Update v2 docs 2017-11-05 21:41:56 +01:00
ines
86d6bd7503 Fix wording 2017-11-05 19:23:50 +01:00
ines
6742657c4d Fix website asset versioning 2017-11-05 19:23:45 +01:00
ines
2ca82d1f6e Take out pt_core_news_sm for now 2017-11-05 18:57:04 +01:00
ines
a6ffa942bb Update UD schemes 2017-11-05 18:46:24 +01:00
ines
3fa8900a6b Don't include tag and label schemes in usage guide 2017-11-05 18:21:49 +01:00
ines
4810be4b44 Update POS scheme docs and add links for other schemes 2017-11-05 18:16:34 +01:00
ines
e7d0641125 Update POS row mixins 2017-11-05 18:16:16 +01:00
ines
15de2bb01d Update and simplify other annotation scheme data 2017-11-05 16:09:48 +01:00
ines
2d59dd374b Use collapsible sections for pos/dep scheme and update
Will ensure better overview as we add more schemes for more languages
2017-11-05 16:09:30 +01:00
ines
a9c77e01b4 Add accordion component (collapsible section) 2017-11-05 16:08:13 +01:00
ines
3d4dff1845 Remove comment 2017-11-05 16:07:14 +01:00
ines
b53c2010db Add global focus style for links 2017-11-05 16:07:00 +01:00
ines
f092506578 Use hidden attribute instead of style.display 2017-11-05 16:06:50 +01:00
ines
0e8157674a Add Portuguese and French 2017-11-04 23:07:21 +01:00
ines
d9fa3c6054 Update adding languages example 2017-11-04 15:12:39 +01:00
ines
c83fe54f0c Update venv docs in installation instructions 2017-11-04 14:27:55 +01:00
ines
2940938bd8 Use more distinct style for checkboxes in quickstart 2017-11-04 14:24:30 +01:00
ines
4793d56a3e Update commands for building from source 2017-11-04 14:24:14 +01:00
ines
177bf4ee39 Update GitHub topic links 2017-11-04 14:02:28 +01:00
ines
2639ecd5f8 Add docs note on custom tokenizer rules (see #1491) 2017-11-03 23:33:18 +01:00
ines
380f2441b4 Fix script includes 2017-11-03 18:51:03 +01:00
Abhinav Sharma
c740277f9f
Minor typo [ nad => and ] 2017-11-03 16:30:44 +05:30
ines
1e16374687 Update models list to reflect spaCy v2.0.0a18 2017-11-03 11:29:34 +01:00
ines
a62b0727d8 Tidy up and always use bundle in built site for now
Just to be safe
2017-11-03 11:29:21 +01:00
ines
d0f88af5b6 Hide error earlier 2017-11-03 11:29:04 +01:00
ines
43512c68b2 Fix vector details in model overview 2017-11-02 20:04:13 +01:00
ines
9baab241b4 Add skeleton language data for Turkish 2017-11-02 16:32:24 +01:00
ines
31e349a62c Update model families 2017-11-02 16:13:38 +01:00
ines
15cbc61a6e Adjust rendering of large numbers
1234 -> 1.2k
12345 -> 12.3k
123456 -> 123k
1234567 -> 1.2m
2017-11-02 16:13:18 +01:00
ines
391fce09d9 Update licenses 2017-11-01 23:04:40 +01:00
ines
c6fea3e5f6 Add Romanian and Croatian skeletons (experimental)
Add language data templates to make it easier for others to contribute to the language support
2017-11-01 23:04:28 +01:00
ines
408f450ce0 Tidy up 2017-11-01 23:01:12 +01:00
ines
2fa53b39d5 Add dev dependency 2017-11-01 23:01:06 +01:00
ines
1976fb157f Update licenses 2017-11-01 21:49:57 +01:00
ines
2ba4e4fc88 Fix broken links and add check_links shortcut script 2017-11-01 21:11:10 +01:00
ines
e5a4c31bb4 Adjust code line height 2017-11-01 19:49:42 +01:00
ines
5dd0d6a383 Update lightning tour 2017-11-01 19:49:36 +01:00
ines
9b4c38fe9f Add button option to terminal component 2017-11-01 19:49:27 +01:00
ines
12954ab218 Don't document the tensorizer for now 2017-11-01 19:49:04 +01:00
ines
a7a76ea8c5 Update backwards incompatibilities
Also add separate section for deprecated
2017-11-01 16:31:57 +01:00
ines
4f77bb8476 Fix error handling 2017-11-01 16:29:55 +01:00
ines
5ab4e96144 Update v2 guide and split into partials 2017-11-01 14:13:36 +01:00
ines
1c7313051f Document Token.is_sent_start 2017-11-01 14:13:22 +01:00
ines
9e429b5a8a Update formatting of deprecation note 2017-11-01 14:13:08 +01:00
ines
0fbab8160d Update GloVe vectors example 2017-11-01 13:14:43 +01:00
ines
a6f6bd6c98 Adjust tag spacing 2017-11-01 02:04:00 +01:00
ines
f84660986a Update example sentences for models quickstart 2017-11-01 01:57:33 +01:00
ines
3b7ec64caa Add PYTHONPATH to build from source quickstart 2017-11-01 01:52:45 +01:00
ines
092333afd4 Update vector details and number conversion 2017-11-01 01:47:31 +01:00
ines
5fd851a80b Log errors 2017-11-01 01:46:50 +01:00
ines
07d02c3304 Update vectors and similarity usage guide 2017-11-01 01:25:17 +01:00
ines
0d8f4a534b Update Vectors API docs 2017-11-01 00:56:54 +01:00
ines
9eb998443f Update language tokenizer dependencies 2017-11-01 00:56:35 +01:00
ines
0cde065ed9 Add Irish to list of languages (see #1152) 2017-11-01 00:56:21 +01:00
Ines Montani
3c8db3e4da
Merge pull request #1473 from explosion/refactor-javascript
Refactor website JS and add model comparison tool
2017-10-31 14:02:05 +01:00
ines
be5b635388 Remove "needs model" and add info about models (see #1471) 2017-10-31 13:37:55 +01:00
ines
5af6c8b746 Update training docs 2017-10-30 20:28:00 +01:00
ines
8ad4f3f6e5 Take out JSON format include in tagger/parser 2017-10-30 19:48:35 +01:00
ines
33af6ac69a Use even smaller examle size
100 was still too much, so try 20 instead
2017-10-30 19:46:45 +01:00
ines
f02b0af821 Fix path and use smaller example size
500 was too larger and caused laggy rendering
2017-10-30 19:44:35 +01:00
ines
18dde7869a Update training data docs and add vocab JSONL 2017-10-30 19:40:05 +01:00
ines
57534253e6 Move CLI docs to own page 2017-10-30 19:39:26 +01:00
ines
ec657c1ddc Update vocab docs and document Vocab.prune_vectors 2017-10-30 19:35:41 +01:00
ines
12343e23fd Update CLI docs and document vocab command 2017-10-30 18:59:08 +01:00
ines
5598542055 Add link 2017-10-30 18:58:55 +01:00
ines
abf8aa05d3 Populate --create-meta defaults from file if available
If meta.json is found in directory and user chooses to overwrite it, show existing data as defaults.
2017-10-30 18:39:38 +01:00
ines
3ffbb64ab6 Unify chart options and update styleguide 2017-10-30 17:25:49 +01:00
ines
14ad92d337 Ensure fallbacks / progressive enhancement if JS disabled 2017-10-30 16:16:19 +01:00
ines
1eb1ed0c7c Add tool for model comparison (experimental)
User can select two model and their meta is fetched from GitHub. Features, accuracy figures and speed benchmarks are displayed in a table, with an additional chart comparing the accuracy scores if available. Main use case: demonstrating and visualising trade-offs between larger and smaller models of the same type.
2017-10-30 14:09:43 +01:00
ines
fb2710211b Integrate rollup into website build process 2017-10-30 14:08:26 +01:00
ines
38ef4274b6 Remove confusing icon for non-compatible models
ModelLoader will now output "not compatible" if no compatible version of model is found for a spaCy version
2017-10-30 14:07:42 +01:00
ines
8db3da3c3d Refactor JS, split into modules and add nomodule option
rollup.js will be compiled by the rollup package and Babel on build, and will be loaded if a browser doesn't yet support JS modules
2017-10-30 14:06:25 +01:00
ines
5453821a9f Update NER annotation scheme
Add note on training data sources and include coarse-grained Wikipedia scheme
2017-10-30 13:53:49 +01:00
ines
df149455f9 Don't ever wrap navigation bar contents 2017-10-30 13:16:20 +01:00
ines
74dd0ee2c2 Prevent responsive tables form scrolling vertically 2017-10-30 13:16:06 +01:00
ines
ae45446978 Remove comment 2017-10-30 13:15:46 +01:00
ines
25f6331550 Allow other style arguments on +grid-col 2017-10-30 13:15:30 +01:00
ines
08869c19fd Merge mixins and mixins-base
The distinction was never clear anyways and it was progressively getting messier. So all mixins live in one file now.
2017-10-30 13:15:13 +01:00
ines
ae2ad5becc Remove charts from model direcory and add speed benchmarks
With speed benchmarks, charts ended up taking up too much space – and they were mostly data porn and not particularly useful anyways. Instead, we might add a "Compare" page that fetches all models and lets the user compare two or more models in terms of accuracy, speed etc.
2017-10-29 03:58:19 +01:00
ines
47fd254ba7 Combine table scroll shadows if row has only one cell 2017-10-29 03:56:37 +01:00
ines
b11928abc2 Adjust labels, spacing and hack specificity 2017-10-29 03:56:09 +01:00
ines
af0ba014d2 Document +code-new and +code-old 2017-10-29 03:54:13 +01:00
ines
9b6828bd83 Add height option to +chart and document 2017-10-29 03:53:59 +01:00
ines
e18744823b Add placeholders for Italian and Portuguese models 2017-10-29 01:29:39 +02:00
ines
3b1cfa3455 Add GPL license link 2017-10-29 01:18:32 +02:00
ines
5147cdc468 Fix formatting and add missing v2 label 2017-10-29 01:18:09 +02:00
ines
53bfcdba31 Make tooltips/tags and old/new code blocks more accessible (see #(see #1471))
Always add tooltip text as hidden label. Use different tooltip icons for tags and inline help icons. Add labels to old/new code blocks and add option to customise label text.
2017-10-29 01:17:49 +02:00
ines
4a4f9666b2 Improve style/accessibility of yes/no/neutral icons (see #1471)
Use distinctive icons instead of only colour, add proper handling of labels (hidden or visible, but always present) with optional custom text.
2017-10-29 01:14:30 +02:00
ines
a8e10f94e4 Tidy up Lexeme and update docs 2017-10-27 21:07:50 +02:00
ines
5167a0cce2 Tidy up Vectors and docs 2017-10-27 19:45:19 +02:00
ines
544a407b93 Tidy up Doc, Token and Span and add missing docs 2017-10-27 17:07:26 +02:00
ines
6a0483b7aa Tidy up and document Doc, Token and Span 2017-10-27 15:41:45 +02:00
ines
298c3d973c Document Doc.get_lca_matrix 2017-10-27 14:37:53 +02:00
ines
9ff9afe889 Update spacy convert CLI docs 2017-10-27 14:37:42 +02:00
ines
52f1bf2729 Adjust GitHub embeds 2017-10-27 12:30:59 +02:00
Ines Montani
4033e70c71 Merge pull request #1461 from explosion/feature/disable-pipes
💫 Add Language.disable_pipes(), to temporarily edit pipeline and update code examples
2017-10-27 12:21:40 +02:00
ines
b5643d8575 Update intent parser docs and add to usage docs 2017-10-27 04:49:05 +02:00
ines
954c88f4d8 Fix formatting 2017-10-27 04:48:41 +02:00
ines
af28ca1ba0 Move example to pipeline directory 2017-10-27 02:00:01 +02:00
ines
1d69a46cd4 Update multi-processing example and add to docs 2017-10-27 01:58:55 +02:00
ines
647ef64f86 Update textcat docs 2017-10-27 00:51:29 +02:00
ines
a7b9074b4c Update textcat training example and docs 2017-10-27 00:48:45 +02:00
ines
cc2917c9e8 Update fastText example and add to examples in docs 2017-10-26 18:47:02 +02:00
ines
daed7ff8fe Update information extraction examples 2017-10-26 18:46:11 +02:00
ines
b90e958975 Update tagger and parser examples and add to docs 2017-10-26 16:27:42 +02:00
ines
0575e9cf20 Add parser example to docs 2017-10-26 16:12:34 +02:00
ines
281f88a59c Update NER training examples 2017-10-26 14:44:43 +02:00
ines
8116d1a077 Add note on biluo_tags_from_offsets helper 2017-10-26 14:44:32 +02:00
ines
9bf78d5fb3 Update spacy.explain docs 2017-10-26 13:04:25 +02:00
ines
96b4214303 Add notes on pipe template inheritance in docs 2017-10-26 12:57:32 +02:00
ines
e6536d231f Update new entity type training example in docs 2017-10-25 22:17:23 +02:00
ines
400812d9b1 Add add_label method to Pipe 2017-10-25 22:17:11 +02:00
ines
70de2dd035 Display vectors in models directory if available (see #1457) 2017-10-25 16:15:37 +02:00
ines
1a722dac31 Merge branch 'develop' into feature/disable-pipes 2017-10-25 15:18:18 +02:00
ines
0102561f34 Update docs 2017-10-25 13:57:55 +02:00
ines
68e9de6917 Add documentation 2017-10-25 13:57:14 +02:00
ines
3484174e48 Add Language.path 2017-10-25 11:57:43 +02:00
ines
c815ff65f6 Update feature list 2017-10-24 21:49:11 +02:00
ines
d71702b827 Fix formatting 2017-10-24 20:11:04 +02:00
ines
6686e53530 Allow GitHub embeds to specify optional language 2017-10-24 16:00:56 +02:00
ines
56a47f137f Add title description for tokenizer 2017-10-24 16:00:56 +02:00
ines
3944c1d6e7 Document lemmatizer 2017-10-24 16:00:56 +02:00
ines
c9dc88ddfc Document current JSON format for training 2017-10-24 16:00:56 +02:00
Matthew Honnibal
ef3e5a361b Merge pull request #1442 from explosion/feature/fix-sp
💫Fix SP tag, tweak Vectors.__init__, fix Morphology
2017-10-24 10:24:07 +02:00
Matthew Honnibal
fdf25d10ba Merge pull request #1440 from ramananbalakrishnan/develop
Support single value for attribute list in doc.to_array
2017-10-24 10:23:12 +02:00
ines
7701984f13 Document Span.as_doc 2017-10-23 10:38:27 +02:00
ines
db15902e84 Tidy up 2017-10-23 10:38:21 +02:00
ines
3f0a157b33 Fix typo 2017-10-23 10:38:13 +02:00
Matthew Honnibal
ebecaddb76 Make 'data_or_width' two keyword args in Vectors.__init__
Previously the data and width options were one argument in Vectors,
which meant you couldn't say vectors = Vectors(strings, width=300).
It's better to have two keywords.
2017-10-20 14:17:15 +02:00
ines
108f1f786e Update symbols and document missing token attributes (see #1439) 2017-10-20 13:08:44 +02:00
ines
4acab77a8a Add missing symbol for LAW entities (resolves #1427) 2017-10-20 13:07:57 +02:00
Ramanan Balakrishnan
d44a079fe3
Update documentation on doc.to_array 2017-10-20 14:25:38 +05:30
Matthew Honnibal
61bc203f3f Merge pull request #1438 from explosion/feature/fast-parser
💫 Improve runtime CPU efficiency of parser/NER
2017-10-19 02:42:21 +02:00
Matthew Honnibal
d4cfff0476 Comment out currently hard-coded hyper-params 2017-10-19 00:47:24 +02:00
Ines Montani
f0d577e460 Merge pull request #1425 from explosion/feature/hindi-tokenizer
💫 Basic Hindi tokenization support
2017-10-18 13:34:52 +02:00
ines
a74cba2ffa Remove Binder from docs (now covered by Doc API) 2017-10-17 16:27:19 +02:00
ines
8ca344712d Add Language.has_pipe method 2017-10-17 11:20:07 +02:00
ines
4cfe259266 Fix formatting 2017-10-16 20:36:41 +02:00
ines
18793efef1 Remove Russian from v2.0 docs for now 2017-10-16 20:36:36 +02:00
ines
d383612225 Add note about word vectors in example (see #1117) 2017-10-16 20:31:58 +02:00
Matthew Honnibal
010a7309ff Merge pull request #1402 from explosion/feature/fix-matcher-operators
💫 Fix Matcher variable-length operators
2017-10-16 17:53:19 +02:00
ines
63393b4e0d Update matcher docs to reflect operator changes 2017-10-16 13:44:12 +02:00
ines
15514dc333 Add section on upgrading 2017-10-14 22:14:47 +02:00
ines
c0aceb9fbe Add Hindi to supported languages 2017-10-14 15:16:41 +02:00
ines
a5da683578 Add Russian to alpha docs and update tokenizer dependencies 2017-10-14 12:52:41 +02:00
ines
a69f4e56e5 Remove outdated aside 2017-10-14 12:52:07 +02:00
ines
bb6ecb82e5 Ensure long file paths in code examples break if needed 2017-10-14 12:51:52 +02:00
ines
bfd9506f1d Update extensions docs and add resources 2017-10-13 00:18:13 +02:00
ines
5f5d6897e8 Increment version 2017-10-13 00:18:02 +02:00
ines
9fd68334ab Add validate command docs 2017-10-12 23:36:48 +02:00
Ines Montani
37aa523a8e Merge pull request #1408 from explosion/feature/dot-underscore
💫 Custom attributes via Doc._, Token._ and Span._
2017-10-11 18:35:56 +02:00
ines
eac9e99086 Update docs on adding lemmatization to languages 2017-10-11 14:21:15 +02:00
ines
f4ae6763b9 Fix consistency of imports from spacy.tokens in examples 2017-10-11 02:30:40 +02:00
ines
19598ebfee Update migration guide 2017-10-10 06:38:11 +02:00
ines
9c96a6e131 Update pipelines section in v2 overview 2017-10-10 06:33:53 +02:00
Matthew Honnibal
09d61ada5e Merge pull request #1396 from explosion/feature/pipeline-management
💫 Improve pipeline and factory management
2017-10-10 04:29:54 +02:00
ines
6679117000 Add pipeline component examples 2017-10-10 04:26:06 +02:00
ines
7a592d01dc Update pipeline component usage docs 2017-10-10 04:24:39 +02:00
ines
3d5154811a Fix typo 2017-10-10 04:24:22 +02:00
ines
43b70651fb Document extension methods on Doc, Token and Span
set_extension, get_extension, has_extension
2017-10-10 04:23:37 +02:00
ines
b4fc6b203c Rename mixin 2017-10-10 04:22:23 +02:00
ines
de374dc72a Merge branch 'feature/pipeline-management' into feature/dot-underscore 2017-10-09 14:37:51 +02:00
ines
6c253db3fe Add section for developing spaCy extensions 2017-10-09 14:36:56 +02:00
ines
6550d0547c Fix typo 2017-10-09 14:36:36 +02:00
ines
4d248ea920 Fix spacing on bulleted lists 2017-10-09 14:36:30 +02:00
ines
2ac8b5c622 Add wrapper for before/after code examples 2017-10-09 14:36:20 +02:00
ines
ca6769fd48 Update spacy functions and remove removed set_factory 2017-10-07 15:28:01 +02:00
ines
743d1df1fe Update pipelines docs and add user hooks to custom components 2017-10-07 15:27:28 +02:00
Matthew Honnibal
eb0595bea9 Merge pull request #1392 from explosion/feature/parser-history-model
💫 Parser history features
2017-10-07 15:07:02 +02:00
ines
d70cf19158 Fix formatting 2017-10-07 15:06:38 +02:00
ines
c970b4f226 Add missing token attribute 2017-10-07 15:04:16 +02:00
ines
37f755897f Update rule-based matching docs 2017-10-07 15:04:09 +02:00
Matthew Honnibal
e22067e3b5 Document new hyper-parameters 2017-10-07 07:10:10 -05:00
ines
feaf353051 Update processing pipelines usage docs 2017-10-07 14:05:59 +02:00
ines
58dfde7c02 Remove redundante deprecation note 2017-10-07 04:54:57 +02:00
ines
ed8e0085b0 Update docs for spacy.load() 2017-10-07 03:06:55 +02:00
ines
e370332fb1 Update Language API docs 2017-10-07 03:00:20 +02:00
ines
3468d535ad Update model benchmarks 2017-10-06 21:39:06 +02:00
ines
96a4e79d13 Fix PhraseMatcher example 2017-10-06 18:22:10 +02:00
ines
bb13aa4bf3 Fix typos in PhraseMatcher docs 2017-10-04 16:12:09 +02:00
ines
33cf9cecdd Port over changes from #1386 2017-10-04 13:34:03 +02:00
ines
36ff525ff5 Add NER P and NER R scores to model overview 2017-10-04 00:37:15 +02:00
ines
15ec7ddd09 Add docs for new spacy evaluate command 2017-10-04 00:19:03 +02:00
ines
464f14019d Fix typos 2017-10-04 00:18:47 +02:00
ines
bfb512f45a Add website package.json and fix gitignore 2017-10-04 00:18:41 +02:00
ines
80a2fb6193 Update visualizers docs and add submenu 2017-10-03 19:40:39 +02:00
ines
5fb057b575 Fix secondary font stack 2017-10-03 15:45:07 +02:00
ines
b24fbd8aad Fix titles for social cards 2017-10-03 14:54:33 +02:00
ines
23019d1daa Add styleguide 2017-10-03 14:28:24 +02:00
ines
319fac14fe Update global config and landing page 2017-10-03 14:28:18 +02:00
ines
22dd929b65 Add models documentation 2017-10-03 14:28:03 +02:00
ines
808f7ee417 Update API documentation 2017-10-03 14:27:22 +02:00
ines
3f4fd2c5d5 Update usage documentation 2017-10-03 14:26:20 +02:00
ines
9af604f0da Update layout templates, partials and mixins 2017-10-03 14:20:13 +02:00
ines
49b58d35fd Update JavaScript 2017-10-03 14:18:49 +02:00
ines
a8ff8423bb Update image assets, icons and SVGs
Move SVG sprite to Jade file and include in template. Only use SVG
symbols for logos.
2017-10-03 14:17:41 +02:00
ines
7d01d7411b Update web fonts 2017-10-03 14:15:36 +02:00
ines
3e1b971b16 Update CSS 2017-10-03 14:14:52 +02:00
Reza Gharibi
0461b82158 Fix typos 2017-09-27 03:56:20 +03:30
Reza Gharibi
fa1844b132 Fix typo 2017-09-27 03:55:54 +03:30
Reza Gharibi
b5dd7e7cc4 Fix typo 2017-09-27 03:55:28 +03:30
Ines Montani
b8e81daccf Fix typo (closes #1312) 2017-09-14 12:49:59 +02:00
ines
d15775c3ad Fix typos and commands in alpha docs 2017-08-21 13:40:11 +02:00
ines
3c33003078 Port over typo corrections from #1245 2017-08-20 12:00:17 +02:00
ines
1261b01e46 Update Doc.char_span docs 2017-08-19 16:34:32 +02:00
ines
5cb0200e63 Document new Span.to_array() method 2017-08-19 12:45:28 +02:00
ines
471eed4126 Add example to Span.merge() 2017-08-19 12:45:16 +02:00
ines
404d3067b8 Document new Doc.char_span() method 2017-08-19 12:45:00 +02:00
ines
d53cbf369f Document as_tuples kwarg on Language.pipe() 2017-08-19 12:44:50 +02:00
ines
6a37c93311 Update argument type 2017-08-19 12:44:33 +02:00
ines
4731d50220 Add break utility for long nowrap items (e.g. code) 2017-08-19 12:44:23 +02:00
ines
0aba11b64b Update package command docs 2017-08-14 16:45:44 +02:00
ines
52c6302223 Allow prompt setting on code mixin 2017-08-14 13:05:01 +02:00
ines
a29f132ffd Change python -m spacy to spacy
Reflects latest change to entry point or auto-alias
2017-08-14 13:04:48 +02:00
Nikolai Kruglikov
08e443e083 Fix small typo in documentation 2017-08-14 12:19:04 +02:00
ines
ab8ffbaab7 Add text classification to v2 overview 2017-07-22 17:56:51 +02:00
ines
f085b88f9d Add TextCategorizer API docs stub 2017-07-22 17:56:33 +02:00
ines
ab1a4e8b3c Add Tensorizer API docs stub 2017-07-22 17:56:25 +02:00
ines
0fb89dd204 Add text classification usage guide template 2017-07-22 17:56:07 +02:00
ines
d05ab1b3a0 Add text classification to 101 overview and change order 2017-07-22 17:55:53 +02:00
ines
d2a7e5b8e5 Add GoldParse.cats attribute 2017-07-22 17:55:35 +02:00
ines
23d976ed00 Add Doc.cats attribute and missing v2 tag 2017-07-22 17:55:14 +02:00
Ines Montani
1ddbeddca2 Fix typo 2017-07-22 15:00:58 +02:00
Jarle Mathiesen
f20533ec0c fix small typo 2017-06-24 12:31:33 +02:00
Savva Kolbachev
800a8faff4 Changed the capital of Lithuania to Vilnius
Hi,
There is a typo about the capital of Lithuania.

Vilnius is the capital of Lithuania https://en.wikipedia.org/wiki/Vilnius
Ljubljana is the capital of Slovenia https://en.wikipedia.org/wiki/Ljubljana
2017-06-12 23:27:00 +03:00
Ines Montani
57f64b9e1c Merge pull request #1124 from v3t3a/patch-3
docs - Fix url error for Displacy Ent visualizer
2017-06-12 21:20:32 +02:00
Ines Montani
b2a28028cf Merge pull request #1115 from v3t3a/patch-2
docs - Add read() method when opening file (Lightning tour)
2017-06-12 21:19:25 +02:00
Ines Montani
fe8d136ae0 Merge pull request #1114 from v3t3a/patch-1
docs - Update doc.jade (Just remove a duplicate 'doc =')
2017-06-12 21:19:02 +02:00
Vetea
eae1f7b19c Fix url error for Displacy Ent visualizer 2017-06-12 14:30:02 +02:00
ines
49026a1346 Fix typos in example (see #1105) 2017-06-08 19:15:50 +02:00
Vetea
cc3aee1189 Add read() method when opening file
Add read() method for 

to avoid :
```TypeError: Argument 'string' has incorrect type (expected str, got _io.TextIOWrapper)```

Test with:
spaCy : v2.0.0 Alpha
python : 3.5.2+ (default, Sep 22 2016, 12:18:14)
2017-06-08 11:27:09 +02:00
Vetea
8e20cf6368 Update doc.jade
Just remove a duplicate 'doc ='
2017-06-08 10:35:58 +02:00
ines
6b799bac54 Fix formatting and details 2017-06-06 14:37:49 +02:00
ines
6c34b1a65b Update alpha thread link 2017-06-06 00:58:12 +02:00
ines
c921ba109a Fix robots and meta 2017-06-05 20:07:52 +02:00
ines
fd9ae0f0e0 Update v2 comparison table 2017-06-05 16:39:11 +02:00
ines
a3f9745a14 Update similarity usage guide and examples 2017-06-05 15:37:33 +02:00
ines
fd35d910b8 Update v2 docs and benchmarks 2017-06-05 14:13:38 +02:00
ines
9f55c0d4f6 Add Vectors class 2017-06-05 13:33:11 +02:00
ines
040553ca59 Update architecture and features table 2017-06-05 13:33:01 +02:00
ines
e204788c30 Add docs for util.load_model_from_path 2017-06-05 13:18:22 +02:00
ines
efc37ea3de Update train CLI 2017-06-04 23:45:14 +02:00
ines
505d43b832 Update norms example 2017-06-04 23:33:26 +02:00
ines
f8e93b6d0a Update norms example 2017-06-04 23:24:29 +02:00
ines
a857b2b511 Update norms example 2017-06-04 23:21:37 +02:00
ines
47d066b293 Add under construction 2017-06-04 23:17:54 +02:00
ines
e9816daa6a Add details on syntax iterators 2017-06-04 23:16:33 +02:00
ines
6438428ce8 Update v2 infobox 2017-06-04 22:09:33 +02:00
ines
990cb81556 Add info on syntax iterators 2017-06-04 21:47:22 +02:00
ines
e4eb33daf7 Add links to production use guide 2017-06-04 20:56:58 +02:00
ines
63cd539d04 Add more details on model packages and requirements.txt (see #1099) 2017-06-04 20:52:10 +02:00
ines
97ff83d163 Fix docs on model loading 2017-06-04 20:44:59 +02:00
ines
b6002db797 Add v2 label 2017-06-04 18:53:03 +02:00
ines
e76baccd51 Add alpha social image 2017-06-04 18:43:14 +02:00
ines
468ff1a7dd Update v2 docs and add benchmarks stub 2017-06-04 15:34:28 +02:00
Matthew Honnibal
23fd6b1782 Add intro narrative for v2 2017-06-04 15:10:37 +02:00
ines
3419ecbfdd Update docs on model shortcut links 2017-06-04 13:55:00 +02:00
ines
586e901143 Add v2 intro stub 2017-06-04 13:42:37 +02:00
ines
4f8f62d9b3 Merge branch 'v2-docs-edits' into develop 2017-06-04 13:40:58 +02:00
ines
809903dcad Fix link and update wording 2017-06-04 13:29:20 +02:00
ines
22dd18c364 Remove redundant CPU commands 2017-06-04 13:29:13 +02:00
ines
1d6377218a Update architecture blurb and move other info 2017-06-04 13:28:58 +02:00
ines
eb66625c69 Also add disallow robots.txt for alpha mode 2017-06-04 13:14:32 +02:00
ines
7a66c9f039 Fix formatting 2017-06-04 13:14:00 +02:00
Matthew Honnibal
f2c4a9f690 Edits to spacy-101 page 2017-06-04 13:10:27 +02:00
Matthew Honnibal
aca53b95e1 Link architecture blurb 2017-06-04 13:10:06 +02:00
Matthew Honnibal
64ca5123bb Add Architecture 101 blurb 2017-06-04 13:09:19 +02:00
Matthew Honnibal
e77ed953f4 Update GPU instructions 2017-06-04 12:03:22 +02:00
ines
1d3b012e56 Update adding languages docs and add 101 2017-06-03 23:54:23 +02:00
ines
a3715a81d5 Update adding languages guide 2017-06-03 22:16:38 +02:00
ines
ec6d2bc81d Add table of contents mixin 2017-06-03 22:16:26 +02:00
ines
9acf8686f7 Update note on compact mode issues 2017-06-03 13:31:16 +02:00
ines
b0225183c2 Update displaCy defaults 2017-06-03 13:27:06 +02:00
ines
c60431357d Port over docs typo corrections 2017-06-03 11:31:30 +02:00
ines
9064fbbf1e Fix empty arguments in mixins 2017-06-01 18:57:02 +02:00
ines
8bee34126d Update model size 2017-06-01 18:22:35 +02:00
ines
6c908700c4 Add alpha badge 2017-06-01 18:20:33 +02:00
ines
c6dc2fafc0 Add Spanish and move example sentences to meta 2017-06-01 17:49:56 +02:00
ines
1bebc6392c Add source files to pipeline components 2017-06-01 17:38:06 +02:00
ines
b577ed79ee Move social image logic out to function and move files 2017-06-01 14:27:44 +02:00
ines
8fc52878f7 Make graphic smaller 2017-06-01 13:03:54 +02:00
ines
5e60b09dcd Fix custom tokenizer example 2017-06-01 13:02:50 +02:00
ines
706cec6d58 Move annotation specs up 2017-06-01 13:02:43 +02:00
ines
fd77917c5a Remove bottom padding from sidebar 2017-06-01 13:02:36 +02:00
ines
8274dffad6 Update NER training draft 2017-06-01 12:51:36 +02:00
ines
04fac3f52a Add NER training example code 2017-06-01 12:47:47 +02:00
ines
7f5e7e7320 Fix typo 2017-06-01 12:47:36 +02:00
ines
5cef1dd305 Always use develop branch of GitHub links in ALPHA mode 2017-06-01 12:47:30 +02:00
ines
4a927154d8 Update v2 docs 2017-06-01 11:56:32 +02:00
ines
03bbb96db8 Remove outdated examples 2017-06-01 11:56:02 +02:00
ines
789e69b73f Update training guide 2017-06-01 11:53:23 +02:00
ines
2f40d6e7e7 Add training 101 2017-06-01 11:53:16 +02:00
ines
abed463bbb Update serialization 101 2017-06-01 11:52:58 +02:00
ines
72380c952a Update training section in NER guide and add links 2017-06-01 11:52:49 +02:00
ines
77dca25c7f Update Language API docs 2017-06-01 11:51:31 +02:00
ines
9c975c4882 Add training illustrations 2017-06-01 11:51:22 +02:00
ines
bea6e6bfad Allow annotation row to take children 2017-06-01 11:51:14 +02:00
ines
22b1f72870 Add spaCy 101 intro 2017-05-31 12:44:09 +02:00
ines
a18b95ca12 Update docs on testing 2017-05-31 12:43:40 +02:00
ines
baa6070548 Fix ID of quickstart group to avoid conflicts 2017-05-31 12:43:30 +02:00
ines
981196c181 Fix typo 2017-05-31 11:34:31 +02:00
ines
f86289566a Update new in v2 section and add note on Matcher acceptors 2017-05-30 13:53:06 +02:00
ines
ce4e45d0bb Update 101 intro 2017-05-29 22:15:06 +02:00
ines
b5bfab8699 Add description 2017-05-29 15:27:16 +02:00
ines
687ed28340 Update processing pipelines guide 2017-05-29 14:21:00 +02:00
ines
d5992f408f Update note on vocab consistency 2017-05-29 14:14:26 +02:00
ines
567485a818 Fix and document model loading with pipeline and overrides 2017-05-29 14:10:10 +02:00
ines
a2134951f2 Update 101 and add note on pipeline order and tensors 2017-05-29 11:45:32 +02:00
ines
17b635eaab Update alpha docs note and fix typo 2017-05-29 11:09:24 +02:00
ines
fbe105f1eb Add note on L in long integers in Python 2 2017-05-29 11:05:05 +02:00
ines
9d74810f6f Update examples 2017-05-29 01:09:52 +02:00
ines
42cf414138 Update Matcher example 2017-05-29 01:09:52 +02:00
ines
00b2094dc3 Fix typos, long integers and tests 2017-05-29 01:09:52 +02:00
ines
18b8050b07 Revert "Update syntax highlighting regex for long integers"
This reverts commit 11f2e80c6a.
2017-05-29 01:09:52 +02:00
ines
d71c6db76e Add missing Chainer install for GPU if building spaCy from source 2017-05-28 23:34:59 +02:00
ines
e0f9ccdaa3 Update texts and rename vectorizer to tensorizer 2017-05-28 23:26:13 +02:00
ines
606879b217 Update hash strings examples 2017-05-28 19:42:44 +02:00
ines
c7b57ea314 Update docs and change integer IDs to hash values 2017-05-28 19:25:34 +02:00
ines
738b4f7187 Add quickstart options and docs for GPU 2017-05-28 19:20:11 +02:00
ines
4c00cb8c8b Update 101 and add community/FAQ and table of contents 2017-05-28 18:45:49 +02:00
ines
0ea31d1e31 Add under construction note to pipeline components 2017-05-28 18:44:07 +02:00
ines
8a148b6563 Fix code, links and formatting 2017-05-28 18:29:16 +02:00
ines
11f2e80c6a Update syntax highlighting regex for long integers 2017-05-28 18:24:29 +02:00
ines
414193e9ba Update docs to reflect StringStore changes 2017-05-28 18:19:11 +02:00
ines
69bda9aed7 Update text, examples, typos, wording and formatting 2017-05-28 16:41:01 +02:00
ines
f8185b8e11 Rename vocab-stringsotre to vocab 2017-05-28 16:37:14 +02:00
ines
57ea94f0e3 Add markdown icon 2017-05-28 16:36:47 +02:00
ines
bd79e683f6 Move code block border to own modifier class 2017-05-28 16:36:42 +02:00
ines
20ffb56148 Fix overwriting of navigation in ALPHA mode 2017-05-28 16:36:31 +02:00
ines
189db308d9 Only add coloured border to code block if icon has colour 2017-05-28 16:36:21 +02:00
ines
b85d88fac6 Update quickstart mixin to make it more customisable 2017-05-28 16:36:07 +02:00
ines
10d05c2b92 Fix typos, wording and formatting 2017-05-28 01:30:12 +02:00
ines
eb5a8be9ad Update language overview and add section on 'xx' lang class 2017-05-28 01:15:44 +02:00
ines
01a7b10319 Add fallback fonts to illustrations 2017-05-28 00:32:54 +02:00
ines
eb703f7656 Update API docs 2017-05-28 00:32:43 +02:00
ines
c1983621fb Update util functions for model loading 2017-05-28 00:22:40 +02:00
ines
db116cbeda Update tokenization 101 and add illustration 2017-05-28 00:22:40 +02:00
ines
b03fb2d7b0 Update 101 and usage docs 2017-05-28 00:22:40 +02:00
ines
ae11c8d60f Add emoji sentiment to lightning tour matcher example 2017-05-27 20:02:20 +02:00
ines
22bf5f63bf Update Matcher docs and add social media analysis example 2017-05-27 17:58:18 +02:00
ines
0d33ead507 Fix initialisation of Doc in lightning tour example 2017-05-27 17:58:06 +02:00
ines
e05bcd6aa8 Update docs to reflect flattened model meta.json
Don't use "setup" key and instead, keep "lang" on root level and add
"pipeline".
2017-05-27 17:57:46 +02:00
ines
70afcfec3e Update defaults and example 2017-05-26 14:04:31 +02:00
ines
1b982f0838 Update train command and add docs on hyperparameters 2017-05-26 14:02:38 +02:00
ines
1b9c6ded71 Update API docs and add "source" button to GH source 2017-05-26 13:40:32 +02:00
ines
93ee5c4a52 Update serialization info 2017-05-26 13:22:45 +02:00
ines
f122d82f29 Update usage docs and ddd "under construction" 2017-05-26 13:17:48 +02:00
ines
286c3d0719 Update usage and 101 docs 2017-05-26 12:46:29 +02:00
ines
6d76c1ea16 Add 101 for Vocab, Lexeme and StringStore 2017-05-26 12:45:01 +02:00
ines
d8fd002e59 Add illustration for Vocab & StringStore 2017-05-26 12:43:49 +02:00
ines
a7de5f0155 Update SVG illustrations and use unique CSS classes 2017-05-26 12:43:38 +02:00
ines
d48530835a Update API docs and fix typos 2017-05-26 12:43:16 +02:00
ines
ea9474f71c Add version tag mixin to label new features 2017-05-26 12:42:36 +02:00
ines
10ca6d1507 Set additional min-width on icons
Prevents icons from being scaled in flexbox containers
2017-05-26 12:39:59 +02:00
ines
353f0ef8d7 Use disable argument (list) for serialization 2017-05-26 12:33:54 +02:00
ines
9063654a1a Add Training 101 stub 2017-05-25 11:18:02 +02:00
ines
b2324be3e9 Fix typos, text, examples and formatting 2017-05-25 11:17:21 +02:00
ines
dcb10da615 Update and fix lightning tour examples 2017-05-25 11:15:56 +02:00
ines
4b5540cc63 Rewrite examples in lightning tour 2017-05-25 01:58:33 +02:00
ines
87c976e04c Update model tag 2017-05-25 01:58:22 +02:00
ines
fe2b0b8b8d Update migrating docs 2017-05-25 00:56:35 +02:00
ines
709ea58990 Tidy up workflows 2017-05-25 00:56:16 +02:00
ines
d122bbc908 Rewrite custom tokenizer docs 2017-05-25 00:30:21 +02:00
ines
0f48fb1f97 Rename processing text to production use and remove linear feature scheme 2017-05-25 00:10:33 +02:00
ines
419d265ff0 Add section on disabling pipeline components 2017-05-25 00:10:06 +02:00
ines
9efa662345 Update dependency parse docs and add note on disabling parser 2017-05-25 00:09:51 +02:00
ines
9337866dae Add aside to pipeline 101 table 2017-05-24 22:46:18 +02:00
ines
c25f3133ca Update section on new v2.0 features 2017-05-24 20:54:37 +02:00
ines
f4658ff053 Rewrite usage workflow on saving and loading 2017-05-24 20:54:02 +02:00
ines
764bfa3239 Add section on using displaCy in a web app 2017-05-24 20:53:43 +02:00
ines
4f396236f6 Update saving and loading docs 2017-05-24 19:25:49 +02:00
ines
8aaed8bea7 Add pipelines 101 and rewrite pipelines workflow 2017-05-24 19:25:13 +02:00
ines
54885b5e88 Add serialization 101 2017-05-24 19:24:40 +02:00
ines
b546bcb05f Add pipeline illustration 2017-05-24 19:21:18 +02:00
ines
823d22100b Tidy up architecture.svg 2017-05-24 19:21:12 +02:00
ines
8b86b08bed Update usage workflows 2017-05-24 11:59:08 +02:00
ines
66088851dc Add Doc.to_disk() and Doc.from_disk() methods 2017-05-24 11:58:17 +02:00
ines
10afb3c796 Tidy up and merge usage pages 2017-05-24 00:37:47 +02:00
ines
990a70732a Move installation troubleshooting to installation docs 2017-05-24 00:37:21 +02:00
ines
697d3d7cb3 Fix links to CLI docs 2017-05-24 00:36:38 +02:00
ines
4fb5fb7218 Update v2 docs 2017-05-23 23:40:04 +02:00
ines
e6d88dfe08 Add features table to 101 2017-05-23 23:38:33 +02:00
ines
7ef7f0b42c Add linguistic annotations 101 content 2017-05-23 23:37:51 +02:00
ines
9ed6b48a49 Update dependency parse workflow 2017-05-23 23:34:39 +02:00
ines
fe24267948 Update usage docs meta and navigation 2017-05-23 23:19:20 +02:00
ines
af348025ec Update word vectors & similarity workflow 2017-05-23 23:19:09 +02:00
ines
b6c62baab3 Update What's new in v2 docs 2017-05-23 23:18:53 +02:00
ines
b6209e2427 Update POS tagging workflow 2017-05-23 23:18:08 +02:00
ines
43258d6b0a Update NER workflow 2017-05-23 23:17:57 +02:00
ines
61cf2bba55 Fix code example 2017-05-23 23:17:37 +02:00
ines
1c06ef3542 Update spaCy architecture 2017-05-23 23:17:25 +02:00
ines
a433e5012a Update adding languages docs 2017-05-23 23:16:44 +02:00
ines
3523715d52 Add spaCy 101 components 2017-05-23 23:16:31 +02:00
ines
a38393e2f6 Update annotation docs 2017-05-23 23:16:17 +02:00
ines
786af87ffb Update IOB docs 2017-05-23 23:15:50 +02:00
ines
3aff883434 Add displaCy examples to lightning tour 2017-05-23 23:15:39 +02:00
ines
6ef09d7ed8 Change save_to_directory to to_disk 2017-05-23 23:15:31 +02:00
ines
c8bde2161c Add kwargs to spacy.load 2017-05-23 23:14:02 +02:00
ines
0a8a2d2f6d Remove tip infoboxes from annotation docs 2017-05-23 23:13:51 +02:00
ines
00ede349dc Add table row for linguistic annotations 2017-05-23 23:13:37 +02:00
ines
7e5163402e Allow clipping code block to height and add docs 2017-05-23 23:13:26 +02:00
ines
05761e1750 Allow size on procon icon 2017-05-23 23:11:38 +02:00
ines
e6acd3bbf2 Fix matcher tests and matcher docs 2017-05-23 11:36:02 +02:00
ines
f497cf60b2 Update formatting 2017-05-23 11:32:25 +02:00
ines
4cd26bcb83 Update docs on rule-based matching and add examples 2017-05-22 19:04:02 +02:00
ines
701cba1524 Update models documentation with notes 2017-05-22 18:53:14 +02:00
ines
a23f487b06 Tidy up displaCy and add "manual" option
Also don't require title in EntityRenderer
2017-05-22 18:48:20 +02:00
ines
aa9c3bd464 Fix formatting 2017-05-22 13:55:01 +02:00
ines
dddad5bf26 Update util.prints docs 2017-05-22 13:54:52 +02:00
ines
d5a6a9a6a9 Use string values for attrs in Matcher docs 2017-05-22 13:54:45 +02:00
ines
54f04a9fe0 Update API docs with changes in spacy.gold and spacy.language 2017-05-22 12:29:30 +02:00
ines
fc3ec733ea Reduce complexity in CLI
Remove now redundant model command and move plac annotations to cli
files
2017-05-22 12:28:58 +02:00
ines
cc569a348d Add quickstart widget to models and update docs
Add global variable for models and generate all model listings
programmatically
2017-05-21 20:55:52 +02:00
ines
a87da31271 Fix formatting and add subtle borders for tooltips on dark backgrounds 2017-05-21 20:51:30 +02:00
ines
5c06cf71ab Add different options for styling bash, python and divider commands 2017-05-21 20:51:17 +02:00
ines
0864a8ddd8 Allow desctiption, group help, fix help icon and add style option to commands 2017-05-21 20:51:00 +02:00
ines
f56cdf4ed1 Add quickstart.js note to mixin 2017-05-21 20:50:11 +02:00
ines
2c5cfe8bbf Update docstrings and API docs for StringStore 2017-05-21 14:18:58 +02:00
ines
251346b59f Fix typos and formatting 2017-05-21 14:18:46 +02:00
ines
075f5ff87a Update docstrings and API docs for GoldParse 2017-05-21 13:53:46 +02:00
ines
465a1dd710 Add BILUO scheme to annotation docs 2017-05-21 13:53:34 +02:00
ines
c9f04f3cd0 Add note on automated processes to download command 2017-05-21 13:23:39 +02:00
ines
869dbf92ce Add option for code blocks with (colored) icons
Plus "old" / "new" style with green accept / red reject icon
2017-05-21 13:22:34 +02:00
ines
8ab59515b2 Fix typo and use consistent description for from_bytes 2017-05-21 13:18:39 +02:00
ines
c5a653fa48 Update docstrings and API docs for Tokenizer 2017-05-21 13:18:14 +02:00
ines
d82ae9a585 Change "function" to "callable" in docs 2017-05-21 13:17:40 +02:00
ines
ee3fdffffb Move attributes and remove deprecated methods 2017-05-21 01:18:31 +02:00
ines
1cb2c86f9a Update CLI docs 2017-05-21 01:13:05 +02:00
ines
272a8981c3 Add model tag to spacy.load API docs 2017-05-21 01:12:43 +02:00
ines
b53ed82f0f Fix +tag-model mixin to check for length of spread arguments 2017-05-21 01:12:30 +02:00
ines
3871157d84 Update spacy.util documentation 2017-05-21 01:12:09 +02:00
ines
da12aee0c1 Update spacy.load with note on get_lang_class 2017-05-21 00:19:26 +02:00
ines
924e8506de Move Defaults subclass to module scope (necessary for pickling) 2017-05-20 19:02:27 +02:00
ines
27de0834b2 Update docstrings and API docs for Lexeme 2017-05-20 15:13:42 +02:00
ines
7ed8a92ed1 Update docstrings and API docs for Token 2017-05-20 15:13:33 +02:00
ines
4ed6a36622 Update docstrings and API docs for Matcher 2017-05-20 14:43:10 +02:00
ines
39f36539f6 Update docstrings and API docs for Matcher 2017-05-20 14:32:34 +02:00
ines
c00ff257be Update docstrings and API docs for Matcher 2017-05-20 14:26:10 +02:00
ines
463e3cc80f Remove resize_vectors and vectors_length 2017-05-20 14:02:14 +02:00
ines
b218c1964a Update "What's new in v2.0" docs 2017-05-20 14:00:41 +02:00
ines
fec16d1649 Add option for "alpha mode" with warning and green theme 2017-05-20 14:00:41 +02:00
ines
f0cc642bb9 Update docstrings and API docs for Vocab 2017-05-20 14:00:41 +02:00
Matthew Honnibal
a93276bb78 Merge branch 'develop' of https://github.com/explosion/spaCy into develop 2017-05-20 13:55:12 +02:00
Matthew Honnibal
ce9234f593 Update Matcher API 2017-05-20 13:54:53 +02:00
ines
8b14476253 Fix typo 2017-05-20 13:00:13 +02:00
ines
6557ff9e85 Update example 2017-05-20 13:00:07 +02:00
ines
fea4925f41 Reorganise API docs navigation 2017-05-20 12:59:57 +02:00
ines
b2678372c7 Add API docs for top-level spaCy functions
i.e. spacy.load(), spacy.info(), spacy.explain()
2017-05-20 12:59:44 +02:00
ines
797f10ab16 Update formatting 2017-05-20 12:59:16 +02:00
ines
e10c48210d Update Matcher API and workflow to reflect new API
on_match is now the second positional argument, to easily allow a
variable number of patterns while keeping the method clean and readable.
2017-05-20 12:59:03 +02:00
ines
eb521af267 Fix formatting 2017-05-20 12:58:15 +02:00
ines
7973912114 Update CLI docs 2017-05-20 12:58:05 +02:00
ines
4587fdf3c9 Set API mixin to nowrap to not break between text and icon 2017-05-20 12:57:46 +02:00
ines
eb3fcc7fc5 Add green theme 2017-05-20 12:57:27 +02:00
ines
9edc7fb0ba Update Matcher API docs 2017-05-20 12:27:22 +02:00
ines
5163a4513e Update API docs 2017-05-20 01:43:48 +02:00
ines
784347160d Rewrite rule-based matching workflow 2017-05-20 01:38:55 +02:00
ines
7f9539da27 Fix old download command and formatting 2017-05-20 01:38:43 +02:00
ines
e3256e7406 Update Matcher API docs 2017-05-20 01:38:34 +02:00
ines
0cabf9e13f Fix model tag 2017-05-20 01:38:14 +02:00
ines
fe5d8819ea Update Matcher docstrings and API docs 2017-05-19 21:47:06 +02:00
ines
c8580da686 Update "requires model" tags 2017-05-19 20:24:46 +02:00
ines
8ef6bfebca Fix resetting of tooltip font 2017-05-19 20:24:32 +02:00
ines
ce095fdcde Add +tag-model mixin to label functionality that requires model
Usage: +tag-model("vectors")
2017-05-19 20:24:17 +02:00
ines
c3e903e4c2 Update examples and API docs 2017-05-19 19:59:02 +02:00
ines
e9e62b01b0 Update docstrings and API docs for Token 2017-05-19 18:47:56 +02:00
ines
62ceec4fc6 Update docstrings and API docs for Span 2017-05-19 18:47:46 +02:00
ines
23f9a3ccc8 Update docstrings and API docs for Doc 2017-05-19 18:47:39 +02:00
ines
2c8c9dc0c9 Update docstrings and API docs for Language 2017-05-19 18:47:24 +02:00
ines
c765e752f2 Adjust inline code colour to theme 2017-05-19 01:05:25 +02:00
ines
89f850eafa Use coloured icons for +api and +src 2017-05-19 01:05:16 +02:00
ines
0791f0aae6 Update docstrings and API docs for Span class 2017-05-19 00:31:31 +02:00
ines
5b68579eb8 Use returns/yields instead of return/yield 2017-05-19 00:02:34 +02:00
ines
b687ad109d Update docstrings and API docs for Doc class 2017-05-18 23:59:44 +02:00
ines
d42bc16868 Update docstrings and API docs for Language class 2017-05-18 23:57:38 +02:00
ines
b87066ff10 Update docstrings and API docs for Doc class 2017-05-18 22:17:41 +02:00
ines
0f513850ab Use same colour for table foot rot and infobox 2017-05-18 22:17:41 +02:00
ines
476b8209fe Update docs with new Jupyter auto-detection 2017-05-18 14:58:17 +02:00
ines
532927a3c4 Update quickstart 2017-05-17 17:35:56 +02:00
ines
11f52b8b83 Add headline to installation details and move aside 2017-05-17 12:04:03 +02:00
ines
533bb63816 Implement quickstart widget 2017-05-17 12:04:03 +02:00
ines
7b9466f625 Add mixins and styles for quickstart widget 2017-05-17 12:04:03 +02:00
ines
2e875c40a8 Add quickstart.js 2017-05-17 12:04:03 +02:00
ines
9df9a87d03 Add visualizer usage example 2017-05-17 12:04:03 +02:00
ines
6364a9be9d Add What's new and spaCy 101 stubs 2017-05-17 12:04:03 +02:00
ines
f4ae1e8750 Add section on adding titles to documents 2017-05-17 12:04:03 +02:00
ines
9979901b6f Fix formatting 2017-05-17 12:04:03 +02:00
ines
452d16d7a9 Rename API menu item to "Reference" 2017-05-17 12:04:03 +02:00
ines
02a4841e7b Move CLI docs to API reference 2017-05-17 12:04:03 +02:00
ines
95307d1e3c Add mixin to display help icon with tooltip 2017-05-17 12:04:03 +02:00
ines
fec918ba2c Update icons 2017-05-17 12:04:03 +02:00
ines
fb23799114 Add tooltips component 2017-05-17 12:04:03 +02:00
ines
accf05b0a9 Update visualizers docs 2017-05-15 14:37:01 +02:00
ines
d7244ae72d Add docs on collapse_punct option 2017-05-15 13:51:33 +02:00
ines
6d7986b7bc Update docs 2017-05-15 01:46:33 +02:00
ines
c6e8d55dcb Update NER workflow with new displaCy 2017-05-15 01:42:11 +02:00
ines
860a60e251 Fix explanation 2017-05-15 01:31:11 +02:00
ines
5c044cb670 Add visualizers usage docs 2017-05-15 01:25:18 +02:00
ines
c33bdeb564 Use uppercase for entity types 2017-05-15 01:24:57 +02:00
ines
3d37564a09 Remove resources from navigation for now
Not sure what to do with this page... maybe merge it with something
else?
2017-05-14 23:29:58 +02:00
ines
cf7e5ed534 Use American spelling for "visualizers"
Kinda sucks because we normally use British spelling, but it just looks
weird and confusing otherwise... same with tokenizer and all other
library internals. So this is sort of the "official policy" for now.
2017-05-14 23:29:36 +02:00
ines
fe5a5086e1 Fix typo 2017-05-14 23:27:56 +02:00
ines
1ae07da18f Add API docs for spacy.displacy (see #1058) 2017-05-14 19:31:23 +02:00
ines
844d64298d Fix formatting 2017-05-14 01:31:16 +02:00
ines
b462076d80 Merge load_lang_class and get_lang_class 2017-05-14 01:31:10 +02:00
ines
1465c6c221 Add API docs for util functions 2017-05-13 21:23:12 +02:00
ines
144161c58c Update links to dev resources 2017-05-13 21:23:02 +02:00
ines
0095d5322b Update adding languages docs 2017-05-13 18:54:10 +02:00
ines
1d94c0e98a Update table of contents 2017-05-13 15:42:51 +02:00
ines
a48e21755e Add section on testing language tokenizers 2017-05-13 15:39:27 +02:00
ines
326e677882 Fix syntax highlighting colour of keyword 2017-05-13 15:37:43 +02:00
ines
9f004394aa Use thicker & round dotted lines in graphic 2017-05-13 15:37:28 +02:00
ines
2f54fefb5d Update adding languages docs 2017-05-13 14:54:58 +02:00
ines
3665acc0de Update adding languages docs 2017-05-13 12:39:36 +02:00
ines
2e4db1beb9 Fix formatting 2017-05-13 12:02:39 +02:00
ines
3454f2aca8 Update showcase 2017-05-13 03:32:03 +02:00
ines
67726d1837 Update data model docs 2017-05-13 03:10:56 +02:00
ines
915b50c736 Update adding languages docs 2017-05-13 03:10:50 +02:00
ines
7f331eafcd Add SVG object 2017-05-13 03:10:41 +02:00
ines
d5c83a5810 Fix image mixin to allow figure with no args 2017-05-13 03:10:35 +02:00
ines
a74376dca9 Add flow chart graphics 2017-05-13 03:10:21 +02:00
ines
19879cb693 Update alpha support docs 2017-05-12 15:57:49 +02:00
ines
1774cf5152 Fix light versions of colors 2017-05-12 15:57:42 +02:00
ines
63d79947c8 Update title in navigation 2017-05-12 15:40:43 +02:00
ines
531ee1373b Rename "Language models" to "Languages" in API 2017-05-12 15:38:56 +02:00
ines
c4d2c3cac7 Update adding languages docs 2017-05-12 15:38:17 +02:00
ines
fac3566aac Add descriptions to POS tagging scheme 2017-05-03 20:11:02 +02:00
ines
1570b83ee5 Add spacy.explain() note to NER annotation scheme 2017-05-03 20:11:02 +02:00
ines
219369bb7d Add detailed docs for dependency label annotations 2017-05-03 20:11:02 +02:00
ines
0de98472b3 Increment CSS version 2017-05-03 20:11:02 +02:00
ines
7631d08d67 Adjust saturation of light theme color 2017-05-03 20:11:02 +02:00
ines
06e414b3fc Don't wrap inline code 2017-05-03 20:11:02 +02:00
ines
41c6085a6c Add pos-row and dep-row mixins to global mixins 2017-05-03 20:11:02 +02:00
ines
f9384b0fbd Update alpha languages and add aside for tokenizer dependencies 2017-05-03 09:58:31 +02:00
Yasuaki Uechi
0e7a9b9fac Add Japanese to 'Alpha support’ section 2017-05-03 13:56:45 +09:00
Ines Montani
fb96f88b59 Update info on CoNLL format and include link 2017-04-27 14:36:08 +02:00
M. Z. Ferdous (Imran)
c9f9203d5f fix typo, CONLL format
tried to google about connlu format. Saw there is conll format, not connlu.
2017-04-27 16:48:54 +06:00
ines
5aa49971f9 Add French example to models docs 2017-04-27 12:08:47 +02:00
ines
034ec5710b Fix typo and add Norwegian to alpha languages 2017-04-27 11:24:21 +02:00
ines
100846bed3 Fix typo in model list 2017-04-26 21:40:17 +02:00
ines
375edf0bb5 Add list of models and include French 2017-04-26 20:50:27 +02:00
ines
4eacd72bc3 Move list of models to own file 2017-04-26 20:50:27 +02:00
ines
c2006166d3 Update list of available models and info 2017-04-26 16:03:41 +02:00
ines
5a470367df Add mixin for model row in model docs 2017-04-26 16:03:17 +02:00
ines
5d598b6747 Add star icon 2017-04-26 16:03:05 +02:00
ines
6c4f3c6fc2 Allow styles arguments on row mixin 2017-04-26 16:02:59 +02:00
ines
99558023fd Add divider table row style 2017-04-26 16:02:44 +02:00
ines
e6bdf5bc5c Update adding language / training docs (see #966)
Add data examples and more info on training and CLI commands
2017-04-26 14:01:19 +02:00
ines
ae2b77db1b Fix info on naming conventions 2017-04-26 14:01:19 +02:00
Julien Chaumond
f997bceb07 Make object of the deep learning tutorial clearer
This is a great tutorial, but I think it is weirdly explained in the current form. The largest part of the code is about implementing the actual sentiment analysis model, not about counting entities. (which is not even present in the `deep_learning_keras.py` script in `examples`)
2017-04-24 11:55:41 +02:00
ines
2bfec1a4f8 Add note on languages with non-latin characters (see #996) 2017-04-23 15:58:40 +02:00
ines
ddd5194088 Update Language docs and docstrings 2017-04-17 01:52:13 +02:00
ines
2ab394d655 Fix whitespace 2017-04-17 01:45:00 +02:00
ines
7f776258f0 Add link to API docs 2017-04-17 01:41:46 +02:00
ines
aad80a291f Add save_to_directory method to API docs 2017-04-17 01:40:34 +02:00
ines
c6c3162c50 Fix lightning tour example (closes #889) 2017-04-17 00:00:30 +02:00
ines
02e7512b91 Increment version 2017-04-16 22:39:58 +02:00
ines
de5062711b Update adding languages workflow to reflect changes in __init__.py 2017-04-16 22:26:46 +02:00
ines
e4dd645c37 Update link 2017-04-16 20:37:46 +02:00
ines
dea79224ed Remove saving & loading docs and link to new workflow 2017-04-16 20:37:45 +02:00
ines
c365795bf6 Update navigation 2017-04-16 20:37:45 +02:00
ines
5bbbb7674b Add training examples to tutorials 2017-04-16 20:37:45 +02:00
ines
17e9743388 Add saving & loading models docs 2017-04-16 20:37:45 +02:00
ines
b15bdb5279 Update training docs 2017-04-16 20:37:45 +02:00
ines
5cb17b9f33 Add NER training docs 2017-04-16 20:37:45 +02:00
ines
d29c825ca4 Update docs for package command 2017-04-16 13:37:24 +02:00
ines
cf558e37c3 Update adding languages docs with new commands 2017-04-13 13:52:11 +02:00
Sohil
328678c7e9 Extra brace ")" creating error
There is an extra closing brace `)` which is creating error while running example.
2017-04-13 17:12:28 +05:30
ines
ecfbc0b621 Update user survey badge 2017-04-10 17:49:51 +02:00
ines
7b3fe42be6 Add user survey badge to landing page 2017-04-08 10:09:43 +02:00
ines
aa680e0312 Add landing badge mixin 2017-04-08 10:09:43 +02:00
ines
4da11350f6 Add user survey badge SVG 2017-04-08 10:09:43 +02:00
ines
97d6f42136 Fix SVG formatting 2017-04-08 10:09:43 +02:00
ines
5c7206c7f7 Add user survey link to website 2017-04-07 18:34:35 +02:00
ines
1f501af602 Add file name shadowing module issue to troubleshooting guide (see #953) 2017-04-07 16:21:32 +02:00
ines
2f38c1d77f Add documentation for new convert and model commands 2017-04-07 13:27:55 +02:00
ines
dcf8ab0c47 Merge branch 'develop' 2017-04-07 12:00:09 +02:00
ines
f33c4cbae1 Add --no-cache-dir error to troubleshooting docs (see #958) 2017-04-07 10:22:18 +02:00
ines
d6bbc3ffcd Fix formatting 2017-04-07 10:22:18 +02:00
ines
2c36a61ec5 Add spacyr to libraries 2017-04-03 18:12:38 +02:00
ines
4759fd437d Merge branch 'master' into develop 2017-03-29 10:37:13 +02:00
ines
e210496f78 Update Windows compiler docs 2017-03-29 10:35:20 +02:00
ines
e2ed2f0258 Bump version 2017-03-26 20:51:21 +02:00
ines
13df2d6a60 Add documentation for spaCy's JSON format 2017-03-26 15:56:15 +02:00
ines
5901c8f7f0 Update spacy train CLI documentation 2017-03-26 15:33:48 +02:00
ines
afd839f64b Add pip and conda badges to installation docs 2017-03-26 14:11:31 +02:00
ines
45d03ea05b Add BADGES settings and mixin for pip and conda badges 2017-03-26 14:11:22 +02:00
ines
9a481c9f42 Add "Troubleshooting" section 2017-03-26 13:42:36 +02:00
ines
d4a86b6394 Update formatting 2017-03-26 13:42:19 +02:00
ines
1dae97b2f6 Fix typos 2017-03-26 11:14:44 +02:00
ines
a9368b591a Update inline code style 2017-03-26 11:14:36 +02:00
ines
88160f5daa Update border style on infoboxes 2017-03-26 11:14:30 +02:00
ines
09d7f26bed Remove text shadow on selected text 2017-03-26 11:14:16 +02:00
ines
8389e05496 Hide home link and subsection title on small screens 2017-03-26 11:14:08 +02:00
ines
323b418cf1 Split docs menu item into Usage and API 2017-03-26 11:13:52 +02:00
ines
a5fc5fb0db Add Hebrew to list of alpha languages 2017-03-25 10:22:46 +01:00
ines
9600cd1b9e Fix download commands 2017-03-25 10:22:05 +01:00
ines
4a9a1126a4 Update syntax highlighting color scheme 2017-03-22 10:02:59 +01:00
ines
fa6e3cefbb Simplify package command docs 2017-03-21 11:35:29 +01:00
ines
49bbfdaac1 Add info on CLI to docs on own models 2017-03-21 11:25:01 +01:00
ines
09b24bc5a9 Add docs for package command 2017-03-21 11:19:21 +01:00
ines
81b28ca606 Update models docs with info on retraining own models 2017-03-20 18:01:55 +01:00
ines
ef5e261387 Add spacy_api project by @kootenpv to showcase 2017-03-19 12:49:40 +01:00
ines
fa1f2040a5 Use correct code block language 2017-03-18 18:19:50 +01:00
ines
ff277140f9 Add CLI docs 2017-03-18 15:24:50 +01:00
ines
e635e1f6f4 Update docs to reflect new commands 2017-03-18 15:24:42 +01:00
ines
e9d8d756fc Fix typo in pytest flags 2017-03-18 15:24:20 +01:00
ines
d44f85a85e Adjust background color of inline code in asides 2017-03-18 15:24:07 +01:00
ines
3926ffdb70 Update models docs 2017-03-17 19:26:37 +01:00
ines
76c0ea6cc6 Update models docs 2017-03-17 17:01:16 +01:00
ines
b322f31521 Update models docs 2017-03-17 16:09:56 +01:00
ines
7f25f64acc Update lightning tour 2017-03-17 13:11:00 +01:00
ines
881db170f7 Update latest news 2017-03-17 13:10:47 +01:00
ines
441383d380 Bump version 2017-03-17 12:52:55 +01:00
ines
e461fafd14 Update example 2017-03-16 23:23:35 +01:00
ines
f4df9463f2 Fix wording 2017-03-16 22:21:46 +01:00
ines
08b0fb62cc Update models docs 2017-03-16 22:09:43 +01:00
ines
4d9161476a Remove redundant MODELS_URL 2017-03-16 22:09:36 +01:00
ines
0b5c664b04 Update resources 2017-03-16 21:59:26 +01:00
ines
01288807ba Update spaCy version 2017-03-16 21:53:59 +01:00
ines
0f4a876a83 Add MODELS_URL variable 2017-03-16 21:53:54 +01:00
ines
807139ae61 Update installation docs and add models quickstart aside 2017-03-16 21:53:44 +01:00
ines
ec75c781b9 Add docs page for models 2017-03-16 21:53:31 +01:00
ines
3e3f20e68b Add infobox mixin 2017-03-16 21:53:15 +01:00
ines
3ad2179360 Add lighter version of theme colour 2017-03-16 21:53:00 +01:00
ines
b7722b1bff Add box component 2017-03-16 21:52:52 +01:00
ines
d406f1e9e6 Fix formatting and add link styling to all asides 2017-03-16 21:52:45 +01:00
ines
dd302e0faa Add bottom margin hack to prevent overlaps 2017-03-16 21:52:19 +01:00
ines
4c53eed35a Remove sputnik from dependencies and docs 2017-03-15 17:39:25 +01:00
ines
758335452d Update installation instructions and fix formatting 2017-03-08 11:36:00 +01:00
ines
004c4c9566 Update installation docs
Include conda and virtualenv info for pip, add instructions for
downloading models manually and add details and fab commands to
"Compile from source" section.
2017-03-07 18:52:22 +01:00
yalei
27c0e6226b Edit example code
The original code forget to import the `random` module and the `EntityRecognizer` module.
2017-03-07 18:07:40 +08:00
ines
d25f17f139 Add Bengali to list of languages (see #865) 2017-03-01 15:59:21 +01:00
ines
2b07ab7db4 Add feature scheme to API docs (see #857, #739) 2017-02-24 18:26:32 +01:00
ines
8ddad178f6 Add book and tutorial 2017-02-24 18:26:32 +01:00
ines
00728a23f0 Fix path in gitignore 2017-02-24 18:26:32 +01:00
Ines Montani
49a102aff3 Merge pull request #841 from jondoughty/patch-1
Updated Token class documentation
2017-02-16 23:47:51 +01:00
Jon Doughty
12a8757343 Update token.jade 2017-02-16 10:55:33 -08:00
nycmonkey
8946a2a496 Fix typo in IOB integer to letter map
ent_iob value for an ent.iob_ value of 'B' should be 3, not B
2017-02-16 13:49:57 -05:00
ines
f6b69babcc Fix years in footer 2017-02-16 12:14:35 +01:00
John Gamboa
e31894b800 Fixes example 3 of entity recognition (see issue #832) 2017-02-16 11:19:53 +01:00
Stefan Bunk
2bf19d4735 Fix error in pipeline loading documentation
The cell for the `vocab` parameter is not displayed, making it seem as if the explanation belongs to the previous param.
2017-02-10 12:06:55 +01:00
ines
1b8719bf9a Adjust formatting and increment version 2017-02-08 21:33:22 +01:00
Sébastien Lerique
e1f87858ad Make the website nav header's hysteresis a bit more robust
In particular, this prevents the nav header from reappearing all the
time while scrolling down on Firefox.
2017-02-08 15:08:33 +01:00
Stefan Bunk
e972b2fa87 Fix error in matching documentation
LOWER and IS_PUNCT are members of `spacy` and not of the `Matcher` class.
2017-02-07 16:52:01 +01:00
Matthew Honnibal
9aaa2c5633 Fix entity recognition example (closes #803) 2017-02-05 11:23:12 +01:00
ines
a44da8fb34 Update language models and alpha support overview 2017-02-04 13:49:05 +01:00
Ines Montani
651bf411e0 Add tutorial 2017-01-26 13:48:38 +01:00
Ines Montani
da3aca4020 Fix formatting 2017-01-26 13:48:29 +01:00
Ines Montani
baa6be8180 Update latest news to last blog post 2017-01-26 13:47:45 +01:00
Ines Montani
bdafb514c5 Update version 2017-01-26 13:47:32 +01:00
Hidekazu Oiwa
7806ebafd2 Fix the span doc typo
Fix the typo in the span API doc.
It explains the `end` of the span as the `start_char` description.
2017-01-17 20:37:14 -08:00
Kevin Gao
7ec710af0e Fix Custom Tokenizer docs
- Fix mismatched quotations
- Make it more clear where ORTH, LEMMA, and POS symbols come from
- Make strings consistent
- Fix lemma_ assertion s/-PRON-/me/
2017-01-17 10:38:14 -08:00
Ines Montani
dbe8dafb52 Fix logo width and height to avoid link overlap in Safari (resolves #748) 2017-01-17 17:56:34 +01:00
Ines Montani
ee45619307 Fix formatting 2017-01-17 17:55:59 +01:00
Jason Kessler
9fa6f9fb40 Origin of spacy.matcher attributes
Make it clear that Matcher attributes live in spacy.matcher.attrs.
2017-01-16 13:31:35 -06:00
jktong
df0aeff379 Correct typo "chldren" in doc.jade 2017-01-16 09:34:59 -05:00
Ines Montani
57919566b8 Add Jupyter notebooks repo to resources list 2017-01-05 20:50:08 +01:00
Ines Montani
d677db6277 Change "Multi-language support" to amber for spaCy 2017-01-03 21:24:35 +01:00
Ines Montani
6f51609b5e Use yellow color for neutral pro/con icon 2017-01-03 21:24:14 +01:00
Ines Montani
1b82756cc7 Tidy up and fix formatting and consistency 2017-01-02 00:29:24 +01:00
Ines Montani
614f95f3bf Remove help cursor from API links 2017-01-02 00:29:08 +01:00
Ines Montani
87c7496065 Use better chat window icons with more compact markup 2017-01-01 13:25:28 +01:00
Ines Montani
a1a4b253a1 Add Gitter chat widget component to docs 2017-01-01 12:46:01 +01:00
Ines Montani
78e54b375f Move scripts to own file 2017-01-01 12:45:37 +01:00
Ines Montani
134e115d9c Bump version 2017-01-01 12:45:17 +01:00
Ines Montani
4acd026cb6 Add missing documentation to mixins 2017-01-01 12:43:43 +01:00
Ines Montani
e3d84572f2 Fix ents input format example 2017-01-01 12:28:37 +01:00
Ines Montani
a9a7cddf5b Update icons and remove unused SVG meta 2017-01-01 03:18:51 +01:00
Ines Montani
cd0da315d5 Bump version 2017-01-01 03:18:36 +01:00
Ines Montani
3ca8de4666 Use rem value for top/bottom card padding
Fix rendering / interpretation error in Firefox
2017-01-01 03:18:08 +01:00
Ines Montani
2afbf6b6c0 Add missing closing tag for symbol 2017-01-01 03:17:43 +01:00
Ines Montani
d845ab3d20 Add Gitter room to social meta 2017-01-01 03:17:29 +01:00
Guy Rosin
acdd2fc9a6 Tiny code typo 2016-12-31 14:53:05 +02:00
Ines Montani
d1585959d9 Add Hungarian to alpha support overview 2016-12-27 22:31:41 +01:00
Ines Montani
e80dad8616 Update version 2016-12-27 22:18:48 +01:00
Ines Montani
b7becaec85 Fix typo 2016-12-25 15:23:32 +01:00
Ines Montani
6dd8ae1b0d Update README.md 2016-12-25 14:43:40 +01:00
Ines Montani
f6f6e028ea Make links detect target automatically and replace false with null for no attribute
New version of Harp would render attribute=false as attribute="false",
while attribute=null renders element without attribute.
2016-12-24 12:24:04 +01:00
Ines Montani
b893126c12 Use link mixin instead of plain link markup 2016-12-24 12:22:52 +01:00
Ines Montani
207555fae7 Fix spelling 2016-12-23 21:36:01 +01:00
Ines Montani
48b03b4001 Fix formatting and wording 2016-12-23 14:36:03 +01:00
Ines Montani
cc051ddc15 Add resources page to usage docs 2016-12-23 14:36:03 +01:00
Ines Montani
11ec02d5e3 Separate inline icon and help cursor classes 2016-12-23 14:36:03 +01:00
Ines Montani
d1a2846750 Document DET_LEMMA 2016-12-21 18:18:35 +01:00
Ines Montani
71c00db8a5 Update language models page 2016-12-21 00:54:54 +01:00
aikramer2
349143faa2 update to training doc 2016-12-20 12:01:16 -08:00
Ines Montani
a2525c76ee Reformat word frequencies section in "adding languages" workflow 2016-12-19 17:18:38 +01:00
Ines Montani
ddf5c5bb61 Generalise dependency parsing annotation specs beyond English (closes #657) 2016-12-19 13:42:44 +01:00
Ines Montani
6a793251c8 Add aside on spaCy's custom pronoun lemma 2016-12-19 13:41:47 +01:00
Ines Montani
d0c15730c4 Fix link 2016-12-19 13:09:45 +01:00
Ines Montani
a9c0e77b80 Fix typo 2016-12-19 13:09:45 +01:00
Ines Montani
fa65c6b54c Add "Adding languages" workflow (closes #562) 2016-12-18 23:54:19 +01:00
Ines Montani
1cddb7da36 Add "Part-of-speech tagging" workflow (closes #581) 2016-12-18 23:54:19 +01:00
Ines Montani
89398ca57b Bump version 2016-12-18 23:54:19 +01:00
Ines Montani
ac597b58f6 Update showcase 2016-12-18 23:54:18 +01:00
Ines Montani
614ca6fb41 Split annotation specs into files to they can be included in different places 2016-12-18 17:42:10 +01:00
Ines Montani
ac95779a75 Wrap src mixin in nowrap to prevent line break between text and icon 2016-12-18 17:41:03 +01:00
Ines Montani
6f8b555ab0 Add nowrap utility class 2016-12-18 17:40:30 +01:00
Ines Montani
ce8bf08223 Fix formatting 2016-12-18 17:40:20 +01:00
David Edwards
278199dd2c Update index.jade 2016-12-15 13:40:53 -08:00
jaspb
3d7f81ddf5 added 'en' to spacy.load(..) 2016-12-10 19:18:13 +00:00
Tobias Macey
1d768d6510 Fixed minor typo
The word `motto` was missing the second `t`.
2016-12-01 06:08:33 -05:00
Jimi Smoot
8373115cbd Minor typos 2016-11-25 18:22:52 -08:00
Ines Montani
ada007cb73 Fix formatting for consistency 2016-11-25 15:53:40 +01:00
Ines Montani
19f27cc6ef Use consistent entity tables across docs 2016-11-25 15:48:50 +01:00
Ines Montani
e0c7a22f09 Add usage workflow for entity recognizer 2016-11-25 02:30:31 +01:00
Ines Montani
c8e69b98cc Update tutorial tags 2016-11-25 02:30:31 +01:00
Ines Montani
bf65d070ef Add CodePen embed mixin 2016-11-25 02:30:31 +01:00
Ines Montani
6f7835bb70 Add tutorial 2016-11-24 19:25:21 +01:00
Ines Montani
a7b5fba132 Merge pull request #642 from ExplodingCabbage/specify-data-path
Let --data-path be specified when running download.py scripts
2016-11-23 13:05:03 +01:00
Will Thompson
e896466dcf
docs: processing-text: fix missing line wrap 2016-11-21 10:43:16 +00:00
Will Thompson
1adc96f0a6 docs: fix "installaton" typo 2016-11-21 10:37:57 +00:00
Mark Amery
2dc305f46b Merge remote-tracking branch 'origin/master' into specify-data-path 2016-11-20 18:29:06 +00:00
Ines Montani
20c8fc5255 Merge pull request #645 from ExplodingCabbage/formatting-mistake
Fix another typo on the website
2016-11-20 19:13:53 +01:00
Mark Amery
270d42e73a Fix another typo on the website 2016-11-20 17:08:04 +00:00
Mark Amery
b4e1dc0e3f Fix a bunch of missing spaces of the website 2016-11-20 17:02:45 +00:00
Mark Amery
a0c4b29dcb Document new --data-path argument 2016-11-20 16:52:56 +00:00
ExplodingCabbage
b6e507e026 Fix spelling error on website front page 2016-11-20 16:02:54 +00:00
Paul Dechov
537f9eaaf8 [DOCS] Typo 2016-11-17 16:29:39 -05:00
tjrileywisc
464a4f3f6f Fixed a minor typo in deep learning tutorial docs. 2016-11-17 13:38:10 -05:00
Matthew Honnibal
af953cf2e6 Merge pull request #620 from savkov/patch-1
Missing import statement for spacy.matcher.Matcher
2016-11-16 06:08:44 +11:00
Sasho Savkov
a8831a85e4 Added missing brackets & suggested import statmnt
There are two missing brackets on the `add_pattern` lines. I also suggest you include the `from spacy.tokens.doc import Doc` statement to make it easy for people to copy paste a working example.
2016-11-11 17:12:56 +00:00
Sasho Savkov
250879bb96 Missing import statement
It is useful to know where the Matcher class is if you haven't used it before. Or you are simply too lazy to remember, like me :)

FYI: some packages don't appear in the PyCharm autocompletion lists. `spacy.matcher` is one of them.
2016-11-11 12:04:08 +00:00
Ines Montani
8abc2084ff Add user survey results as latest news 2016-11-07 22:48:35 +01:00
Ines Montani
0a90b141f4 Trust link 2016-11-07 21:50:40 +01:00
Ines Montani
bf3c1c7a48 Add link to dependency parse workflow 2016-11-07 21:32:03 +01:00
Ines Montani
418c084f12 Replace "" with false to prevent rending of empty attributes 2016-11-07 02:18:36 +01:00
Ines Montani
da52bcf080 Patch version 2016-11-07 02:17:48 +01:00
Ines Montani
3502654551 Add option for "latest news" on landing page 2016-11-07 02:14:43 +01:00
Ines Montani
f91bf4d59c Add right margin to tags to make them usable inline 2016-11-07 02:14:26 +01:00
Ines Montani
4352371b36 🔴 Fix bug that would prevent rendering of robots.txt 2016-11-07 02:13:45 +01:00
Ines Montani
d5668cf0d2 Add spacy-api-docker to showcase 2016-11-06 13:46:20 +01:00
Ines Montani
98c8e70dc2 Update installation docs 2016-11-06 13:46:11 +01:00
Ines Montani
c20abc8a6d Add customizing tokenizer and training workflow 2016-11-05 20:40:11 +01:00
Ines Montani
5e4e5b600f Update language models docs 2016-11-05 02:50:55 +01:00
Ines Montani
9251c8991a Update version 2016-11-05 02:50:55 +01:00
SultanMirza
daedf2c153 Fixing typos and errors!!
Fixed some typos and errors on the page.
2016-11-04 20:54:28 +05:30
Ines Montani
8f584ba468 Update logos 2016-11-03 22:09:59 +01:00
Ines Montani
f343295beb Update logo order and add sizes 2016-11-03 22:09:59 +01:00
Ines Montani
c87b709f23 Use grid and no grayscale for logo wall rendering 2016-11-03 22:09:59 +01:00
Ines Montani
aad0495ff6 Add grid style to center content horizontally and vertically 2016-11-03 22:09:59 +01:00
Ines Montani
7d0a1d9c67 Add option for grid modifier classes 2016-11-03 22:09:59 +01:00
SultanMirza
d824f8c322 removed typo 2016-11-03 21:53:58 +05:30
Ines Montani
c4a8ad356e Update website README.md 2016-11-03 13:06:29 +01:00
Ines Montani
b5abdcb390 Fix formatting 2016-11-03 13:06:05 +01:00
Ines Montani
42bf4ff9fe Add TruthBot to showcase 2016-11-03 12:43:55 +01:00
Ines Montani
c748474a9e Fix formatting 2016-11-03 01:52:31 +01:00
Ines Montani
a1cee7ade5 Add subtle background to active items 2016-11-03 01:52:25 +01:00
Ines Montani
e5df86419e Move sidebar padding to individual items 2016-11-03 01:52:15 +01:00
Ines Montani
b22cb8ae91 Patch version 2016-11-03 00:39:19 +01:00
Ines Montani
2515b32a74 Add documentation for Tokenizer API (see #600) 2016-11-02 23:18:02 +01:00
Ines Montani
309a8c2c0f Use more distinct color for table footers 2016-11-02 23:18:01 +01:00
Ines Montani
adf04a6ad3 Adjust tutorial category name 2016-11-02 12:11:17 +01:00
Ines Montani
2c65c15d7a Fix typo 2016-11-02 11:25:09 +01:00
Ines Montani
823e47d946 Add language models to API docs (fixes #598) 2016-11-02 11:24:13 +01:00
Ines Montani
35ad353dc2 Fix odd row color to show scroll shadow 2016-11-02 11:21:17 +01:00
Ines Montani
0438137f2f Add language models to features (see #598) 2016-11-02 10:47:02 +01:00
Ines Montani
85b0dd9ad6 Change wording 2016-11-02 10:47:02 +01:00
Ines Montani
37a3772fff Fix typo in website README.md 2016-11-01 22:41:35 +01:00
Ines Montani
d3b6a594f8 Add Natural Language Inference tutorial 2016-11-01 03:27:20 +01:00
Ines Montani
4b84b4522b Update link in examples 2016-11-01 03:06:33 +01:00
Ines Montani
838284c254 Fix relative links in website README.md 2016-11-01 02:17:15 +01:00
Ines Montani
827d8f5c7c Adjust card padding 2016-11-01 02:13:06 +01:00
Ines Montani
ce3e72bee9 Simplify scripts and add versioning variable to JS and CSS 2016-11-01 02:13:06 +01:00
Ines Montani
2a3cc16b25 Reorder and update global meta 2016-11-01 02:13:06 +01:00
Ines Montani
e0732435fa Use viewport-relative units for aside width 2016-11-01 00:16:42 +01:00
Ines Montani
52a684a924 Make grid columns break earlier 2016-11-01 00:16:42 +01:00
Ines Montani
8ab2537661 Combine card styles into object with relative padding 2016-11-01 00:16:42 +01:00
Ines Montani
201445b3b8 Fix benchmarks intro 2016-10-31 20:55:59 +01:00
Ines Montani
3ab0fdf064 Update social title 2016-10-31 19:18:32 +01:00
Ines Montani
06f2374f98 Remove old files 2016-10-31 19:18:12 +01:00
Ines Montani
ed4d231bb7 Delete .gitignore 2016-10-31 19:05:07 +01:00
Ines Montani
7615b41bff Update to new website 2016-10-31 19:04:15 +01:00
Pokey Rule
603a3f40c5 Fix small bug in code of mark-adverbs tutorial 2016-10-26 15:23:36 +01:00
Mahmoud Lababidi
f8ce28058c fix typo in url 2016-10-24 15:21:18 -04:00
Ines Montani
efaa8eaf1f Add matcher to navigation 2016-10-24 00:52:17 +02:00
Ines Montani
26dc3f3ebf Fix indentation errors 2016-10-24 00:52:17 +02:00