Commit Graph

1059 Commits

Author SHA1 Message Date
ines
14f97cfd20 Add note on stream processing to migration guide (see #1508) 2017-11-08 01:53:36 +01:00
ines
5d1162cf21 Improve nlp.update / training loop overview (see #1507) 2017-11-08 01:17:42 +01:00
ines
2229aba71c Update website 2017-11-08 01:06:30 +01:00
ines
1768703e1c Update website for v2.0 2017-11-07 14:48:17 +01:00
ines
e4a05385d6 Update docs 2017-11-07 12:33:43 +01:00
ines
a4662a31a9 Move model package templates to cli.package and update docs 2017-11-07 12:15:35 +01:00
ines
a09c096d3c Get docs ready for v2.0.0 2017-11-07 12:00:43 +01:00
ines
173b1551af Update examples 2017-11-07 01:22:30 +01:00
ines
c37837cad1 Update training docs 2017-11-07 01:06:31 +01:00
ines
c7bda87b17 Update model docs and add tips section 2017-11-07 01:05:37 +01:00
ines
a1261e8632 Fix formatting 2017-11-07 01:05:30 +01:00
ines
912c1b1821 Document "simple training style" 2017-11-07 00:23:19 +01:00
ines
ad6438ccdf Update aside labels and under construction mixin 2017-11-07 00:23:00 +01:00
ines
8fb48b9b91 Update and document new util functions 2017-11-07 00:22:43 +01:00
ines
6447b8e396 Update v2 details 2017-11-06 21:15:36 +01:00
ines
008d7408cf Make vectors vs. tensors more explicit in 101 (see #1498) 2017-11-06 20:16:38 +01:00
ines
71852d3f25 Fix code mixins 2017-11-06 20:16:19 +01:00
ines
3b0699c9fe Update benchmarks and data table style 2017-11-06 19:36:02 +01:00
ines
ddff7dc474 Update GPU install docs 2017-11-06 19:35:36 +01:00
ines
64d0f97c67 Update benchmarks and models 2017-11-06 18:19:00 +01:00
Matthew Honnibal
6fdffd7246 Merge pull request #1497 from explosion/feature/improve-optimizer-handling
💫 Improve optimizer handling
2017-11-06 16:41:15 +01:00
ines
972298e0c9 Update Pipe component docs and training API 2017-11-06 14:42:24 +01:00
ines
f48e1973ed Fix accuracy table descriptions 2017-11-06 14:12:11 +01:00
ines
2d85ee6b5d Fix broken link 2017-11-06 13:27:30 +01:00
ines
efb0a7e934 Fix broken links 2017-11-06 13:20:36 +01:00
ines
42a99eae02 Update troubleshooting guide 2017-11-06 13:17:09 +01:00
ines
2dca9e71a1 Add notes on catastrophic forgetting (see #1496) 2017-11-06 13:17:02 +01:00
ines
e68d31bffa Update models quickstart usage example 2017-11-06 13:06:26 +01:00
ines
2fe2c4942f Update models directory and listing 2017-11-06 13:04:29 +01:00
ines
df1bdc7173 Add Dutch model 2017-11-06 02:44:59 +01:00
ines
333bef482f Update pattern for Prism.js Python 2017-11-06 02:44:24 +01:00
ines
6b08aefd0c Update formatting and styleguide 2017-11-05 23:31:31 +01:00
ines
e61a067c4b Update v2 docs 2017-11-05 21:41:56 +01:00
ines
86d6bd7503 Fix wording 2017-11-05 19:23:50 +01:00
ines
6742657c4d Fix website asset versioning 2017-11-05 19:23:45 +01:00
ines
2ca82d1f6e Take out pt_core_news_sm for now 2017-11-05 18:57:04 +01:00
ines
a6ffa942bb Update UD schemes 2017-11-05 18:46:24 +01:00
ines
3fa8900a6b Don't include tag and label schemes in usage guide 2017-11-05 18:21:49 +01:00
ines
4810be4b44 Update POS scheme docs and add links for other schemes 2017-11-05 18:16:34 +01:00
ines
e7d0641125 Update POS row mixins 2017-11-05 18:16:16 +01:00
ines
15de2bb01d Update and simplify other annotation scheme data 2017-11-05 16:09:48 +01:00
ines
2d59dd374b Use collapsible sections for pos/dep scheme and update
Will ensure better overview as we add more schemes for more languages
2017-11-05 16:09:30 +01:00
ines
a9c77e01b4 Add accordion component (collapsible section) 2017-11-05 16:08:13 +01:00
ines
3d4dff1845 Remove comment 2017-11-05 16:07:14 +01:00
ines
b53c2010db Add global focus style for links 2017-11-05 16:07:00 +01:00
ines
f092506578 Use hidden attribute instead of style.display 2017-11-05 16:06:50 +01:00
ines
0e8157674a Add Portuguese and French 2017-11-04 23:07:21 +01:00
ines
d9fa3c6054 Update adding languages example 2017-11-04 15:12:39 +01:00
ines
c83fe54f0c Update venv docs in installation instructions 2017-11-04 14:27:55 +01:00
ines
2940938bd8 Use more distinct style for checkboxes in quickstart 2017-11-04 14:24:30 +01:00
ines
4793d56a3e Update commands for building from source 2017-11-04 14:24:14 +01:00
ines
177bf4ee39 Update GitHub topic links 2017-11-04 14:02:28 +01:00
ines
2639ecd5f8 Add docs note on custom tokenizer rules (see #1491) 2017-11-03 23:33:18 +01:00
ines
380f2441b4 Fix script includes 2017-11-03 18:51:03 +01:00
Abhinav Sharma
c740277f9f Minor typo [ nad => and ] 2017-11-03 16:30:44 +05:30
ines
1e16374687 Update models list to reflect spaCy v2.0.0a18 2017-11-03 11:29:34 +01:00
ines
a62b0727d8 Tidy up and always use bundle in built site for now
Just to be safe
2017-11-03 11:29:21 +01:00
ines
d0f88af5b6 Hide error earlier 2017-11-03 11:29:04 +01:00
ines
43512c68b2 Fix vector details in model overview 2017-11-02 20:04:13 +01:00
ines
9baab241b4 Add skeleton language data for Turkish 2017-11-02 16:32:24 +01:00
ines
31e349a62c Update model families 2017-11-02 16:13:38 +01:00
ines
15cbc61a6e Adjust rendering of large numbers
1234 -> 1.2k
12345 -> 12.3k
123456 -> 123k
1234567 -> 1.2m
2017-11-02 16:13:18 +01:00
ines
391fce09d9 Update licenses 2017-11-01 23:04:40 +01:00
ines
c6fea3e5f6 Add Romanian and Croatian skeletons (experimental)
Add language data templates to make it easier for others to contribute to the language support
2017-11-01 23:04:28 +01:00
ines
408f450ce0 Tidy up 2017-11-01 23:01:12 +01:00
ines
2fa53b39d5 Add dev dependency 2017-11-01 23:01:06 +01:00
ines
1976fb157f Update licenses 2017-11-01 21:49:57 +01:00
ines
2ba4e4fc88 Fix broken links and add check_links shortcut script 2017-11-01 21:11:10 +01:00
ines
e5a4c31bb4 Adjust code line height 2017-11-01 19:49:42 +01:00
ines
5dd0d6a383 Update lightning tour 2017-11-01 19:49:36 +01:00
ines
9b4c38fe9f Add button option to terminal component 2017-11-01 19:49:27 +01:00
ines
12954ab218 Don't document the tensorizer for now 2017-11-01 19:49:04 +01:00
ines
a7a76ea8c5 Update backwards incompatibilities
Also add separate section for deprecated
2017-11-01 16:31:57 +01:00
ines
4f77bb8476 Fix error handling 2017-11-01 16:29:55 +01:00
ines
5ab4e96144 Update v2 guide and split into partials 2017-11-01 14:13:36 +01:00
ines
1c7313051f Document Token.is_sent_start 2017-11-01 14:13:22 +01:00
ines
9e429b5a8a Update formatting of deprecation note 2017-11-01 14:13:08 +01:00
ines
0fbab8160d Update GloVe vectors example 2017-11-01 13:14:43 +01:00
ines
a6f6bd6c98 Adjust tag spacing 2017-11-01 02:04:00 +01:00
ines
f84660986a Update example sentences for models quickstart 2017-11-01 01:57:33 +01:00
ines
3b7ec64caa Add PYTHONPATH to build from source quickstart 2017-11-01 01:52:45 +01:00
ines
092333afd4 Update vector details and number conversion 2017-11-01 01:47:31 +01:00
ines
5fd851a80b Log errors 2017-11-01 01:46:50 +01:00
ines
07d02c3304 Update vectors and similarity usage guide 2017-11-01 01:25:17 +01:00
ines
0d8f4a534b Update Vectors API docs 2017-11-01 00:56:54 +01:00
ines
9eb998443f Update language tokenizer dependencies 2017-11-01 00:56:35 +01:00
ines
0cde065ed9 Add Irish to list of languages (see #1152) 2017-11-01 00:56:21 +01:00
Ines Montani
3c8db3e4da Merge pull request #1473 from explosion/refactor-javascript
Refactor website JS and add model comparison tool
2017-10-31 14:02:05 +01:00
ines
be5b635388 Remove "needs model" and add info about models (see #1471) 2017-10-31 13:37:55 +01:00
ines
5af6c8b746 Update training docs 2017-10-30 20:28:00 +01:00
ines
8ad4f3f6e5 Take out JSON format include in tagger/parser 2017-10-30 19:48:35 +01:00
ines
33af6ac69a Use even smaller example size
100 was still too much, so try 20 instead
2017-10-30 19:46:45 +01:00
ines
f02b0af821 Fix path and use smaller example size
500 was too large and caused laggy rendering
2017-10-30 19:44:35 +01:00
ines
18dde7869a Update training data docs and add vocab JSONL 2017-10-30 19:40:05 +01:00
ines
57534253e6 Move CLI docs to own page 2017-10-30 19:39:26 +01:00
ines
ec657c1ddc Update vocab docs and document Vocab.prune_vectors 2017-10-30 19:35:41 +01:00
ines
12343e23fd Update CLI docs and document vocab command 2017-10-30 18:59:08 +01:00
ines
5598542055 Add link 2017-10-30 18:58:55 +01:00
ines
abf8aa05d3 Populate --create-meta defaults from file if available
If meta.json is found in directory and user chooses to overwrite it, show existing data as defaults.
2017-10-30 18:39:38 +01:00
ines
3ffbb64ab6 Unify chart options and update styleguide 2017-10-30 17:25:49 +01:00
ines
14ad92d337 Ensure fallbacks / progressive enhancement if JS disabled 2017-10-30 16:16:19 +01:00
ines
1eb1ed0c7c Add tool for model comparison (experimental)
User can select two models and their meta is fetched from GitHub. Features, accuracy figures and speed benchmarks are displayed in a table, with an additional chart comparing the accuracy scores if available. Main use case: demonstrating and visualising trade-offs between larger and smaller models of the same type.
2017-10-30 14:09:43 +01:00
ines
fb2710211b Integrate rollup into website build process 2017-10-30 14:08:26 +01:00
ines
38ef4274b6 Remove confusing icon for non-compatible models
ModelLoader will now output "not compatible" if no compatible version of model is found for a spaCy version
2017-10-30 14:07:42 +01:00
ines
8db3da3c3d Refactor JS, split into modules and add nomodule option
rollup.js will be compiled by the rollup package and Babel on build, and will be loaded if a browser doesn't yet support JS modules
2017-10-30 14:06:25 +01:00
ines
5453821a9f Update NER annotation scheme
Add note on training data sources and include coarse-grained Wikipedia scheme
2017-10-30 13:53:49 +01:00
ines
df149455f9 Don't ever wrap navigation bar contents 2017-10-30 13:16:20 +01:00
ines
74dd0ee2c2 Prevent responsive tables from scrolling vertically 2017-10-30 13:16:06 +01:00
ines
ae45446978 Remove comment 2017-10-30 13:15:46 +01:00
ines
25f6331550 Allow other style arguments on +grid-col 2017-10-30 13:15:30 +01:00
ines
08869c19fd Merge mixins and mixins-base
The distinction was never clear anyways and it was progressively getting messier. So all mixins live in one file now.
2017-10-30 13:15:13 +01:00
ines
ae2ad5becc Remove charts from model directory and add speed benchmarks
With speed benchmarks, charts ended up taking up too much space – and they were mostly data porn and not particularly useful anyways. Instead, we might add a "Compare" page that fetches all models and lets the user compare two or more models in terms of accuracy, speed etc.
2017-10-29 03:58:19 +01:00
ines
47fd254ba7 Combine table scroll shadows if row has only one cell 2017-10-29 03:56:37 +01:00
ines
b11928abc2 Adjust labels, spacing and hack specificity 2017-10-29 03:56:09 +01:00
ines
af0ba014d2 Document +code-new and +code-old 2017-10-29 03:54:13 +01:00
ines
9b6828bd83 Add height option to +chart and document 2017-10-29 03:53:59 +01:00
ines
e18744823b Add placeholders for Italian and Portuguese models 2017-10-29 01:29:39 +02:00
ines
3b1cfa3455 Add GPL license link 2017-10-29 01:18:32 +02:00
ines
5147cdc468 Fix formatting and add missing v2 label 2017-10-29 01:18:09 +02:00
ines
53bfcdba31 Make tooltips/tags and old/new code blocks more accessible (see #1471)
Always add tooltip text as hidden label. Use different tooltip icons for tags and inline help icons. Add labels to old/new code blocks and add option to customise label text.
2017-10-29 01:17:49 +02:00
ines
4a4f9666b2 Improve style/accessibility of yes/no/neutral icons (see #1471)
Use distinctive icons instead of only colour, add proper handling of labels (hidden or visible, but always present) with optional custom text.
2017-10-29 01:14:30 +02:00
ines
a8e10f94e4 Tidy up Lexeme and update docs 2017-10-27 21:07:50 +02:00
ines
5167a0cce2 Tidy up Vectors and docs 2017-10-27 19:45:19 +02:00
ines
544a407b93 Tidy up Doc, Token and Span and add missing docs 2017-10-27 17:07:26 +02:00
ines
6a0483b7aa Tidy up and document Doc, Token and Span 2017-10-27 15:41:45 +02:00
ines
298c3d973c Document Doc.get_lca_matrix 2017-10-27 14:37:53 +02:00
ines
9ff9afe889 Update spacy convert CLI docs 2017-10-27 14:37:42 +02:00
ines
52f1bf2729 Adjust GitHub embeds 2017-10-27 12:30:59 +02:00
Ines Montani
4033e70c71 Merge pull request #1461 from explosion/feature/disable-pipes
💫 Add Language.disable_pipes(), to temporarily edit pipeline and update code examples
2017-10-27 12:21:40 +02:00
ines
b5643d8575 Update intent parser docs and add to usage docs 2017-10-27 04:49:05 +02:00
ines
954c88f4d8 Fix formatting 2017-10-27 04:48:41 +02:00
ines
af28ca1ba0 Move example to pipeline directory 2017-10-27 02:00:01 +02:00
ines
1d69a46cd4 Update multi-processing example and add to docs 2017-10-27 01:58:55 +02:00
ines
647ef64f86 Update textcat docs 2017-10-27 00:51:29 +02:00
ines
a7b9074b4c Update textcat training example and docs 2017-10-27 00:48:45 +02:00
ines
cc2917c9e8 Update fastText example and add to examples in docs 2017-10-26 18:47:02 +02:00
ines
daed7ff8fe Update information extraction examples 2017-10-26 18:46:11 +02:00
ines
b90e958975 Update tagger and parser examples and add to docs 2017-10-26 16:27:42 +02:00
ines
0575e9cf20 Add parser example to docs 2017-10-26 16:12:34 +02:00
ines
281f88a59c Update NER training examples 2017-10-26 14:44:43 +02:00
ines
8116d1a077 Add note on biluo_tags_from_offsets helper 2017-10-26 14:44:32 +02:00
ines
9bf78d5fb3 Update spacy.explain docs 2017-10-26 13:04:25 +02:00
ines
96b4214303 Add notes on pipe template inheritance in docs 2017-10-26 12:57:32 +02:00
ines
e6536d231f Update new entity type training example in docs 2017-10-25 22:17:23 +02:00
ines
400812d9b1 Add add_label method to Pipe 2017-10-25 22:17:11 +02:00
ines
70de2dd035 Display vectors in models directory if available (see #1457) 2017-10-25 16:15:37 +02:00
ines
1a722dac31 Merge branch 'develop' into feature/disable-pipes 2017-10-25 15:18:18 +02:00
ines
0102561f34 Update docs 2017-10-25 13:57:55 +02:00
ines
68e9de6917 Add documentation 2017-10-25 13:57:14 +02:00
ines
3484174e48 Add Language.path 2017-10-25 11:57:43 +02:00
ines
c815ff65f6 Update feature list 2017-10-24 21:49:11 +02:00
ines
d71702b827 Fix formatting 2017-10-24 20:11:04 +02:00
ines
6686e53530 Allow GitHub embeds to specify optional language 2017-10-24 16:00:56 +02:00
ines
56a47f137f Add title description for tokenizer 2017-10-24 16:00:56 +02:00
ines
3944c1d6e7 Document lemmatizer 2017-10-24 16:00:56 +02:00
ines
c9dc88ddfc Document current JSON format for training 2017-10-24 16:00:56 +02:00
Matthew Honnibal
ef3e5a361b Merge pull request #1442 from explosion/feature/fix-sp
💫 Fix SP tag, tweak Vectors.__init__, fix Morphology
2017-10-24 10:24:07 +02:00
Matthew Honnibal
fdf25d10ba Merge pull request #1440 from ramananbalakrishnan/develop
Support single value for attribute list in doc.to_array
2017-10-24 10:23:12 +02:00
ines
7701984f13 Document Span.as_doc 2017-10-23 10:38:27 +02:00
ines
db15902e84 Tidy up 2017-10-23 10:38:21 +02:00
ines
3f0a157b33 Fix typo 2017-10-23 10:38:13 +02:00
Matthew Honnibal
ebecaddb76 Make 'data_or_width' two keyword args in Vectors.__init__
Previously the data and width options were one argument in Vectors,
which meant you couldn't say vectors = Vectors(strings, width=300).
It's better to have two keywords.
2017-10-20 14:17:15 +02:00
ines
108f1f786e Update symbols and document missing token attributes (see #1439) 2017-10-20 13:08:44 +02:00
ines
4acab77a8a Add missing symbol for LAW entities (resolves #1427) 2017-10-20 13:07:57 +02:00
Ramanan Balakrishnan
d44a079fe3 Update documentation on doc.to_array 2017-10-20 14:25:38 +05:30
Matthew Honnibal
61bc203f3f Merge pull request #1438 from explosion/feature/fast-parser
💫 Improve runtime CPU efficiency of parser/NER
2017-10-19 02:42:21 +02:00
Matthew Honnibal
d4cfff0476 Comment out currently hard-coded hyper-params 2017-10-19 00:47:24 +02:00
Ines Montani
f0d577e460 Merge pull request #1425 from explosion/feature/hindi-tokenizer
💫 Basic Hindi tokenization support
2017-10-18 13:34:52 +02:00
ines
a74cba2ffa Remove Binder from docs (now covered by Doc API) 2017-10-17 16:27:19 +02:00
ines
8ca344712d Add Language.has_pipe method 2017-10-17 11:20:07 +02:00
ines
4cfe259266 Fix formatting 2017-10-16 20:36:41 +02:00
ines
18793efef1 Remove Russian from v2.0 docs for now 2017-10-16 20:36:36 +02:00
ines
d383612225 Add note about word vectors in example (see #1117) 2017-10-16 20:31:58 +02:00
Matthew Honnibal
010a7309ff Merge pull request #1402 from explosion/feature/fix-matcher-operators
💫 Fix Matcher variable-length operators
2017-10-16 17:53:19 +02:00
ines
63393b4e0d Update matcher docs to reflect operator changes 2017-10-16 13:44:12 +02:00
ines
15514dc333 Add section on upgrading 2017-10-14 22:14:47 +02:00
ines
c0aceb9fbe Add Hindi to supported languages 2017-10-14 15:16:41 +02:00
ines
a5da683578 Add Russian to alpha docs and update tokenizer dependencies 2017-10-14 12:52:41 +02:00
ines
a69f4e56e5 Remove outdated aside 2017-10-14 12:52:07 +02:00
ines
bb6ecb82e5 Ensure long file paths in code examples break if needed 2017-10-14 12:51:52 +02:00
ines
bfd9506f1d Update extensions docs and add resources 2017-10-13 00:18:13 +02:00
ines
5f5d6897e8 Increment version 2017-10-13 00:18:02 +02:00
ines
9fd68334ab Add validate command docs 2017-10-12 23:36:48 +02:00
Ines Montani
37aa523a8e Merge pull request #1408 from explosion/feature/dot-underscore
💫 Custom attributes via Doc._, Token._ and Span._
2017-10-11 18:35:56 +02:00
ines
eac9e99086 Update docs on adding lemmatization to languages 2017-10-11 14:21:15 +02:00
ines
f4ae6763b9 Fix consistency of imports from spacy.tokens in examples 2017-10-11 02:30:40 +02:00
ines
19598ebfee Update migration guide 2017-10-10 06:38:11 +02:00
ines
9c96a6e131 Update pipelines section in v2 overview 2017-10-10 06:33:53 +02:00
Matthew Honnibal
09d61ada5e Merge pull request #1396 from explosion/feature/pipeline-management
💫 Improve pipeline and factory management
2017-10-10 04:29:54 +02:00
ines
6679117000 Add pipeline component examples 2017-10-10 04:26:06 +02:00
ines
7a592d01dc Update pipeline component usage docs 2017-10-10 04:24:39 +02:00
ines
3d5154811a Fix typo 2017-10-10 04:24:22 +02:00
ines
43b70651fb Document extension methods on Doc, Token and Span
set_extension, get_extension, has_extension
2017-10-10 04:23:37 +02:00
ines
b4fc6b203c Rename mixin 2017-10-10 04:22:23 +02:00
ines
de374dc72a Merge branch 'feature/pipeline-management' into feature/dot-underscore 2017-10-09 14:37:51 +02:00
ines
6c253db3fe Add section for developing spaCy extensions 2017-10-09 14:36:56 +02:00
ines
6550d0547c Fix typo 2017-10-09 14:36:36 +02:00
ines
4d248ea920 Fix spacing on bulleted lists 2017-10-09 14:36:30 +02:00
ines
2ac8b5c622 Add wrapper for before/after code examples 2017-10-09 14:36:20 +02:00
ines
ca6769fd48 Update spacy functions and remove removed set_factory 2017-10-07 15:28:01 +02:00