spaCy/website/docs/api
Madeesh Kannan 446a3ecf34
StringStore refactoring (#11344)
* `strings`: Remove unused `hash32_utf8` function

* `strings`: Make `hash_utf8` and `decode_Utf8Str` private

* `strings`: Reorganize private functions

* 'strings': Raise error when non-string/-int types are passed to functions that don't accept them

* `strings`: Add `items()` method, add type hints, remove unused methods, restrict inputs to specific types, reorganize methods

* `Morphology`: Use `StringStore.items()` to enumerate features when pickling

* `test_stringstore`: Update pre-Python 3 tests

* Update `StringStore` docs

* Fix `get_string_id` imports

* Replace redundant test with tests for type checking

* Rename `_retrieve_interned_str`, remove `.get` default arg

* Add `get_string_id` to `strings.pyi`
Remove `mypy` ignore directives from imports of the above

* `strings.pyi`: Replace functions that consume `Union`-typed params with overloads

* `strings.pyi`: Revert some function signatures

* Update `SYMBOLS_BY_INT` lookups and error codes post-merge

* Revert clobbered change introduced in a previous merge

* Remove unnecessary type hint

* Invert tuple order in `StringStore.items()`

* Add test for `StringStore.items()`

* Revert "`Morphology`: Use `StringStore.items()` to enumerate features when pickling"

This reverts commit 1af9510ceb.

* Rename `keys` and `key_map`

* Add `keys()` and `values()`

* Add comment about the inverted key-value semantics in the API

* Fix type hints

* Implement `keys()`, `values()`, `items()` without generators

* Fix type hints, remove unnecessary boxing

* Update docs

* Simplify `keys/values/items()` impl

* `mypy` fix

* Fix error message, doc fixes
2022-10-06 10:51:06 +02:00
..
architectures.md Add experimental coref docs (#11291) 2022-09-27 18:11:23 +09:00
attributeruler.md Rename modules for consistency (#11286) 2022-08-10 11:44:05 +02:00
attributes.md Add API docs for token attribute symbols (#10836) 2022-06-23 08:16:38 +02:00
cli.md Add a way to get the URL to download a pipeline to the CLI (#11175) 2022-09-02 11:58:21 +02:00
coref.md Add experimental coref docs (#11291) 2022-09-27 18:11:23 +09:00
corpus.md Remove NBSP's across tables in the docs (#10842) 2022-05-25 09:48:39 +02:00
cython-classes.md Update docs, types and API consistency 2020-08-17 16:45:24 +02:00
cython-structs.md Update docs, types and API consistency 2020-08-17 16:45:24 +02:00
cython.md Update docs [ci skip] 2020-09-12 17:05:10 +02:00
data-formats.md adding spans to doc_annotation in Example.to_dict (#11261) 2022-08-05 12:26:38 +02:00
dependencymatcher.md add additional REL_OP (#10371) 2022-07-27 13:16:44 +02:00
dependencyparser.md Update docs for pipeline initialize() methods (#11221) 2022-08-03 16:53:02 +02:00
doc.md Merge branch 'copy_develop' into copy_v4 2022-10-03 14:12:16 +02:00
docbin.md Fix point typo on docbin docs (#9097) 2021-08-31 10:55:44 +02:00
edittreelemmatizer.md Store activations in Docs when save_activations is enabled (#11002) 2022-09-13 09:51:12 +02:00
entitylinker.md Merge branch 'copy_develop' into copy_v4 2022-10-03 14:12:16 +02:00
entityrecognizer.md Update docs for pipeline initialize() methods (#11221) 2022-08-03 16:53:02 +02:00
entityruler.md Rename modules for consistency (#11286) 2022-08-10 11:44:05 +02:00
example.md more explicit Example constructor example (#11489) 2022-09-16 09:26:33 +02:00
index.md Update v3 docs 2020-07-03 16:48:21 +02:00
kb_in_memory.md Refactor KB for easier customization (#11268) 2022-09-08 10:38:07 +02:00
kb.md Refactor KB for easier customization (#11268) 2022-09-08 10:38:07 +02:00
language.md Update docs to reflect Doc input to Language (#11555) 2022-09-29 18:50:29 +09:00
legacy.md Add ConsoleLogger.v2 (#11214) 2022-08-29 10:23:05 +02:00
lemmatizer.md Switch ru and uk lemmatizers to pymorphy3 (#11345) 2022-08-22 11:27:14 +02:00
lexeme.md fix 's typo's across code base (#8384) 2021-06-15 10:57:08 +02:00
lookups.md Update docs, types and API consistency 2020-08-17 16:45:24 +02:00
matcher.md Update/remove old Matcher syntax (#11370) 2022-08-30 15:40:31 +02:00
morphologizer.md Store activations in Docs when save_activations is enabled (#11002) 2022-09-13 09:51:12 +02:00
morphology.md Document Assigned Attributes of Pipeline Components (#9041) 2021-09-01 12:09:39 +02:00
phrasematcher.md Update/remove old Matcher syntax (#11370) 2022-08-30 15:40:31 +02:00
pipe.md Document scorers in registry and components from #8766 (#8929) 2021-08-12 12:50:03 +02:00
pipeline-functions.md Add experimental coref docs (#11291) 2022-09-27 18:11:23 +09:00
scorer.md Add experimental coref docs (#11291) 2022-09-27 18:11:23 +09:00
sentencerecognizer.md Store activations in Docs when save_activations is enabled (#11002) 2022-09-13 09:51:12 +02:00
sentencizer.md Update overwrite and scorer in API docs (#9384) 2021-10-11 10:35:07 +02:00
span-resolver.md Add experimental coref docs (#11291) 2022-09-27 18:11:23 +09:00
span.md Make Span/Doc.ents more consistent for ent_kb_id and ent_id (#11328) 2022-08-22 20:28:57 +02:00
spancategorizer.md Store activations in Docs when save_activations is enabled (#11002) 2022-09-13 09:51:12 +02:00
spangroup.md SpanGroup(s)-related optimizations (#11380) 2022-08-31 09:03:20 +02:00
spanruler.md the 'new' indicator wants a 'number' (#10997) 2022-06-21 22:01:16 +02:00
stringstore.md StringStore refactoring (#11344) 2022-10-06 10:51:06 +02:00
tagger.md Store activations in Docs when save_activations is enabled (#11002) 2022-09-13 09:51:12 +02:00
textcategorizer.md Store activations in Docs when save_activations is enabled (#11002) 2022-09-13 09:51:12 +02:00
tok2vec.md Update docs for pipeline initialize() methods (#11221) 2022-08-03 16:53:02 +02:00
token.md Make Span/Doc.ents more consistent for ent_kb_id and ent_id (#11328) 2022-08-22 20:28:57 +02:00
tokenizer.md Add tokenizer option to allow Matcher handling for all rules (#10452) 2022-03-24 13:21:32 +01:00
top-level.md Add docs for the spacy.models_and_pipes_with_nvtx_range.v1 callback (#11463) 2022-09-09 10:46:01 +02:00
transformer.md Update docs for pipeline initialize() methods (#11221) 2022-08-03 16:53:02 +02:00
vectors.md Docs for v3.3 (#10628) 2022-04-28 14:09:35 +02:00
vocab.md Add vector deduplication (#10551) 2022-03-30 08:54:23 +02:00