mirror of
https://github.com/explosion/spaCy.git
synced 2025-11-02 17:07:49 +03:00
* Consolidate and freeze symbols Instead of having symbol values defined in three potentially conflicting places (`spacy.attrs`, `spacy.parts_of_speech`, `spacy.symbols`), define all symbols in `spacy.symbols` and reference those values in `spacy.attrs` and `spacy.parts_of_speech`. Remove deprecated and placeholder symbols from `spacy.attrs.IDS`. Make `spacy.attrs.NAMES` and `spacy.symbols.NAMES` reverse dicts rather than lists in order to support future use of hash values in `attr_id_t`. Minor changes: * Use `uint64_t` for attrs in `Doc.to_array` to support future use of hash values * Remove unneeded attrs filter for error message in `Doc.to_array` * Remove unused attr `SENT_END` * Handle dynamic size of attr_id_t in Doc.to_array * Undo added warnings * Refactor to make Doc.to_array more similar to Doc.from_array * Improve refactoring |
||
|---|---|---|
| .. | ||
| __init__.pxd | ||
| __init__.py | ||
| doc_bin.py | ||
| doc.pxd | ||
| doc.pyi | ||
| doc.pyx | ||
| graph.pxd | ||
| graph.pyx | ||
| morphanalysis.pxd | ||
| morphanalysis.pyi | ||
| morphanalysis.pyx | ||
| retokenizer.pyi | ||
| retokenizer.pyx | ||
| span_group.pxd | ||
| span_group.pyi | ||
| span_group.pyx | ||
| span_groups.py | ||
| span.pxd | ||
| span.pyi | ||
| span.pyx | ||
| token.pxd | ||
| token.pyi | ||
| token.pyx | ||
| underscore.py | ||