Commit Graph

  • 14b22b9d65 Cherry-pick is_empty() fix. Drop KnowledgeBase.__len()__. Raphael Mitsch 2023-03-02 13:17:47 +0100
  • 604cec0140 rely on is_empty property instead of __len__ (#12347) Sofie Van Landeghem 2023-03-01 12:06:07 +0100
  • 6f1632b3e9 Make generation of empty KnowledgeBase instances configurable in EntityLinker (#12320) Raphael Mitsch 2023-03-01 16:02:55 +0100
  • e325de3ff8 Displacy doc fix (#12352) kadarakos 2023-03-01 15:38:23 +0100
  • b3e7364551 rely on is_empty property instead of __len__ (#12347) Sofie Van Landeghem 2023-03-01 12:06:07 +0100
  • f87919d8f0 Update docs w.r.t. spacy.CandidateBatchGenerator.v1. (#12350) Raphael Mitsch 2023-03-01 11:01:35 +0100
  • 6aa6b86d49 Make generation of empty KnowledgeBase instances configurable in EntityLinker (#12320) Raphael Mitsch 2023-03-01 16:02:55 +0100
  • da75896ef5 Return Tuple[Span] for all Doc/Span attrs that provide spans (#12288) Adriane Boyd 2023-03-01 16:00:02 +0100
  • 6182213fef Merge branch 'master' into add/exclusive-spancat Adriane Boyd 2023-03-01 15:51:16 +0100
  • 5b9522d83d Displacy doc fix (#12352) kadarakos 2023-03-01 15:38:23 +0100
  • 56aa0cc75f Displacy doc fix (#12352) kadarakos 2023-03-01 15:38:23 +0100
  • 9bd498cdae Fix docstring and docs. Raphael Mitsch 2023-03-01 15:09:24 +0100
  • 257bca3959 Format. Raphael Mitsch 2023-03-01 14:54:03 +0100
  • fa390618c8 Adjust Candidate to support and mandate numerical entity IDs. Raphael Mitsch 2023-03-01 14:50:58 +0100
  • ac3433099d Require thinc v8.1.8 Adriane Boyd 2023-03-01 14:46:47 +0100
  • cfd60d1d7a Require thinc v8.1.7 Adriane Boyd 2023-03-01 14:44:38 +0100
  • 49abf4fb3a Rename Candidate to InMemoryCandidate, BaseCandidate to Candidate. Raphael Mitsch 2023-03-01 14:27:50 +0100
  • 417e8fea8b Update spacy/kb/candidate.py Raphael Mitsch 2023-03-01 13:51:33 +0100
  • 782821ed84 prettier kadarakos 2023-03-01 12:49:40 +0000
  • 21fa22de08 Merge branch 'refactor/el-candidates' of github.com:rmitsch/spaCy into refactor/el-candidates Raphael Mitsch 2023-03-01 13:48:46 +0100
  • f611517ce5 more details for color setting kadarakos 2023-03-01 12:27:34 +0000
  • a94dd3c82c more details for color setting kadarakos 2023-03-01 12:23:43 +0000
  • 3da0712582 Update doc string of BaseCandidate.__init__(). Raphael Mitsch 2023-03-01 13:15:38 +0100
  • 0680958476 Update spacy/kb/candidate.py Raphael Mitsch 2023-03-01 12:42:08 +0100
  • 74cae47bf6 rely on is_empty property instead of __len__ (#12347) Sofie Van Landeghem 2023-03-01 12:06:07 +0100
  • 3a58c19cc1 Switch from pickle to json for custom field serialization. Raphael Mitsch 2023-03-01 11:55:20 +0100
  • 09b302507c Update docstring. Raphael Mitsch 2023-03-01 11:35:40 +0100
  • 1c2b8b07fb Update docstring. Raphael Mitsch 2023-03-01 11:35:28 +0100
  • b4c891e452 Format. Raphael Mitsch 2023-03-01 11:32:37 +0100
  • 322b0050c4 Fix serialization of subclassed KB in tests. Raphael Mitsch 2023-03-01 11:29:23 +0100
  • 7261d724c6 Update docs w.r.t. spacy.CandidateBatchGenerator.v1. (#12350) Raphael Mitsch 2023-03-01 11:01:35 +0100
  • efbc3d37b3 Update docs w.r.t. spacy.CandidateBatchGenerator.v1. (#12350) Raphael Mitsch 2023-03-01 11:01:35 +0100
  • 5ed1db7ae4 Add new tags in docs for #12334 (#12348) Adriane Boyd 2023-03-01 10:46:13 +0100
  • 18f4378a91 Fix error message for displacy auto_select_port (#12343) Adriane Boyd 2023-02-28 16:36:03 +0100
  • be673462be Add new REL_OPs: >+, >-, <+, and <- (#12334) TAN Long 2023-02-28 21:36:33 +0800
  • 33864f1d07 Add new tags in docs for #12334 (#12348) Adriane Boyd 2023-03-01 10:46:13 +0100
  • e45e4419f1 Update docs w.r.t. spacy.CandidateBatchGenerator.v1. Raphael Mitsch 2023-03-01 10:44:11 +0100
  • f1c48baeb5 Remove doc update for batched candidate generation. Raphael Mitsch 2023-03-01 10:43:18 +0100
  • cc137073f6 Update KB serialization tests. Update docs. Raphael Mitsch 2023-03-01 10:38:11 +0100
  • e55e4d6048 add functionality for 'last=False' when adding a pipeline component svlandeg 2023-03-01 10:19:46 +0100
  • 69e2c49385 Add new tags in docs for #12334 Adriane Boyd 2023-03-01 10:03:29 +0100
  • 206fdcef6a rely on is_empty property instead of __len__ svlandeg 2023-03-01 09:39:03 +0100
  • 0b3480cf3f Update spacy/pipeline/tagger.pyx Vinit Ravishankar 2023-02-28 23:39:00 +0530
  • 68f6015e2e Add auxiliary methods for KB (and InMemoryLookupKB). Raphael Mitsch 2023-02-28 17:29:01 +0100
  • 8f058e39bd Fix error message for displacy auto_select_port (#12343) Adriane Boyd 2023-02-28 16:36:03 +0100
  • 50b34751eb Update docs. Raphael Mitsch 2023-02-28 15:38:28 +0100
  • 1aaf2b094d Fix error message for displacy auto_select_port Adriane Boyd 2023-02-28 15:29:22 +0100
  • 8596fb8b88 Change type for mentions to look up entity candidates for to SpanGroup from Iterable[Span]. Raphael Mitsch 2023-02-28 15:28:05 +0100
  • 071667376a Add new REL_OPs: >+, >-, <+, and <- (#12334) TAN Long 2023-02-28 21:36:33 +0800
  • a97ef65b33 Fix .entity_ typo in _add_activations() usage. Raphael Mitsch 2023-02-28 14:22:27 +0100
  • 5a9d8ba73c Format. Raphael Mitsch 2023-02-28 13:56:13 +0100
  • cd98ab4e95 Convert Candidate from Cython to Python class. Raphael Mitsch 2023-02-28 13:49:52 +0100
  • 7e270de51f set default to 0.0, update quickstart vinit 2023-02-28 15:40:33 +0530
  • e2de188cf1 Bugfix/swedish tokenizer (#12315) lise-brinck 2023-02-27 10:53:45 +0100
  • dce0671a26 Revert "Fix FUZZY operator definition (#12318)" (#12336) Adriane Boyd 2023-02-27 09:48:36 +0100
  • 4539fbae17 Revert "Fix FUZZY operator definition (#12318)" (#12336) Adriane Boyd 2023-02-27 09:48:36 +0100
  • e11a4c6a24 Revert argument name change Paul O'Leary McCann 2023-02-27 16:44:20 +0900
  • acdd993071 Matcher performance fix for extension predicates: use shared key function (#12272) Kevin Humphreys 2023-02-26 23:35:08 -0800
  • 2669a9a06e Revert "Fix FUZZY operator definition (#12318)" Adriane Boyd 2023-02-27 08:23:52 +0100
  • b5146fcf6f Add tests for new REL_OPs: >+, >-, <+, and <-. Tan Long 2023-02-26 17:47:26 +0800
  • c6c63db0a0 Add immediate left/right child/parent dependency relations Tan Long 2023-02-26 17:01:30 +0800
  • c50e1150d5 Make Korean tokenizer easier to use Mycroft Kang 2023-02-25 00:46:42 +0900
  • c3defdf66e Filter cut states depending on whether its actions have cost Daniël de Kok 2023-02-24 15:55:00 +0100
  • f6108776aa fix formatting on examples Peter Baumgartner 2023-02-23 10:17:23 -0500
  • d37b2094f7 pull out parameter example data Peter Baumgartner 2023-02-23 09:56:07 -0500
  • b6d9f4c5ab Be more specific in KB serialization test. Raphael Mitsch 2023-02-23 15:14:18 +0100
  • 0a87fd8d22 Add env vars for debug and disabled feature/radicli Ines Montani 2023-02-23 13:47:29 +0100
  • 2b469a8449 Update radicli and add static CLI Ines Montani 2023-02-23 13:45:30 +0100
  • d8ac767e25 Update docs. Raphael Mitsch 2023-02-23 13:27:55 +0100
  • 449883db0f Format. Raphael Mitsch 2023-02-23 12:20:14 +0100
  • 705c4b976e Make empty_kb() configurable. Raphael Mitsch 2023-02-23 12:03:10 +0100
  • df4c069a13 Remove backoff from .vector to .tensor (#12292) Adriane Boyd 2023-02-23 11:36:50 +0100
  • 1e8bac99f3 Add tests for projects to master (#12303) Paul O'Leary McCann 2023-02-23 18:22:57 +0900
  • 14b27eebe2 Fix FUZZY operator definition (#12318) andyjessen 2023-02-23 01:37:40 -0700
  • daedc45d05 Fix FUZZY operator definition (#12318) andyjessen 2023-02-23 01:37:40 -0700
  • 0f0f5db3b5 adjust edit distance in matcher usage docs too svlandeg 2023-02-23 09:28:51 +0100
  • af6d11c9d7 Fix FUZZY operator definition andyjessen 2023-02-22 19:58:55 -0700
  • 4535fc65b4 allow : as infix if succeeding char is uppercase Lise Brinck 2023-02-22 15:09:05 +0100
  • 4b667d3465 removed : from infixes in swedish punctuation.py Lise Brinck 2023-02-22 14:55:58 +0100
  • 8fc2a5f13b create punctuation.py for swedish Lise Brinck 2023-02-22 14:55:26 +0100
  • c4d8fb60bf add unittest for explosion#12311 Lise Brinck 2023-02-22 14:01:42 +0100
  • 0f3b23420b Also test max cuts with NER overfitting Daniël de Kok 2023-02-22 08:42:25 +0100
  • 10f5e9413d Small update_with_oracle_cut_size fixes Daniël de Kok 2023-02-21 15:51:29 +0100
  • e27c60a702 Reimplement distillation with oracle cut size (#12214) Daniël de Kok 2023-02-21 15:47:18 +0100
  • e0d433b005 Write out condition more explicitly Daniël de Kok 2023-02-21 14:38:51 +0100
  • cb54af10a3 Update requirements and fix test Ines Montani 2023-02-21 14:30:38 +0100
  • 413ca22587 run prettier kadarakos 2023-02-20 17:05:50 +0000
  • 6e5e77ea79 update docs kadarakos 2023-02-20 17:03:41 +0000
  • 86d3e78c64 make label mapper private kadarakos 2023-02-20 17:02:27 +0000
  • 344e2d6314 update tests to use label_smoothing = 0.0 vinit 2023-02-20 17:29:18 +0530
  • 1507e357dc black vinit 2023-02-20 17:01:47 +0530
  • bc2e082192 fix seed vinit 2023-02-20 16:59:04 +0530
  • c225991930 update docs vinit 2023-02-20 16:57:31 +0530
  • dc20ba5686 merge vinit 2023-02-20 16:56:30 +0530
  • ee5332eba5 bool -> float vinit 2023-02-20 16:54:37 +0530
  • 813b3551ed Merge branch 'add/exclusive-spancat' of github.com:ljvmiranda921/spaCy into spancat-exclusive kadarakos 2023-02-20 10:52:34 +0000
  • 6f3b257cf4 raise error instead of just print kadarakos 2023-02-20 10:48:41 +0000
  • 43d5cab2c2 Update spacy/pipeline/spancat.py kadarakos 2023-02-20 11:37:51 +0100
  • e847487ebb remove duplicate declaration kadarakos 2023-02-20 10:36:54 +0000
  • af3fa670d4 Update spacy/tests/pipeline/test_spancat.py kadarakos 2023-02-20 11:36:32 +0100