Matthew Honnibal
|
14eafcab15
|
* Refactor to use vector[bint]
|
2015-07-12 05:27:47 +02:00 |
|
Matthew Honnibal
|
6a6e852a39
|
* Refactor huffman coding stuff into class
|
2015-07-12 05:06:36 +02:00 |
|
Matthew Honnibal
|
aad96fdb5c
|
* Improve efficiency of huffman coding
|
2015-07-12 01:31:37 +02:00 |
|
Matthew Honnibal
|
ff9ff6f3fa
|
* Ensure unseen words are given low log probability
|
2015-07-12 01:31:09 +02:00 |
|
Matthew Honnibal
|
9d3b0d83de
|
* Refactor huffman coding
|
2015-07-11 22:27:43 +02:00 |
|
Matthew Honnibal
|
8d29406cd6
|
* Rename span.right to span.rights
|
2015-07-11 22:15:04 +02:00 |
|
Matthew Honnibal
|
da9f358166
|
* Fix span getting
|
2015-07-11 21:41:41 +02:00 |
|
Matthew Honnibal
|
11e8f2ffb4
|
* Huffman codes working
|
2015-07-11 20:01:10 +02:00 |
|
Matthew Honnibal
|
cb6fc81909
|
* Work on huffman coding.
|
2015-07-11 15:23:35 +02:00 |
|
Matthew Honnibal
|
4c9b77fe95
|
* Begin working on serialization code
|
2015-07-11 10:57:30 +02:00 |
|
Matthew Honnibal
|
53d1f5b2eb
|
* Rename Span.head to Span.root.
|
2015-07-09 17:30:58 +02:00 |
|
Matthew Honnibal
|
c0255ed7d8
|
* Allow slice indexing in Doc.__getitem__, returning a Span object
|
2015-07-09 15:15:32 +02:00 |
|
Matthew Honnibal
|
89a91ad726
|
* Add SPACE part-of-speech tag, and train tagger to assign it. Also train tagger not to make whitespace an entity
|
2015-07-09 13:30:41 +02:00 |
|
Matthew Honnibal
|
55f1042443
|
* Improve efficiency of L and R features, correcting the non-linear-in-length problem.
|
2015-07-09 12:17:26 +02:00 |
|
Matthew Honnibal
|
70d2acb579
|
* Fix edge features
|
2015-07-09 12:15:01 +02:00 |
|
Matthew Honnibal
|
adb868bdad
|
* Add warning for models not found in parser
|
2015-07-08 20:04:55 +02:00 |
|
Matthew Honnibal
|
05b28ec9eb
|
* Add warning for models not found in parser
|
2015-07-08 20:02:13 +02:00 |
|
Matthew Honnibal
|
ef700401a6
|
* Add warning for models not found in parser
|
2015-07-08 20:00:46 +02:00 |
|
Matthew Honnibal
|
6218d8b389
|
* Add warning for models not found in parser
|
2015-07-08 19:59:16 +02:00 |
|
Matthew Honnibal
|
f6a6c39ce8
|
* Add warning for models not found in parser
|
2015-07-08 19:52:30 +02:00 |
|
Matthew Honnibal
|
78db7e32f7
|
* Remove has_sense method from Lexeme declaration
|
2015-07-08 19:41:20 +02:00 |
|
Matthew Honnibal
|
6ddb2f5e45
|
* Restore merge_mwe in English class
|
2015-07-08 19:35:30 +02:00 |
|
Matthew Honnibal
|
6859f6adac
|
* Restore merge_mwe in English class
|
2015-07-08 19:34:55 +02:00 |
|
Matthew Honnibal
|
3c270fc8ff
|
* Remove has_sense method from Lexeme
|
2015-07-08 19:28:29 +02:00 |
|
Matthew Honnibal
|
b64c843861
|
* Remove senses attr
|
2015-07-08 19:26:24 +02:00 |
|
Matthew Honnibal
|
1d3a592edf
|
* Remove the senses attr from LexemeC, to keep data compatibility
|
2015-07-08 19:24:44 +02:00 |
|
Matthew Honnibal
|
0ceb1f71c2
|
* Update parse features
|
2015-07-08 19:11:36 +02:00 |
|
Matthew Honnibal
|
2e51b5027a
|
* Alias Doc to Tokens, for backwards compatibility
|
2015-07-08 18:59:35 +02:00 |
|
Matthew Honnibal
|
e3c53f5ecd
|
* Fix mention of Tokens in docstring
|
2015-07-08 18:56:27 +02:00 |
|
Matthew Honnibal
|
bb522496dd
|
* Rename Tokens to Doc
|
2015-07-08 18:53:00 +02:00 |
|
Matthew Honnibal
|
b24e8be2b9
|
* Whitespace in docstring
|
2015-07-08 12:37:03 +02:00 |
|
Matthew Honnibal
|
abc43b852d
|
* Add pos_tags attr to Vocab.
|
2015-07-08 12:36:38 +02:00 |
|
Matthew Honnibal
|
935bcdf3e5
|
* Remove redundant tag_names argument to Tokenizer
|
2015-07-08 12:36:04 +02:00 |
|
Matthew Honnibal
|
ff885e8511
|
* Add ParserFactory convenience function
|
2015-07-08 12:35:46 +02:00 |
|
Matthew Honnibal
|
4e4fac452b
|
* Refactor __init__ for simplicity. Allow parse=True, tag=True etc flags to be passed at top-level. Do not lazy-load parser.
|
2015-07-08 12:35:29 +02:00 |
|
Matthew Honnibal
|
1d2deb4616
|
* Work on refactoring default arguments to English.__init__
|
2015-07-07 15:53:25 +02:00 |
|
Matthew Honnibal
|
2d0e99a096
|
* Pass pos_tags into Tokenizer.from_dir
|
2015-07-07 14:23:08 +02:00 |
|
Matthew Honnibal
|
6788c86b2f
|
* Begin refactor
|
2015-07-07 14:00:07 +02:00 |
|
Matthew Honnibal
|
52fd80c6c6
|
* Add experimental supersense features for parsing, based on lookup into wordnet.
|
2015-07-01 20:12:44 +02:00 |
|
Matthew Honnibal
|
e6d828a9af
|
* Set up an array POS_SENSES that denotes the set of valid senses for each POS tag. This way, we can do bitwise & between a lexeme's senses and the ones available for its POS tag, to get the allowable senses for the token.
|
2015-07-01 20:12:13 +02:00 |
|
Matthew Honnibal
|
2b8459d9a8
|
* Add senses flag to Lexeme
|
2015-07-01 20:10:41 +02:00 |
|
Matthew Honnibal
|
e23d1582a2
|
* Add supersense data to Lexeme objects. Add simple has_sense method to check the flag.
|
2015-07-01 18:50:37 +02:00 |
|
Matthew Honnibal
|
64fafa98be
|
* Add senses.pyx and senses.pxd
|
2015-07-01 18:49:44 +02:00 |
|
Matthew Honnibal
|
94dab94e5f
|
Merge branch 'master' of https://github.com/honnibal/spaCy
|
2015-06-30 18:16:26 +02:00 |
|
Matthew Honnibal
|
9af86b0b0b
|
* Fix attrs.pxd
|
2015-06-30 18:16:30 +02:00 |
|
Matthew Honnibal
|
af9c82f7a6
|
Merge branch 'master' of https://github.com/honnibal/spaCy
|
2015-06-30 18:11:37 +02:00 |
|
Matthew Honnibal
|
5d595b5a8c
|
* Inc versions
|
2015-06-30 18:11:06 +02:00 |
|
Matthew Honnibal
|
d2eeba6667
|
* Start wiring up color and emotion lexicons. Hopefully we get to use them.
|
2015-06-30 16:22:23 +02:00 |
|
Matthew Honnibal
|
3bb5876c5a
|
* Inline methods in StateClass
|
2015-06-29 01:10:14 +02:00 |
|
Matthew Honnibal
|
a02fd3af5d
|
* Check valency in L and R feature methods, to make feature calculation faster
|
2015-06-29 00:27:56 +02:00 |
|