mirror of
https://github.com/explosion/spaCy.git
synced 2024-11-11 12:18:04 +03:00
Merge pull request #1128 from thinline72/patch-1
Changed the capital of Lithuania to Vilnius
This commit is contained in:
commit
f64e3efc76
|
@ -77,7 +77,7 @@ p
|
||||||
|
|
||||||
+code.
|
+code.
|
||||||
doc1 = nlp(u"Paris is the largest city in France.")
|
doc1 = nlp(u"Paris is the largest city in France.")
|
||||||
doc2 = nlp(u"Ljubljana is the capital of Lithuania.")
|
doc2 = nlp(u"Vilnius is the capital of Lithuania.")
|
||||||
doc3 = nlp(u"An emu is a large bird.")
|
doc3 = nlp(u"An emu is a large bird.")
|
||||||
|
|
||||||
for doc in [doc1, doc2, doc3]:
|
for doc in [doc1, doc2, doc3]:
|
||||||
|
@ -85,13 +85,13 @@ p
|
||||||
print(doc.similarity(other_doc))
|
print(doc.similarity(other_doc))
|
||||||
|
|
||||||
p
|
p
|
||||||
| Even though the sentences about Paris and Ljubljana consist of different
|
| Even though the sentences about Paris and Vilnius consist of different
|
||||||
| words and entities, they both describe the same concept and are seen as
|
| words and entities, they both describe the same concept and are seen as
|
||||||
| more similar than the sentence about emus. In this case, even a misspelled
|
| more similar than the sentence about emus. In this case, even a misspelled
|
||||||
| version of "Ljubljana" would still produce very similar results.
|
| version of "Vilnius" would still produce very similar results.
|
||||||
|
|
||||||
+table
|
+table
|
||||||
- var examples = {"Paris is the largest city in France.": [1, 0.84, 0.65], "Ljubljana is the capital of Lithuania.": [0.84, 1, 0.52], "An emu is a large bird.": [0.65, 0.52, 1]}
|
- var examples = {"Paris is the largest city in France.": [1, 0.85, 0.65], "Vilnius is the capital of Lithuania.": [0.85, 1, 0.55], "An emu is a large bird.": [0.65, 0.55, 1]}
|
||||||
- var counter = 0
|
- var counter = 0
|
||||||
|
|
||||||
+row
|
+row
|
||||||
|
|
Loading…
Reference in New Issue
Block a user