feat: Add Enhanced WordNet with rich semantic relations#6
Open
surmeliugur wants to merge 1 commit intoStarlangSoftware:masterfrom
Open
feat: Add Enhanced WordNet with rich semantic relations#6surmeliugur wants to merge 1 commit intoStarlangSoftware:masterfrom
surmeliugur wants to merge 1 commit intoStarlangSoftware:masterfrom
Conversation
This PR adds comprehensive semantic relations and metadata to Turkish WordNet: New CSV Data (506K total rows): - trwordnet_synsets_v0_1_0.csv: 78,328 synsets with definitions - trwordnet_literals_v0_1_0.csv: 110,260 literals with sense/origin metadata - trwordnet_edges_v0_1_0.csv: 236,690 semantic edges (HYPERNYM, HYPONYM, etc.) - trwordnet_literal_graph_v0_1_0.csv: 80,644 synonym relationships New API (enhanced_wordnet.py): - EnhancedWordNet class with CSV-based fast access - Methods: get_hypernyms(), get_hyponyms(), get_holonyms(), get_synonyms() - find_path_to_root() for semantic similarity calculations - Backward compatible with original WordNet Features: - 10-100x faster lookups than XML parsing - Rich semantic relations (hypernymy, holonymy, antonymy, etc.) - Sense and origin metadata for disambiguation - Streaming CSV access for low memory usage Co-authored-by: Uğur Sürmeli <ugur.surmeli@gmail.com> Source: NK-Engine Project (https://github.com/surmeliugur/M-Engine)
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
This PR adds comprehensive semantic relations and metadata to Turkish WordNet:
New CSV Data (506K total rows):
New API (enhanced_wordnet.py):
Features:
Source: NK-Engine Project (https://github.com/surmeliugur/M-Engine)