๐Ÿ—ฃ๏ธ Linguistics

Last updated 1 year ago

Gensim

  • Description: A robust library for unsupervised topic modeling and natural language processing, using modern statistical machine learning.

  • Use Case: Analyzing linguistic corpora, identifying semantic structure, and researching topics over large text datasets.

  • Documentation: Gensim Documentation

  • GitHub Repository: Gensim GitHub

NLTK (Natural Language Toolkit)

  • Description: A leading platform for building Python programs to work with human language data.

  • Use Case: A wide range of linguistic tasks including tokenization, stemming, tagging, parsing, and semantic reasoning.

  • Documentation: NLTK Documentation

  • GitHub Repository: NLTK GitHub
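
The following minimal sketch shows tokenization, stemming, and frequency counting with NLTK. It deliberately uses only components that work without downloading extra NLTK data (a regex-based tokenizer and the Porter stemmer); tagging and parsing typically require additional resources fetched with `nltk.download`. The sample sentence is invented.

```python
from nltk import FreqDist, ngrams
from nltk.stem import PorterStemmer
from nltk.tokenize import wordpunct_tokenize

text = "The cats were chasing the running mice."

# Regex-based tokenization; keep alphabetic tokens and lowercase them.
tokens = [t.lower() for t in wordpunct_tokenize(text) if t.isalpha()]

# Reduce each token to its Porter stem.
stemmer = PorterStemmer()
stems = [stemmer.stem(t) for t in tokens]

# Token frequencies and adjacent word pairs.
freq = FreqDist(tokens)
bigrams = list(ngrams(tokens, 2))

print(stems)
print(freq.most_common(1))
print(bigrams[:2])
```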

NumPy

  • Description: The fundamental package for scientific computing with Python.

  • Use Case: Handling numerical and statistical operations that are common in computational linguistics and language modeling.

  • Documentation: NumPy Documentation

  • GitHub Repository: NumPy GitHub
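
As one example of the numerical work involved in language modeling, this sketch turns word counts into a unigram probability distribution and computes its Shannon entropy and perplexity. The counts are hypothetical.

```python
import numpy as np

# Hypothetical counts for a four-word vocabulary.
counts = np.array([4, 2, 1, 1])

# Maximum-likelihood unigram probabilities.
probs = counts / counts.sum()

# Shannon entropy in bits, and the corresponding perplexity.
entropy = -np.sum(probs * np.log2(probs))
perplexity = 2 ** entropy

print(probs)       # [0.5   0.25  0.125 0.125]
print(entropy)     # 1.75
print(perplexity)  # about 3.36
```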

Pandas

  • Description: Data analysis and manipulation library.

  • Use Case: Organizing, analyzing, and manipulating linguistic datasets, such as corpus annotations, language use statistics, and experimental data.

  • Documentation: Pandas Documentation

  • GitHub Repository: Pandas GitHub
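
A small sketch of how token-level annotations might be held in a DataFrame and summarized. The column names (`sentence_id`, `token`, `pos`) and the data are assumptions made up for illustration, not a standard schema.

```python
import pandas as pd

# Hypothetical token-level annotations: one row per token.
tokens = pd.DataFrame(
    {
        "sentence_id": [1, 1, 1, 2, 2, 2, 2],
        "token": ["Dogs", "bark", "loudly", "The", "cat", "sleeps", "quietly"],
        "pos": ["NOUN", "VERB", "ADV", "DET", "NOUN", "VERB", "ADV"],
    }
)

# How often each part-of-speech tag occurs.
pos_counts = tokens["pos"].value_counts()

# Number of tokens per sentence.
sentence_lengths = tokens.groupby("sentence_id")["token"].count()

print(pos_counts)
print(sentence_lengths)
```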

Polyglot

  • Description: A natural language pipeline that supports massive multilingual applications.

  • Use Case: Multilingual entity recognition, sentiment analysis, language detection, and tokenization for linguistic research across different languages.

  • Documentation: Polyglot Documentation

  • GitHub Repository: Polyglot GitHub

Pyphen

  • Description: A pure Python module to hyphenate text using existing hyphenation dictionaries.

  • Use Case: Text processing for linguistic analysis that requires syllable segmentation or text justification in various languages.

  • Documentation: Pyphen Documentation

  • GitHub Repository: Pyphen GitHub

scikit-learn

  • Description: Machine learning in Python.

  • Use Case: Applying machine learning techniques to linguistic data for classification, clustering, and predictive modeling of language phenomena.

  • Documentation: scikit-learn Documentation

  • GitHub Repository: scikit-learn GitHub
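
To make the classification use case concrete, here is a minimal sketch of a language identifier built from character n-gram counts and a naive Bayes classifier. The six training sentences and their labels are invented, and a model this small is only a demonstration of the pipeline, not a usable classifier.

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

# Hypothetical training data: three English and three Spanish sentences.
texts = [
    "the cat sat on the mat",
    "where is the train station",
    "this is a good book",
    "el gato está en la casa",
    "dónde está la estación",
    "este es un buen libro",
]
labels = ["en", "en", "en", "es", "es", "es"]

# Character n-grams (1 to 3 characters, within word boundaries) as features.
model = make_pipeline(
    CountVectorizer(analyzer="char_wb", ngram_range=(1, 3)),
    MultinomialNB(),
)
model.fit(texts, labels)

predictions = model.predict(["the book is on the table", "la casa es buena"])
print(list(predictions))
```

Character n-grams are used here because they capture spelling patterns without needing a tokenizer tuned to each language.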

spaCy

  • Description: An open-source library for advanced natural language processing.

  • Use Case: Parsing, tagging, and extracting semantic information from text, ideal for building linguistic models and analyzing language structure.

  • Documentation: spaCy Documentation

  • GitHub Repository: spaCy GitHub
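
The sketch below uses a blank English pipeline, which needs no downloaded model, to tokenize text and split it into sentences with the rule-based `sentencizer` component. It assumes spaCy v3, where components are added by string name. Part-of-speech tagging and dependency parsing require a trained pipeline (for example `en_core_web_sm`), which is not used here.

```python
import spacy

# Blank English pipeline: tokenizer only, no trained components.
nlp = spacy.blank("en")

# Rule-based sentence splitting on punctuation (spaCy v3 API).
nlp.add_pipe("sentencizer")

doc = nlp("Linguists study language. They don't only study grammar!")

sentences = [sent.text for sent in doc.sents]
tokens = [token.text for token in doc]

print(sentences)
print(tokens)
```

Note that the tokenizer splits the contraction "don't" into two tokens, which matters when counting words or aligning annotations.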

SpeechRecognition

  • Description: A library for performing speech recognition, with support for several engines and APIs, online and offline.

  • Use Case: Transcribing spoken language into text, useful in phonetics, phonology, and spoken language studies.

  • Documentation: SpeechRecognition Documentation

  • GitHub Repository: SpeechRecognition GitHub

TextBlob

  • Description: A library for processing textual data, providing simple APIs for common natural language processing tasks.

  • Use Case: Sentiment analysis, part-of-speech tagging, and noun phrase extraction for linguistic analysis and language teaching.

  • Documentation: TextBlob Documentation

  • GitHub Repository: TextBlob GitHub
