# 🗣️ Linguistics

### Gensim

* **Description**: A robust library for unsupervised topic modeling and natural language processing, using modern statistical machine learning.
* **Use Case**: Analyzing linguistic corpora, identifying semantic structure, and researching topics over large text datasets.
* **Documentation**: [Gensim Documentation](https://radimrehurek.com/gensim/)
* **GitHub Repository**: [Gensim GitHub](https://github.com/RaRe-Technologies/gensim)

### NLTK (Natural Language Toolkit)

* **Description**: A leading platform for building Python programs to work with human language data.
* **Use Case**: A wide range of linguistic tasks including tokenization, stemming, tagging, parsing, and semantic reasoning.
* **Documentation**: [NLTK Documentation](https://www.nltk.org/)
* **GitHub Repository**: [NLTK GitHub](https://github.com/nltk/nltk)

### NumPy

* **Description**: The fundamental package for scientific computing with Python.
* **Use Case**: Handling numerical and statistical operations that are common in computational linguistics and language modeling.
* **Documentation**: [NumPy Documentation](https://numpy.org/doc/)
* **GitHub Repository**: [NumPy GitHub](https://github.com/numpy/numpy)

### Pandas

* **Description**: Data analysis and manipulation library.
* **Use Case**: Organizing, analyzing, and manipulating linguistic datasets, such as corpora annotations, language use statistics, and experimental data.
* **Documentation**: [Pandas Documentation](https://pandas.pydata.org/)
* **GitHub Repository**: [Pandas GitHub](https://github.com/pandas-dev/pandas)

### Polyglot

* **Description**: A natural language pipeline that supports massive multilingual applications.
* **Use Case**: Multilingual entity recognition, sentiment analysis, language detection, and tokenization for linguistic research across different languages.
* **Documentation**: [Polyglot Documentation](https://polyglot.readthedocs.io/en/latest/)
* **GitHub Repository**: [Polyglot GitHub](https://github.com/aboSamoor/polyglot)

### Pyphen

* **Description**: A pure Python module to hyphenate text using existing hyphenation dictionaries.
* **Use Case**: Text processing for linguistic analysis that requires syllable segmentation or text justification in various languages.
* **Documentation**: [Pyphen Documentation](https://pyphen.org/)
* **GitHub Repository**: [Pyphen GitHub](https://github.com/Kozea/Pyphen)

### scikit-learn

* **Description**: Machine learning in Python.
* **Use Case**: Applying machine learning techniques to linguistic data for classification, clustering, and predictive modeling of language phenomena.
* **Documentation**: [scikit-learn Documentation](https://scikit-learn.org/stable/)
* **GitHub Repository**: [scikit-learn GitHub](https://github.com/scikit-learn/scikit-learn)

### spaCy

* **Description**: An open-source library for advanced natural language processing.
* **Use Case**: Parsing, tagging, and extracting semantic information from text, ideal for building linguistic models and analyzing language structure.
* **Documentation**: [spaCy Documentation](https://spacy.io/)
* **GitHub Repository**: [spaCy GitHub](https://github.com/explosion/spaCy)

### SpeechRecognition

* **Description**: A library for performing speech recognition, with support for several engines and APIs, online and offline.
* **Use Case**: Transcribing spoken language into text, useful in phonetics, phonology, and spoken language studies.
* **Documentation**: [SpeechRecognition Documentation](https://pypi.org/project/SpeechRecognition/)
* **GitHub Repository**: [SpeechRecognition GitHub](https://github.com/Uberi/speech_recognition)

### TextBlob

* **Description**: A library for processing textual data, providing simple APIs for common natural language processing tasks.
* **Use Case**: Sentiment analysis, part-of-speech tagging, and noun phrase extraction for linguistic analysis and language teaching.
* **Documentation**: [TextBlob Documentation](https://textblob.readthedocs.io/en/dev/)
* **GitHub Repository**: [TextBlob GitHub](https://github.com/sloria/TextBlob)


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.pyclubs.org/python-across-all-disciplines/disciplines/linguistics.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
