LogoLogo
  • ๐Ÿ‘‹Welcome!
  • Disciplines
    • ๐Ÿš€ Aerospace Engineering
    • ๐Ÿ“’ Accounting and Finance
    • ๐ŸŒฑ Agriculture and Forestry
    • ๐Ÿบ Archaeology
    • ๐Ÿ™๏ธ Architecture and Urban Planning
    • ๐ŸŽจ Art and Art History
    • ๐Ÿš— Automotive Engineering
    • ๐Ÿ”ฌ Biology and Chemistry
    • ๐Ÿงช Chemical Engineering
    • ๐Ÿ’ป Computer Science and Engineering
    • ๐Ÿ’ฌ Communication Studies
    • ๐Ÿณ Culinary Arts
    • ๐Ÿ“Š Data Science and Statistics
    • ๐Ÿ’น Economics and Finance
    • ๐Ÿ“š Education
    • ๐ŸŒ Environmental Law and Policy
    • ๐ŸŒฟ Environmental Science
    • ๐Ÿ‘— Fashion and Textile Design
    • ๐ŸŒ Geography and Geosciences
    • ๐Ÿงฌ Genetics and Genomics
    • ๐Ÿฅ Health and Medicine
    • ๐Ÿ“– History
    • ๐Ÿจ Hospitality and Tourism
    • ๐Ÿ“ฐ Journalism and Media Studies
    • โš–๏ธ Law
    • ๐Ÿ—ฃ๏ธ Linguistics
    • ๐ŸŒŠ Maritime Studies and Oceography
    • โž— Mathematics
    • ๐Ÿ› ๏ธ Mechanical Engineering
    • ๐ŸŽต Music and Musicology
    • ๐ŸŽญ Performing Arts
    • ๐Ÿ’ญ Philosophy
    • ๐ŸŒŒ Physics and Astronomy
    • ๐Ÿ›๏ธ Political Science and International Relations
    • ๐Ÿง  Psychology
    • ๐Ÿ•Š๏ธ Religious Studies
    • ๐Ÿ‘ฅ Social Sciences
    • ๐Ÿƒโ€โ™‚๏ธ Sports Science
    • ๐Ÿพ Veterinary Science
  • Collaborating
    • ๐ŸคHow to contribute
Powered by GitBook
On this page
  • Beautiful Soup
  • Gensim
  • Matplotlib
  • NetworkX
  • NLTK (Natural Language Toolkit)
  • NumPy
  • OCRmyPDF
  • Pandas
  • Plotly
  • spaCy
  • TextBlob
  • Tesseract OCR

Was this helpful?

Edit on GitHub
  1. Disciplines

๐Ÿ“– History

Previous๐Ÿฅ Health and MedicineNext๐Ÿจ Hospitality and Tourism

Last updated 1 year ago

Was this helpful?

Beautiful Soup

  • Description: A library for pulling data out of HTML and XML files.

  • Use Case: Scraping historical data, documents, and archives from websites for digital humanities projects.

  • Documentation:

  • GitHub Repository:

Gensim

  • Description: A robust semantic modeling library, useful for unsupervised topic modeling and natural language processing.

  • Use Case: Analyzing historical texts and documents to uncover thematic structures and trends over time.

  • Documentation:

  • GitHub Repository:

Matplotlib

  • Description: A plotting library for creating static, animated, and interactive visualizations in Python.

  • Use Case: Visualizing historical data, such as timelines, population growth, or economic changes over time.

  • Documentation:

  • GitHub Repository:

NetworkX

  • Description: A Python package for the creation, manipulation, and study of the structure, dynamics, and functions of complex networks.

  • Use Case: Modeling historical events and relationships, such as social networks, trade routes, or communication networks in historical contexts.

NLTK (Natural Language Toolkit)

  • Description: A leading platform for building Python programs to work with human language data.

  • Use Case: Text analysis and linguistic study of historical documents, including language evolution, stylistic changes, and content analysis.

NumPy

  • Description: Fundamental package for scientific computing with Python.

  • Use Case: Handling numerical data for statistical analysis in historical research.

OCRmyPDF

  • Description: Adds an OCR text layer to PDF files, allowing them to be searched.

  • Use Case: Converting scanned historical documents and texts into searchable and analyzable PDF formats.

Pandas

  • Description: Data analysis and manipulation library.

  • Use Case: Organizing, analyzing, and manipulating historical datasets, such as census data, economic records, or archaeological findings.

Plotly

  • Description: An interactive graphing library.

  • Use Case: Creating interactive visualizations for presenting historical data and findings.

spaCy

  • Description: An open-source software library for advanced natural language processing.

  • Use Case: Processing and analyzing large volumes of historical texts for semantic content, named entity recognition, and thematic analysis.

TextBlob

  • Description: A library for processing textual data, providing simple APIs for common natural language processing tasks.

  • Use Case: Sentiment analysis, part-of-speech tagging, and classification of historical narratives and documents.

Tesseract OCR

  • Description: An optical character recognition (OCR) engine.

  • Use Case: Extracting text from images of historical documents, enabling digitization and analysis of archival materials.


Documentation:

GitHub Repository:

Documentation:

GitHub Repository:

Documentation:

GitHub Repository:

Documentation:

Documentation:

GitHub Repository:

Documentation:

GitHub Repository:

Documentation:

GitHub Repository:

Documentation:

GitHub Repository:

Documentation:

Beautiful Soup Documentation
Beautiful Soup GitHub
Gensim Documentation
Gensim GitHub
Matplotlib Documentation
Matplotlib GitHub
NetworkX Documentation
NetworkX GitHub
NLTK Documentation
NLTK GitHub
NumPy Documentation
NumPy GitHub
OCRmyPDF GitHub
Pandas Documentation
Pandas GitHub
Plotly Documentation
Plotly GitHub
spaCy Documentation
spaCy GitHub
TextBlob Documentation
TextBlob GitHub
Tesseract OCR GitHub