A textual corpus database for the digital humanities.
-
Updated
Jul 26, 2020 - Jupyter Notebook
A textual corpus database for the digital humanities.
A tool for extracting chapters from Gutenberg Project Italian raw text e-books. RegEx are used to match chapter headings and extract the text between them.
Explore the Hávamál Interactive Site, an open-source project that combines digital humanities and cultural heritage to present ancient Norse wisdom. Featuring daily stanzas and interactive design, the site highlights themes of Virtue, Folly, and Wisdom while promoting community engagement and cultural preservation through digital storytelling.
Early research applying sentiment analysis to modernist literature (Woolf's To the Lighthouse). Introduces the "distributed heroine" model and middle reading methodology.
This is a "literary style imitation algorithm". The primary purpose is to mimic the style and tone of the original text. It creates new content based on the input text rather than directly copying existing content. It uses Markov chains for sentence generation and the ChatGPT-API for grammar cleanup.
Материалы тьюториала на III Московско-тартуской школе
A tool for converting French literary text into S-expression syntax trees for linguistic analysis, with visualization capabilities
Implementation of Distant Reading on Dystopian Literature (2) — Sentiment Analysis and Emotion Classification
This repository contains the source code for The Postmodern Generator, a tool designed to generate text that mimics the style of academic postmodern criticism. It offers customizable and extensible text generation features without relying on large language models.
Turn poems into living music: NLP meets melody, creative code, and literary alchemy.
Data from Mapping Balzac + Mapping Proust
Implementation of the NACL 2016 Best paper "Unsupervised Learning for Dynamic Fictional Relationships by Iyyer et al"
Jena Corpus of Expository and Fictional Prose; A Corpus of Canonical, Non-Canonical, Non-Fictional Texts
Complexity Analysis of Literary texts using BERT AND RoBERTa
Contains data of the Ficiton4 corpus and for our experiment on literary sentiment evocation
Trope detection using LLaMA: trope - llama
Character Network of Alfred de Musset's play Lorenzaccio
Gibson encoded cognitive mismatch in 1984, before the vocabulary existed to name it — a forensic, experiential reading of Neuromancer.
This is the final project for my Intro to Data Science (BIOS 611) course. We were tasked with applying data science techniques to a novel data set. I chose to conduct dimensionality reduction of "The Sound and the Fury" by William Faulkner to visualize semantic and temporal differences between narrators
Trope Miner — a local, privacy-first pipeline that mines narrative tropes from fiction using embeddings + LLMs, with review UI, span verification, semantic seeding, and calibration.
Add a description, image, and links to the literary-analysis topic page so that developers can more easily learn about it.
To associate your repository with the literary-analysis topic, visit your repo's landing page and select "manage topics."