Gibberish Detector

Overview

The Gibberish Detector is a simple Python script that evaluates whether a given text consists of meaningful words. It does this by comparing words in the input text against a predefined set of known words stored in words_set.pkl.

How It Works

The script loads a set of known words from words_set.pkl, which is included in this repository.
It defines a function, words_check(text), that:
- Converts the input text to lowercase.
- Splits the text into individual words.
- Checks how many words exist in the known words set.
- Returns a score between 0 and 1, representing the proportion of recognized words.

Installation

Clone this repository: git clone https://github.com/LMArantes/gibberish-detector.git cd gibberish-detector
Ensure you have Python 3 installed.

Usage

Import and use the words_check function:

from detector import words_check

text = "Hello world"
score = words_check(text)
print(f"Score: {score}")

Command-Line Usage

For users who prefer not to modify the code, the script can be run from the command line.

Check a Direct Text Input

You can analyze a text string directly by running: python detector_cli.py -t "Hello world"

Analyze a Text File

To analyze a .txt file, provide its path: python detector_cli.py -f path/to/file.txt

Score Interpretation

1.0 → All words are recognized.
0.0 → No words are recognized (likely gibberish).
A score between 0.0 and 1.0 indicates partial recognition.

License

This project is licensed under the Modified Attribution License (MAL) v1.0.

Buy me a coffee! ☕❤️

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
docs		docs
LICENSE		LICENSE
README.md		README.md
detector.py		detector.py
detector_cli.py		detector_cli.py
words_set.pkl		words_set.pkl

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Gibberish Detector

Overview

How It Works

Installation

Usage

Command-Line Usage

Check a Direct Text Input

Analyze a Text File

Score Interpretation

License

About

Uh oh!

Languages

License

LMArantes/gibberish-detector

Folders and files

Latest commit

History

Repository files navigation

Gibberish Detector

Overview

How It Works

Installation

Usage

Command-Line Usage

Check a Direct Text Input

Analyze a Text File

Score Interpretation

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Languages