Amazon Sentiment Analysis

An amazon scraper, data cleaner, and transformer sentiment model, intended to systematically test Google Translate. The rest of this README is from the repository I cloned originally, which contained code for a transformer model I adapted to my own uses. My one regret is that I didn't save the translations Google Translate gave me; I observed that the results were quite stable, and didn't signifcantly change when adding more data, but as a result of this it's harder to retest that it could've been.

former

Simple transformer implementation from scratch in pytorch. See http://peterbloem.nl/blog/transformers for an in-depth explanation.

Limitations

The current models are designed to show the simplicity of transformer models and self-attention. As such they will not scale as far as the bigger transformers. For that you'll need a number of tricks that complicate the code (see the blog post for details).

All models so far are a single stack of transformer blocks (that is, no encoder/decoder structures). It turns out that this simple configuration often works best.

Use

You can clone the code and run the experiments from the root directory. E.g.

python experiments/classify.py

Hyperparameters are passed as command line arguments. The defaults should work well. The classification data is automatically downloaded, the wikipedia data is included in the repository.

You should be able to install as a package as well, with

pip install git+https://github.com/pbloem/former

but I haven't tried this. It's probably easier to just copy over the code you need. Let me know if you need this for anything and it doesn't work.

Requirements

Python 3.6+ is required.

The following should install all requirements pip install torch tb-nightly tqdm numpy torchtext

You may also need pip install future depending on the exact python version.

conda environment

The file environment.yml describes a complete conda environment with all dependencies. After cloning or downloading the project, you create the environment as follows:

conda env create -f environment.yml --name former
conda activate former

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
data		data
experiments		experiments
former		former
runs		runs
tests		tests
LICENSE		LICENSE
README.md		README.md
captcha.html		captcha.html
debug.log		debug.log
environment.yml		environment.yml
ghostdriver.log		ghostdriver.log
modelDE.pkl		modelDE.pkl
modelEN.pkl		modelEN.pkl
modelJA.pkl		modelJA.pkl
prodlistDE.pkl		prodlistDE.pkl
prodlistEN.pkl		prodlistEN.pkl
prodlistJA.pkl		prodlistJA.pkl
prodlistTR.pkl		prodlistTR.pkl
reviewsDE.json		reviewsDE.json
reviewsDE.pkl		reviewsDE.pkl
reviewsEN.json		reviewsEN.json
reviewsEN.pkl		reviewsEN.pkl
reviewsJA.json		reviewsJA.json
reviewsJA.pkl		reviewsJA.pkl
reviewsTR.pkl		reviewsTR.pkl
setup.py		setup.py
test.py		test.py
translationtest-16cc3ba68b55.json		translationtest-16cc3ba68b55.json
translationtest-4b7abe263885.json		translationtest-4b7abe263885.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Amazon Sentiment Analysis

former

Limitations

Use

Requirements

conda environment

About

Uh oh!

Releases

Packages

Languages

License

Quantum1000/Amazon-Sentiment

Folders and files

Latest commit

History

Repository files navigation

Amazon Sentiment Analysis

former

Limitations

Use

Requirements

conda environment

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages