Acoustic-Image-Generation

Code for the paper "Audio-Visual Localization by Acoustic Image Generation", AAAI 2021

Requirements

The main script is used for training and testing the different models: UNet, DualCamNet with real and generated images.
The showimages_bb plots FlickrSoundnet energy from a UNet checkpoint and list of tfrecords of FlickrSoundnet.
The showimages plots ACIVW and AVIA energy from a UNet checkpoint and list of testing tfrecords.
The showvideo plots VGG Sound energy from a UNet checkpoint and list of tfrecords of a video.
The meanstd computes the mean the computed metrics of 5 experiments excluding for each metric min and max values and saves them in a xlsx file.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
dataloader		dataloader
logger		logger
models		models
scripts		scripts
trainer		trainer
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
areaundercurve.py		areaundercurve.py
convert_data.py		convert_data.py
convert_data2.py		convert_data2.py
convert_data3.py		convert_data3.py
convert_data4.py		convert_data4.py
csvtxt.py		csvtxt.py
decodeimages.py		decodeimages.py
decodeimagesacresnet.py		decodeimagesacresnet.py
decodeimagesfusion.py		decodeimagesfusion.py
decodeimagesj.py		decodeimagesj.py
decodeimagesshow.py		decodeimagesshow.py
extract_features.py		extract_features.py
extract_features_unetraces.py		extract_features_unetraces.py
extract_fusion.py		extract_fusion.py
extract_j.py		extract_j.py
extract_triplet.py		extract_triplet.py
framecount.py		framecount.py
iouenergythreshold.py		iouenergythreshold.py
knn.py		knn.py
main.py		main.py
mean.py		mean.py
meanstd.py		meanstd.py
readave.py		readave.py
readcsv.py		readcsv.py
retrieve.py		retrieve.py
saveimagesresnet.py		saveimagesresnet.py
showimages.py		showimages.py
showimages_bb.py		showimages_bb.py
showimages_collected.py		showimages_collected.py
showimagesnotcorrespond.py		showimagesnotcorrespond.py
showvideo.py		showvideo.py
video.py		video.py