NestIO for large datasets #1001
Open
Hi!
I was recently working together with @jasperalbers, who is using the `NestIO` to load simulated data from large-scale multi-area models.
His problem was that he has hundreds of thousands of neurons (around 1 GB in total), which, when saved as `neo.SpikeTrain` objects, would take hours to load from disk (with HDF5, pickle and nix alike). Incredible amounts of time were spent building the neo objects themselves. We found a rather unorthodox workaround to this problem: saving the spikes directly as lists of lists, which brought the load time down to a few seconds. We wrote a couple of extra functions for the `NestIO` to load the spike times as lists of lists, alongside the neuron IDs. This is obviously not ideal from a metadata perspective, but we thought it might still be a useful feature to have, especially for agile analysis of large simulated data. Let us know if these functions are any good; if you think they are worth including, I can also write some tests.
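
For illustration, here is a minimal sketch of the lists-of-lists idea, independent of the `NestIO` internals. It assumes a plain-text NEST spike file with two whitespace-separated columns (sender GID and spike time in ms); the function name `read_spikes_as_lists` and the file name `spikes.gdf` are only placeholders to show the approach, not the exact code added in this PR.

```python
import numpy as np


def read_spikes_as_lists(filename):
    """Return (neuron_ids, spike_times) without building neo.SpikeTrain objects.

    neuron_ids  : list of int, one entry per neuron that fired at least once
    spike_times : list of lists of float, spike times (ms) per neuron,
                  in the same order as neuron_ids
    """
    # Hypothetical plain-text input: column 0 = sender GID, column 1 = time (ms)
    data = np.loadtxt(filename)
    gids = data[:, 0].astype(int)
    times = data[:, 1]

    # Sort by GID so each neuron's spikes form one contiguous block
    # (stable sort keeps the original time order within each neuron).
    order = np.argsort(gids, kind="stable")
    gids, times = gids[order], times[order]
    unique_gids, start_idx = np.unique(gids, return_index=True)

    neuron_ids = unique_gids.tolist()
    spike_times = [block.tolist() for block in np.split(times, start_idx[1:])]
    return neuron_ids, spike_times


if __name__ == "__main__":
    ids, spikes = read_spikes_as_lists("spikes.gdf")
    print(f"{len(ids)} neurons, {sum(len(s) for s in spikes)} spikes in total")
```

The point is simply that plain Python lists and NumPy arrays avoid the per-object construction cost of `neo.SpikeTrain`, which is where the hours were being spent.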
Best,
Aitor