(In Python) Can you save an object that is in memory to disk and reload it at a later time?

spacemanspiffy@lemmy.world · 1 year ago

(In Python) Can you save an object that is in memory to disk and reload it at a later time?

solrize@lemmy.world · 1 year ago

The quick answer is to use a serialization/deserialization library like pickle. You can’t just dump a binary image and reload it in any simple way.

UnfortunateShort@lemmy.world · 1 year ago

I think pickle is what you want.

Keep in mind that this might have a huge performance impact if you do it all the time - it’s still IO even when it’s not parsing.

spacemanspiffy@lemmy.world · 1 year ago

My idea would be to load one larger file one time and not parse anything, and keep it in memory the entire time. Versus what it does now which is load the files and parse them and keep everything in memory.

But three people responding here so far with “pickle” so maybe that is the way.

UnfortunateShort@lemmy.world · 1 year ago

You can stuff all the info into an object and use it this way, no problem. I just wanted to point out that this doesn’t have zero performance impact compared to what you currently have.

So (depending on how your OS caches files) you might not want to do this like twice in a lambda that you pass to an iterator over a huge slice or something.

WolfLink@sh.itjust.works · 1 year ago

What is the “executable” in this context? I’m kinda confused as to what you are looking for.

What’s wrong with parsing the input files at runtime? Is it performance? Do you want one file to load instead of multiple?

Many have suggested pickle, which is kinda what you are asking for, but on some level it’s not much different from parsing the input files. Also, depending on your code, you may have to write custom serialization code as part of getting pickle to work.

Note that pretty much every modern game is a bundle of often multiple pieces of executable code alongside a whole bunch of separate assets.

kevincox@lemmy.ml · 1 year ago

I don’t want the end executable to have to bundle these files and re-parse them each time it gets run.

No matter how you persist data you will need to re-parse it. The question is really just if the new format is more efficient to read than the old format. Some formats such as FlatBuffers and Cap'n Proto are designed to have very efficient loading processes.

(Well technically you could persist the process image to disk, but this tends to be much larger than serialized data would be and has issues such as defeating ASLR. This is very rarely done.)

Lots of people are talking about Pickle. But it isn’t particularly fast. That being side with Python you can’t expect much to start with.