Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

HDF5 isn't perfect but it does this kind of job pretty well. The C, C++, and HDF5 APIs are definitely not fun to use, but there are wonderful and intuitive APIs available in some languages---I'm thinking of Python's h5py here.

Let me add that the OP's experience that HDF5 files were less space efficient than comparable CSV files suggest that something was grossly amiss in his use of HDF5.



(I'm the author of the library) I'm almost certain you're correct - we (thought) we had compression enabled on our feature builder and never found the root cause, but regardless we're happy with how BTables ended up for the other reasons detailed. For future use cases we'll definitely be re-evaluating HDF5!


HDF5 is actually a pleasure to work with in python due to h5py - and it's quite an efficient data format.


Another interesting HDF wrapper in Rust: https://github.com/aldanor/hdf5-rs




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: