19th June, 2019
Time: 20:00 BST (15:00 EDT).
Joining instructions see above.
Intend to attend: Josh, RyanW, …
Attending: Josh, Martin, Ward, John, Ryan, Tony
Standing items
Introductions: new joinees should feel welcome to introduce themselves and their interest in zarr/n5.
Items
-
Ward: Unidata blog post.
-
https://www.unidata.ucar.edu/blogs/news/entry/netcdf-and-native-cloud-storage
-
Pretty good resonance
-
More technical details coming under developer blog
-
-
Josh: upcoming bioimaging blog post as well
-
“No new formats”
-
With technical follow up as well.
-
-
Ward: question
-
Meeting with various groups (incl. hdfgroup)
-
Re: HPC data storage
-
“How well does zarr perform in a super computer environment?”
-
Josh: Saalfeld’s spark usage at Janelia
-
Martin: may not be that much benefit on the reading side
-
Ryan: the question is do you have multiple write heads
-
-
Benchmarks in general
- Ward: generalizable dataset of benchmarks (existing or created)
would be good
- Dimensions: Chunk size, different filesystems (cloud, parallel,
hadoop, local)
- Ward: generalizable dataset of benchmarks (existing or created)
-
Ryan: peripheral feature - sparse arrays
- https://github.com/daletovar/zsparse
(thin wrapper)
-
3 1-d array is stored like a chunk zarr array
-
A few fixes sitting locally
-
Will be useful for single-cell work
- Spec changes? No. Application level thing that interacts with
underlying arrays.
-
E.g. wasteful to materialize all the nulls/zeros
- Using numpy methods. One of them is the three 1-d arrays.
Transforming indices from higher-dimensional space. You can pretend it’s dense.
-
https://en.wikipedia.org/wiki/Sparse_matrix#Compressed_sparse_row_(CSR,_CRS_or_Yale_format)
- Got to this via an attempt to add a database backend. Want a
virtual layer on top of zarr.
- https://github.com/zarr-developers/zarr/issues/438
- https://github.com/daletovar/zsparse
-
NetZDF anything to test yet?
-
Ward: not yet. Will publicize that widely.
-
May be made such that it can be extracted.
-
Asked by community and leadership to be more vocal
- Heard from Director. “Please put up a blog post”
-
Lots of interest
-
Contact? See generic link in blog post. (netcdf-support or ward
directly. Or GH page. Upcoming zarr support issue from Ward.)
-
-
SciPy (John)
- Ward: Unidata (another Ryan May) will be there from Pangeo. One
of the big Zarr boosters.
- Logo to be spread:
https://github.com/zarr-developers/zarr/issues/300#issuecomment-427064151
- <img src=”Pictures/10000201000001D6000000F6A360F8EF8A47245F.png”
style=”width:1.4953in;height:0.7819in” />
- Start using; likely be another iteration though (post scipy)
- Ward: Unidata (another Ryan May) will be there from Pangeo. One