Curious if anyone has used jax.experimental.sparse when passing data to a numpyro model? Has anyone had success from a memory and speed perspective when using the BCOO matrices and sparsifying all of the jnp functions?
I see. This is helpful, but it doesn't seem to use JAX, and I still want to take advantage of JAX compilation. I was curious about the JAX sparse implementation and whether anyone recommends it.
The sample method does not leverage the sparsity (though I think it should be doable), but the log prob involves sparse matmul operations. The distribution should also be jit-compilable.
so you’re saying i can jit compile with scipy sparse matrices?
My experience is limited. I used JAX sparse on some graph neural network work. It worked fine IIRC.
If you look at the implementation, you will see how the scipy matrix is converted into a JAX sparse matrix. The rest of the implementation should then be jit-compilable.
hmm, I'm not seeing that in the code. I see this:
# TODO: look into future jax sparse csr functionality and other developments
self.adj_matrix = _to_sparse(adj_matrix)
It is something like
adj_matrix = BCOO.from_scipy_sparse(adj_matrix)
... adj_matrix @ phi[..., jnp.newaxis]
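For reference, here is a minimal self-contained sketch of that pattern; the matrix data and the function name `apply_adj` are illustrative, not from the library:

```python
import jax
import jax.numpy as jnp
import scipy.sparse
from jax.experimental.sparse import BCOO

# A small scipy sparse adjacency matrix (illustrative data).
adj_scipy = scipy.sparse.csr_matrix(
    [[0.0, 1.0, 0.0],
     [1.0, 0.0, 1.0],
     [0.0, 1.0, 0.0]]
)

# Convert once to JAX's batched-COO format.
adj_matrix = BCOO.from_scipy_sparse(adj_scipy)

@jax.jit
def apply_adj(phi):
    # Sparse-dense matmul; BCOO supports the @ operator under jit.
    return (adj_matrix @ phi[..., jnp.newaxis]).squeeze(-1)

phi = jnp.array([1.0, 2.0, 3.0])
print(apply_adj(phi))  # dense result of adj @ phi
```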
There are two scenarios depending on your application:
- if your sparse matrix is the input of your jitted program, you will need to convert to jax sparse outside your program
- if your sparse matrix is a global constant, you can convert to jax sparse inside your program
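A sketch of the two scenarios, assuming a toy matrix and illustrative function names:

```python
import jax
import jax.numpy as jnp
import scipy.sparse
from jax.experimental.sparse import BCOO

sp = scipy.sparse.eye(3, format="csr") * 2.0

# Scenario 1: the sparse matrix is an input to the jitted program.
# Convert to BCOO outside, then pass it in; jit traces it like any pytree.
@jax.jit
def matvec(mat, x):
    return mat @ x

mat_bcoo = BCOO.from_scipy_sparse(sp)
print(matvec(mat_bcoo, jnp.ones(3)))

# Scenario 2: the sparse matrix is a global constant.
# The conversion can then happen inside the jitted function, since the
# scipy matrix is concrete at trace time and gets baked in as a constant.
@jax.jit
def matvec_const(x):
    mat = BCOO.from_scipy_sparse(sp)
    return mat @ x

print(matvec_const(jnp.ones(3)))
```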
thanks so much for the help. i'm doing the first scenario, so i also need to sparsify all of the jnp functions in my model, correct? seems annoying, but perhaps it's worth it
that sounds right to me.
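For the record, `jax.experimental.sparse.sparsify` can lift an existing jnp-based function to accept BCOO inputs for the primitives it supports, which may reduce how much manual rewriting is needed. A minimal sketch (the function names `model_fn`/`sparse_fn` are illustrative):

```python
import jax
import jax.numpy as jnp
from jax.experimental import sparse

# A plain dense function written with ordinary jnp ops.
def model_fn(mat, x):
    return jnp.sum(mat @ x)

# sparsify transforms it so supported ops also accept BCOO arrays.
sparse_fn = jax.jit(sparse.sparsify(model_fn))

mat = sparse.BCOO.fromdense(jnp.eye(3) * 3.0)
print(sparse_fn(mat, jnp.ones(3)))
```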