Adding a new transposed convolution function to lax #5772

Open · yang-song wants to merge 1 commit into main from patch-1

Conversation

yang-song

This PR implements lax.gradient_based_conv_transpose. Compared to conv_transpose, it adds support for output_shape and output_padding, matching the APIs for transposed convolutions derived from the gradient of a forward convolution, which is the convention in other deep learning frameworks such as TensorFlow, PyTorch, and Keras. This additional function makes it much easier to reproduce code written in those (currently more popular) frameworks.
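To make the need for output_shape / output_padding concrete: a strided forward convolution maps several input sizes to the same output size, so its transpose is ambiguous without extra information. A small arithmetic sketch in plain Python, using the standard gradient-based output-size formula (as documented, e.g., for torch.nn.ConvTranspose2d; dilation omitted for brevity):

```python
# out = (in - 1) * stride - 2 * padding + kernel + output_padding
def transpose_output_size(in_size, kernel, stride, padding, output_padding=0):
    return (in_size - 1) * stride - 2 * padding + kernel + output_padding

# A forward conv with kernel 3, stride 2, padding 1 maps inputs of size
# 31 AND 32 to an output of size 16, so transposing a size-16 input is
# ambiguous; output_padding (or an explicit output_shape) disambiguates:
print(transpose_output_size(16, kernel=3, stride=2, padding=1))                    # 31
print(transpose_output_size(16, kernel=3, stride=2, padding=1, output_padding=1))  # 32
```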

@google-cla google-cla bot added the cla: yes label Feb 18, 2021
Member

@froystig froystig left a comment

Thank you!

The changes to lax.py include many file-wide formatting adjustments. Could you undo those? We probably don't want to take them at the moment, and they obscure the main change in the diff. You'll also want to squash commits so that there isn't one commit that makes formatting changes followed by another that undoes them.

@froystig froystig requested a review from hawkinsp February 18, 2021 23:49
@froystig
Member

I'm curious what @hawkinsp and @mattjj think about this change overall, including whether it is best to add it in lax, and for any review comments as well.

@yang-song yang-song force-pushed the patch-1 branch 5 times, most recently from 89dc631 to 6584a54 on February 19, 2021 01:15
@yang-song
Author

> The changes to lax.py include many file-wide formatting adjustments. Could you undo those? We probably don't want to take them at the moment, and they obscure the main change in the diff. You'll also want to squash commits so that there isn't one commit that makes formatting changes followed by another that undoes them.

Just removed the formatting changes (they were made automatically by my IDE).

@schrute99

Are there any updates on this? It would be awesome to have this function.

@yang-song
Author

Pending review from @froystig and @hawkinsp. I think the requested changes have been made.

@hawkinsp
Collaborator

I'm not an expert on this, but I'm wondering what the pros and cons are of introducing a new API endpoint vs adding features like output_shape to the existing conv_transpose. What do you think? Is there a reason we need a new function? Is it conceptually different in some important way?

@yang-song
Author

Because the meaning of padding in conv_transpose is different from that of padding in gradient_based_conv_transpose. If we merged the APIs, we would either break all existing code using conv_transpose (if adopting the padding semantics of gradient_based_conv_transpose) or fail to match the APIs of other frameworks.
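For readers unfamiliar with the distinction: under the gradient-based convention, the transposed convolution is literally the VJP of a forward convolution, so padding names the forward conv's padding rather than padding applied directly to the (fractionally strided) transpose input. A minimal sketch of that convention via jax.vjp; gradient_based_transpose and forward_input_shape are names invented for this illustration, with the latter playing the role of the output_shape argument this PR adds:

```python
import jax
import jax.numpy as jnp
from jax import lax

def gradient_based_transpose(x, w, strides, padding, forward_input_shape):
    # The forward convolution whose input-gradient defines the transpose;
    # `padding` describes THIS conv, per the gradient-based convention.
    fwd = lambda z: lax.conv_general_dilated(
        z, w, window_strides=strides, padding=padding,
        dimension_numbers=('NHWC', 'HWIO', 'NHWC'))
    _, vjp = jax.vjp(fwd, jnp.zeros(forward_input_shape))
    return vjp(x)[0]

x = jnp.ones((1, 8, 8, 4))     # input to the transpose (the forward conv's output shape)
w = jnp.ones((3, 3, 4, 4))     # HWIO kernel
y = gradient_based_transpose(x, w, (2, 2), 'SAME', (1, 16, 16, 4))
print(y.shape)                 # (1, 16, 16, 4)
```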

@hawkinsp hawkinsp requested a review from levskaya March 25, 2021 21:21
@levskaya
Collaborator

levskaya commented Apr 2, 2021

Sorry, I wrote the original at a time when no frameworks really existed in JAX. (Nowadays, I'd probably not even add this function to "lax", since it's strictly a specialization of general convolutions, and would delegate these matters to NN frameworks.)

Aside from a pending review of correctness, this mainly comes down to a question of organization:

  1. Keep our old conv_transpose and delegate these specialized "conv templates" to frameworks.
  2. Axe the old conv_transpose and use this new one to match other frameworks... but we probably have users of the old one that prevent that.
  3. Fold the alternative behavior into the existing function under an optional flag.
  4. In retrospect, I wish I had called the existing one "fractionally strided convolution", since that's more accurate. Maybe that's what we should do: rename the old one and add the new one if it matches what most people expect of "conv transpose"? This comes at the expense of polluting the lax namespace a bit.

@schrute99

The issue with option 1 might be that, at least for Flax and Objax, all the convolution modules are just wrappers of the functions in jax.lax. Also, if every JAX framework implements its own transposed convolution, they might not be consistent.

@codeboy5

Hey, is this issue being actively worked on?

younesbelkada added a commit to younesbelkada/jax that referenced this pull request Jun 25, 2022
@younesbelkada

Hi all! Is there a plan to merge this PR? It seems to be the root cause of issues when converting some PyTorch models to JAX/Flax; it would be nice if we could merge it ;)

@ericd-1qbit

The Flax docs sent me here, and I'd like to add a +1 in the hope that this PR will be merged soon :)

@leiteg

leiteg commented Jun 21, 2023

Also came here from the Flax docs. Is there any other way to use transposed convolutions that are compatible with PyTorch's nn.ConvTranspose2d?
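One possible route, sketched here under stated assumptions: lax.conv_transpose accepts transpose_kernel=True, which flips the kernel spatially and swaps its channel axes to match the gradient-derived convention, and explicit padding pairs can be chosen to mimic PyTorch's padding and output_padding. The helper below is a hypothetical, untested mapping for the square-kernel, dilation-1, groups-1 case (conversion of PyTorch's weight layout to HWIO not shown):

```python
import jax.numpy as jnp
from jax import lax

# Hypothetical helper: map nn.ConvTranspose2d-style hyperparameters onto
# lax.conv_transpose. k: kernel size, s: stride, p: padding, op: output_padding.
def torch_style_conv_transpose(x, w, k, s, p, op=0):
    # PyTorch's forward-conv padding p corresponds to explicit padding of
    # (k - 1 - p) around the fractionally strided input, plus op at the end.
    pads = [(k - 1 - p, k - 1 - p + op)] * 2
    return lax.conv_transpose(
        x, w, strides=(s, s), padding=pads,
        transpose_kernel=True,   # use the gradient-derived kernel convention
        dimension_numbers=('NHWC', 'HWIO', 'NHWC'))

x = jnp.ones((1, 8, 8, 4))
w = jnp.ones((3, 3, 4, 4))
y = torch_style_conv_transpose(x, w, k=3, s=2, p=1, op=1)
print(y.shape)   # (1, 16, 16, 4): matches (in - 1) * s - 2 * p + k + op
```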
