
Conversation

@pinzhenx
Contributor

No description provided.

@pinzhenx pinzhenx marked this pull request as draft May 28, 2020 04:47
@pinzhenx pinzhenx force-pushed the prepack branch 3 times, most recently from f193736 to 379ddc1 Compare May 28, 2020 08:39
@jgong5

jgong5 commented May 28, 2020

I have a concern, related but not specific to this PR, about in-place changes to input data/weight tensors. PyTorch originally assumes input tensors are constant, so re-entrance is safe, e.g. when a model is called by multiple threads from JIT with fork and join. In-place changes to these input tensors might break this assumption. Do we have a design to protect these in-place changes from multi-threaded access?
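
For illustration, a minimal sketch of the fork/join scenario (not code from this PR): the same traced module is run concurrently from two JIT tasks, so both tasks share one weight tensor. If the first forward pass repacked that weight in place, the two tasks could race on the mutation.

import torch

# Both forked tasks execute the same traced module and therefore share its
# weight tensor; an in-place repack on first use would be a data race here.
conv = torch.nn.Conv2d(3, 8, kernel_size=3)
traced = torch.jit.trace(conv, torch.randn(1, 3, 32, 32))

x, y = torch.randn(1, 3, 32, 32), torch.randn(1, 3, 32, 32)
fut1 = torch.jit.fork(traced, x)
fut2 = torch.jit.fork(traced, y)
out1, out2 = torch.jit.wait(fut1), torch.jit.wait(fut2)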

@pinzhenx
Contributor Author

As of right now, no. Modifying constant tensors on the fly, especially constant parameters, does have some implications we need to take care of.

@pinzhenx
Contributor Author

pinzhenx commented May 28, 2020

BTW, we are also implementing a standalone prepack JIT pass. This PR is more of a workaround to optimize the imperative path, so that we can achieve the same performance as the to_mkldnn(model) approach.

We may revert this design if customers are more inclined to use JIT, or if something goes wrong with it.
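
For reference, a minimal sketch of the to_mkldnn(model) approach mentioned above, using the stock torch.utils.mkldnn utility: supported modules get their weights converted (and prepacked) ahead of time, and inputs are converted to the MKL-DNN layout explicitly.

import torch
import torch.utils.mkldnn as mkldnn_utils

# Convert supported modules (e.g. Conv2d) to their MKL-DNN counterparts with
# prepacked weights, then feed MKL-DNN tensors through the model.
model = torch.nn.Sequential(
    torch.nn.Conv2d(3, 16, kernel_size=3),
    torch.nn.ReLU(),
).eval()
mkldnn_model = mkldnn_utils.to_mkldnn(model)

x = torch.randn(1, 3, 32, 32)
with torch.no_grad():
    y = mkldnn_model(x.to_mkldnn()).to_dense()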

@jgong5

jgong5 commented May 28, 2020

Use cases for your consideration:

  1. Shared weights by multiple in-process inference models, in both imperative and JIT mode.
  2. Shared data by multiple ops.

For 1), pre-packing during model initialization should work fine, but pre-packing on the first model run needs extra care for re-entrance (a sketch of this case follows below).
For 2), there won't be re-entrance in the imperative path since there is just one model, but there may be a problem if some downstream ops need the blocked layout while others need the plain layout. We also need to take care of re-entrance in JIT mode.
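
As a minimal sketch of case 1 (names and shapes are illustrative, not from this PR): two in-process model instances share one weight Parameter and are run from separate threads, so a lazy in-place repack on either model's first run would be visible to, and could race with, the other.

import threading
import torch

# Two models share the same weight Parameter. Prepacking during initialization
# touches the shared weight once, up front; prepacking lazily on the first run
# would instead mutate it while the other thread may be reading it.
shared_weight = torch.nn.Parameter(torch.randn(16, 3, 3, 3))
model_a = torch.nn.Conv2d(3, 16, kernel_size=3)
model_b = torch.nn.Conv2d(3, 16, kernel_size=3)
model_a.weight = shared_weight
model_b.weight = shared_weight

def infer(model, x):
    with torch.no_grad():
        return model(x)

threads = [threading.Thread(target=infer, args=(m, torch.randn(1, 3, 32, 32)))
           for m in (model_a, model_b)]
for t in threads:
    t.start()
for t in threads:
    t.join()
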
@EikanWang
Contributor

Thanks, Jiong. I discussed this with Pinzhen; for case 1, we can capture module.to and then prepack the weight tensors.
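
A minimal sketch of that idea (not the PR's actual code; _prepack_conv_weight below is a hypothetical stand-in for the real reorder routine): wrap nn.Module.to so conv weights are prepacked at one well-defined point, right after the model moves to the extension's 'dpcpp' device, instead of lazily on the first forward call.

import torch

orig_module_to = torch.nn.Module.to

def _prepack_conv_weight(conv):
    # Hypothetical stand-in for the real reorder-to-blocked-layout routine.
    pass

def patched_to(self, *args, **kwargs):
    m = orig_module_to(self, *args, **kwargs)
    device = torch._C._nn._parse_to(*args, **kwargs)[0]
    # Prepack once, at module.to time, rather than on the first forward call.
    if device is not None and device.type == 'dpcpp':
        for mod in m.modules():
            if isinstance(mod, torch.nn.Conv2d):
                _prepack_conv_weight(mod)
    return m

torch.nn.Module.to = patched_to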

@pinzhenx pinzhenx marked this pull request as ready for review May 29, 2020 16:52
@pinzhenx pinzhenx force-pushed the prepack branch 3 times, most recently from a95469e to bf3a18e Compare May 29, 2020 17:00
@pinzhenx
Contributor Author

pinzhenx commented May 29, 2020

We currently prepack conv weights in module.to.

Pros:

  • thread-safe
  • more explicit, compared to implicitly prepacking at runtime

Cons:

  • no input info, meaning the queried format might not be optimal. This is not an issue for conv, but it could be a problem for linear weights.
  • unable to re-pack conv weights if they have been reordered back to plain layout (maybe not even a con)
@jgong5

jgong5 commented May 30, 2020

no input info, meaning the queried format might not be optimal. This is not an issue for conv, but it could be a problem for linear weights.

It will be a problem for conv too if we use Winograd. Just FYI.

m = orig_module_to(self, *args, **kwargs)

device = torch._C._nn._parse_to(*args, **kwargs)[0]
if device and device.type == 'dpcpp':
Contributor

We need to check auto_dnnl here. If the user disables auto_dnnl, it should go through the original path.

@EikanWang EikanWang merged commit 301bd87 into intel:master Jun 1, 2020
@gzmkl gzmkl mentioned this pull request Feb 1, 2022
