@@ -26,7 +26,7 @@ If the code that is not symbolically traceable does not need to be quantized, we
26
26
to run FX Graph Mode Quantization:
27
27
28
28
29
- a. Symbolically trace only the code that needs to be quantized
29
+ Symbolically trace only the code that needs to be quantized
30
30
-----------------------------------------------------------------
31
31
When the whole model is not symbolically traceable but the submodule we want to quantize is
32
32
symbolically traceable, we can run quantization only on that submodule.
@@ -71,7 +71,7 @@ Note if original model needs to be preserved, you will have to
71
71
copy it yourself before calling the quantization APIs.
72
72
73
73
74
- b. Skip symbolically trace the non-traceable code
74
+ Skip symbolically trace the non-traceable code
75
75
---------------------------------------------------
76
76
When we have some non-traceable code in the module, and this part of code doesn’t need to be quantized,
77
77
we can factor out this part of the code into a submodule and skip symbolically trace that submodule.
@@ -135,7 +135,7 @@ quantization code:
135
135
136
136
If the code that is not symbolically traceable needs to be quantized, we have the following two options:
137
137
138
- a. Refactor your code to make it symbolically traceable
138
+ Refactor your code to make it symbolically traceable
139
139
--------------------------------------------------------
140
140
If it is easy to refactor the code and make the code symbolically traceable,
141
141
we can refactor the code and remove the use of non-traceable constructs in python.
@@ -174,7 +174,7 @@ depends on the model.
174
174
175
175
176
176
177
- b. Write your own observed and quantized submodule
177
+ Write your own observed and quantized submodule
178
178
-----------------------------------------------------
179
179
180
180
If the non-traceable code can’t be refactored to be symbolically traceable,
0 commit comments