Simplify dots with 1 #638

ricardoV94 opened this issue Feb 8, 2024 · 3 comments · May be fixed by #810

Comments

@ricardoV94
Member

ricardoV94 commented Feb 8, 2024

Description

We have a local_0_dot_x rewrite that removes useless dots with zeroed inputs. We don't seem to have anything for dots with ones, as reported in #637 (comment)

import pytensor
import pytensor.tensor as pt
from pytensor.compile.mode import get_default_mode

x = pt.col('x')
f = x @ [[1.]]
with pytensor.config.change_flags(optimizer_verbose=True):
    fn = pytensor.function([x], f, mode=get_default_mode().excluding("BlasOpt"))

pytensor.dprint(fn)
dot [id A] 0
 ├─ x [id B]
 └─ [[1.]] [id C]

I excluded BlasOpt just to have a simpler graph; with those rewrites enabled the dot is still not rewritten away, it is just replaced by the more complex Blas Op.
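As a quick numpy sketch (not PyTensor) of why this dot is useless: for a column input, multiplying by the 1x1 ones matrix `[[1.]]` returns the input unchanged, so a rewrite could drop the dot entirely.

```python
import numpy as np

# For a column vector x of shape (n, 1), dotting with the 1x1 ones
# matrix [[1.]] is an identity operation.
x_val = np.array([[2.0], [3.0], [5.0]])
result = x_val @ np.array([[1.0]])
assert np.array_equal(result, x_val)  # the dot is a no-op here
```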

@register_canonicalize
@register_stabilize
@node_rewriter([Dot])
def local_0_dot_x(fgraph, node):
    if not isinstance(node.op, Dot):
        return False

    x = node.inputs[0]
    y = node.inputs[1]
    replace = False
    try:
        if get_underlying_scalar_constant_value(x, only_process_constants=True) == 0:
            replace = True
    except NotScalarConstantError:
        pass
    try:
        if get_underlying_scalar_constant_value(y, only_process_constants=True) == 0:
            replace = True
    except NotScalarConstantError:
        pass

    if replace:
        constant_zero = constant(0, dtype=node.outputs[0].type.dtype)
        if x.ndim == 2 and y.ndim == 2:
            constant_zero = assert_op(constant_zero, eq(x.shape[1], y.shape[0]))
            return [alloc(constant_zero, x.shape[0], y.shape[1])]
        elif x.ndim == 1 and y.ndim == 2:
            constant_zero = assert_op(constant_zero, eq(x.shape[0], y.shape[0]))
            return [alloc(constant_zero, y.shape[1])]
        elif x.ndim == 2 and y.ndim == 1:
            constant_zero = assert_op(constant_zero, eq(x.shape[1], y.shape[0]))
            return [alloc(constant_zero, x.shape[0])]
        elif x.ndim == 1 and y.ndim == 1:
            constant_zero = assert_op(constant_zero, eq(x.shape[0], y.shape[0]))
            return [constant_zero]
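For context, the rule that local_0_dot_x encodes is easy to state in numpy terms: a dot in which one operand is the constant 0 is just an all-zeros array with the dot's output shape (the `assert_op`/`eq` calls above guard the inner-dimension match at runtime).

```python
import numpy as np

# A dot with a zero operand is equivalent to allocating zeros with the
# output shape, no matmul needed.
x = np.arange(6.0).reshape(2, 3)
out = x @ np.zeros((3, 4))
assert np.array_equal(out, np.zeros((2, 4)))
```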

@Dhruvanshu-Joshi
Member

Looks like an interesting issue. We'd just have to replace the 0 check with a 1 check in local_0_dot_x, right?
Here's what I have in mind:

@register_canonicalize
@register_stabilize
@node_rewriter([Dot])
def local_1_dot_x(fgraph, node):
    if not isinstance(node.op, Dot):
        return False

    x = node.inputs[0]
    y = node.inputs[1]
    replace = False
    try:
        if get_underlying_scalar_constant_value(x, only_process_constants=True) == 1:
            replace = True
            var = y
    except NotScalarConstantError:
        pass

    try:
        if get_underlying_scalar_constant_value(y, only_process_constants=True) == 1:
            replace = True
            var = x
    except NotScalarConstantError:
        pass

    if replace:
        constant_value = constant(
            get_underlying_scalar_constant_value(var, only_process_constants=True),
            dtype=node.outputs[0].type.dtype,
        )
        if x.ndim == 2 and y.ndim == 2:
            constant_value = assert_op(constant_value, eq(x.shape[1], y.shape[0]))
            return [alloc(constant_value, x.shape[0], y.shape[1])]
        elif x.ndim == 1 and y.ndim == 2:
            constant_value = assert_op(constant_value, eq(x.shape[0], y.shape[0]))
            return [alloc(constant_value, y.shape[1])]
        elif x.ndim == 2 and y.ndim == 1:
            constant_value = assert_op(constant_value, eq(x.shape[1], y.shape[0]))
            return [alloc(constant_value, x.shape[0])]
        elif x.ndim == 1 and y.ndim == 1:
            constant_value = assert_op(constant_value, eq(x.shape[0], y.shape[0]))
            return [constant_value]

However, I think using the constant value might be wrong here. Would I have to substitute the entire var instead? If so, is this the correct way forward?

var = assert_op(var, eq(...))
alloc(var, shape)

@ricardoV94
Member Author

No, the rule is slightly different for ones: it amounts to summing the left matrix. You also have to reason about broadcasting.

I suggest playing with numpy to get a feel for what it should do.
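Here is one hedged numpy sketch of the rule being described (my reading of it, not the final rewrite): dotting with a matrix of ones sums the left operand's columns and then broadcasts that column of sums across the output columns.

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.normal(size=(4, 3))

# (A @ ones)[i, j] = sum_k A[i, k] for every j: a row sum broadcast
# across the output columns.
ones = np.ones((3, 2))
expected = np.broadcast_to(A.sum(axis=1, keepdims=True), (4, 2))
assert np.allclose(A @ ones, expected)

# 1-D case: dot with a vector of ones is a plain row sum.
assert np.allclose(A @ np.ones(3), A.sum(axis=1))
```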

@Dhruvanshu-Joshi
Member

Okay.
Just so I get it correctly: for a given graph, say

Sub [id A]
 ├─ dot [id B]
 │  ├─ dot [id C]
 │  │  ├─ Transpose{axes=[1, 0]} [id D] 'A.T'
 │  │  │  └─ A [id E]
 │  │  └─ Neg [id F]
 │  │     └─ x [id G]
 │  └─ [[1.]] [id H]
 └─ dot [id I]
    ├─ A [id E]
    └─ dot [id J]
       ├─ x [id G]
       └─ [[1.]] [id H]

we want the output of the rewrite to be:

Sub [id A]
 ├─ dot [id B]
 │  ├─ Transpose{axes=[1, 0]} [id C] 'A.T'
 │  │  └─ A [id D]
 │  └─ Neg [id E]
 │     └─ x [id F]
 └─ dot [id G]
    ├─ A [id D]
    └─ x [id F]

Is this correct? And if yes, how do summing the left matrix and broadcasting come into the picture here?

@Dhruvanshu-Joshi Dhruvanshu-Joshi linked a pull request Jun 7, 2024 that will close this issue