Skip to content

Commit 2e3939c

Browse files
committed
1. Changing comments in the arch files based on review feedback.
2. Adding a new test design that we can use in the vtr_reg_strong regression test. 3. Moved all tests from vtr_reg_strong to vtr_reg_nighty. The time taken by the tests in vtr_reg_strong was more than we want. 4. Updated the list of contributors to include Helen and Zach who worked on DLA and CLSTM designs in the past.
1 parent a6ff8c8 commit 2e3939c

19 files changed

+831
-166
lines changed

vtr_flow/arch/COFFE_22nm/k6FracN10LB_mem20K_complexDSP_customSB_22nm.clustered.denser.xml

Lines changed: 18 additions & 14 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,8 @@
11
<!--
2-
This is the architecture file for an Agilex-like Architecture, based off
3-
the Stratix-10-like Architecture discussed in [1].
2+
This is the architecture file for a modern Intel FPGA. The blocks (logic, RAM, DSP)
3+
are Agilex-like, but the routing architecture is similar to Stratix IV. It is based
4+
off the Stratix-10-like Architecture discussed in [1], the Agilex-like Architecture
5+
mentioned in [6] and Stratix-IV-like Architecture mentioned in [5].
46
57
The delays and areas of various components in this arch come from COFFE [2]
68
runs using a 22nm technology node [3].
@@ -132,23 +134,24 @@
132134
I/O pads are arranged along the perimeter of the FPGA. No area values provided for the I/Os.
133135
134136
##############################
135-
What makes this architecture Agilex-like?
137+
Comments on similarities and differences with Intel FPGA architecture.
136138
##############################
139+
The main parameters of the logic blocks, DSPs and RAMs are similar to Intel FPGAs. But here are
140+
some important points:
137141
1. The DSP slice supports lower precision modes - int8 (actually 9x9) and 16-bit floating point
138-
(IEEE half-precision and bfloat16).
142+
(IEEE half-precision and bfloat16). These modes are present in Intel Agilex FPGA DSPs.
139143
2. DSPs are chained in vertical direction (currently only chainin-chainout are supported;
140-
in the future, scanin-scanout connections will be added as well)
141-
3. Instead of having wires of just one length in the entire FPGA, there are wire segments of L=4
144+
in the future, scanin-scanout connections will be added as well). This is a common feature
145+
in modern FPGAs.
146+
3. There are no registers on the interconnect/routing wires in this architecture. That is a main
147+
feature in the Stratix10 and Agilex families of Intel FPGA (it's called HyperFlex by Intel).
148+
4. The architecture doesn't have sectors. All blocks are laid out in columns on the entire chip.
149+
Most modern Intel FPGAs have sector based layout.
150+
5. The IOs are on the perimeter, instead of being arranged in columns. Modern FPGAs arrange I/Os in
151+
columns.
152+
6. The routing architecture is similar to Stratix IV. There are wire segments of L=4
142153
and L=16. And a custom switch pattern (not a standard wilton switch) is used.
143154
144-
##############################
145-
What makes this architecture different from Agilex and Stratix10?
146-
##############################
147-
1. There are no registers on the interconnect/routing wires in this architecture. That is a main
148-
feature in the Stratix10 and Agilex families of Intel FPGA (it's called HyperFlex by Intel)
149-
2. The architecture doesn't have sectors. All blocks are laid out in columns on the entire chip.
150-
3. The IOs are on the perimter, instead of being arranged in columns.
151-
152155
[1] M. Eldafrawy, A. Boutros, S. Yazdanshenas, and V. Betz, "FPGA Logic Block Architectures for
153156
Efficient Deep Learning Inference" in ACM TRETS, 2020
154157
[2] S. Yazdanshenas, and V. Betz, "COFFE 2: Automatic Modelling and Optimization of
@@ -159,6 +162,7 @@
159162
performance from 180 nm to 7 nm" in Integration, the VLSI Journal (2017)
160163
[5] K. E. Murray et al., “Timing-Driven Titan: Enabling Large Benchmarks and Exploring the Gap between
161164
Academic and Commercial CAD,” TRETS 2015.
165+
[6] A. Arora et al., "Tensor Slices to the Rescue: Supercharging ML Acceleration on FPGAs", ISFPGA 2020.
162166
-->
163167

164168
<architecture>

vtr_flow/arch/COFFE_22nm/k6FracN10LB_mem20K_complexDSP_customSB_22nm.clustered.densest.xml

Lines changed: 18 additions & 14 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,8 @@
11
<!--
2-
This is the architecture file for an Agilex-like Architecture, based off
3-
the Stratix-10-like Architecture discussed in [1].
2+
This is the architecture file for a modern Intel FPGA. The blocks (logic, RAM, DSP)
3+
are Agilex-like, but the routing architecture is similar to Stratix IV. It is based
4+
off the Stratix-10-like Architecture discussed in [1], the Agilex-like Architecture
5+
mentioned in [6] and Stratix-IV-like Architecture mentioned in [5].
46
57
The delays and areas of various components in this arch come from COFFE [2]
68
runs using a 22nm technology node [3].
@@ -132,23 +134,24 @@
132134
I/O pads are arranged along the perimeter of the FPGA. No area values provided for the I/Os.
133135
134136
##############################
135-
What makes this architecture Agilex-like?
137+
Comments on similarities and differences with Intel FPGA architecture.
136138
##############################
139+
The main parameters of the logic blocks, DSPs and RAMs are similar to Intel FPGAs. But here are
140+
some important points:
137141
1. The DSP slice supports lower precision modes - int8 (actually 9x9) and 16-bit floating point
138-
(IEEE half-precision and bfloat16).
142+
(IEEE half-precision and bfloat16). These modes are present in Intel Agilex FPGA DSPs.
139143
2. DSPs are chained in vertical direction (currently only chainin-chainout are supported;
140-
in the future, scanin-scanout connections will be added as well)
141-
3. Instead of having wires of just one length in the entire FPGA, there are wire segments of L=4
144+
in the future, scanin-scanout connections will be added as well). This is a common feature
145+
in modern FPGAs.
146+
3. There are no registers on the interconnect/routing wires in this architecture. That is a main
147+
feature in the Stratix10 and Agilex families of Intel FPGA (it's called HyperFlex by Intel).
148+
4. The architecture doesn't have sectors. All blocks are laid out in columns on the entire chip.
149+
Most modern Intel FPGAs have sector based layout.
150+
5. The IOs are on the perimeter, instead of being arranged in columns. Modern FPGAs arrange I/Os in
151+
columns.
152+
6. The routing architecture is similar to Stratix IV. There are wire segments of L=4
142153
and L=16. And a custom switch pattern (not a standard wilton switch) is used.
143154
144-
##############################
145-
What makes this architecture different from Agilex and Stratix10?
146-
##############################
147-
1. There are no registers on the interconnect/routing wires in this architecture. That is a main
148-
feature in the Stratix10 and Agilex families of Intel FPGA (it's called HyperFlex by Intel)
149-
2. The architecture doesn't have sectors. All blocks are laid out in columns on the entire chip.
150-
3. The IOs are on the perimter, instead of being arranged in columns.
151-
152155
[1] M. Eldafrawy, A. Boutros, S. Yazdanshenas, and V. Betz, "FPGA Logic Block Architectures for
153156
Efficient Deep Learning Inference" in ACM TRETS, 2020
154157
[2] S. Yazdanshenas, and V. Betz, "COFFE 2: Automatic Modelling and Optimization of
@@ -159,6 +162,7 @@
159162
performance from 180 nm to 7 nm" in Integration, the VLSI Journal (2017)
160163
[5] K. E. Murray et al., “Timing-Driven Titan: Enabling Large Benchmarks and Exploring the Gap between
161164
Academic and Commercial CAD,” TRETS 2015.
165+
[6] A. Arora et al., "Tensor Slices to the Rescue: Supercharging ML Acceleration on FPGAs", ISFPGA 2020.
162166
-->
163167

164168
<architecture>

vtr_flow/arch/COFFE_22nm/k6FracN10LB_mem20K_complexDSP_customSB_22nm.clustered.xml

Lines changed: 18 additions & 14 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,8 @@
11
<!--
2-
This is the architecture file for an Agilex-like Architecture, based off
3-
the Stratix-10-like Architecture discussed in [1].
2+
This is the architecture file for a modern Intel FPGA. The blocks (logic, RAM, DSP)
3+
are Agilex-like, but the routing architecture is similar to Stratix IV. It is based
4+
off the Stratix-10-like Architecture discussed in [1], the Agilex-like Architecture
5+
mentioned in [6] and Stratix-IV-like Architecture mentioned in [5].
46
57
The delays and areas of various components in this arch come from COFFE [2]
68
runs using a 22nm technology node [3].
@@ -132,23 +134,24 @@
132134
I/O pads are arranged along the perimeter of the FPGA. No area values provided for the I/Os.
133135
134136
##############################
135-
What makes this architecture Agilex-like?
137+
Comments on similarities and differences with Intel FPGA architecture.
136138
##############################
139+
The main parameters of the logic blocks, DSPs and RAMs are similar to Intel FPGAs. But here are
140+
some important points:
137141
1. The DSP slice supports lower precision modes - int8 (actually 9x9) and 16-bit floating point
138-
(IEEE half-precision and bfloat16).
142+
(IEEE half-precision and bfloat16). These modes are present in Intel Agilex FPGA DSPs.
139143
2. DSPs are chained in vertical direction (currently only chainin-chainout are supported;
140-
in the future, scanin-scanout connections will be added as well)
141-
3. Instead of having wires of just one length in the entire FPGA, there are wire segments of L=4
144+
in the future, scanin-scanout connections will be added as well). This is a common feature
145+
in modern FPGAs.
146+
3. There are no registers on the interconnect/routing wires in this architecture. That is a main
147+
feature in the Stratix10 and Agilex families of Intel FPGA (it's called HyperFlex by Intel).
148+
4. The architecture doesn't have sectors. All blocks are laid out in columns on the entire chip.
149+
Most modern Intel FPGAs have sector based layout.
150+
5. The IOs are on the perimeter, instead of being arranged in columns. Modern FPGAs arrange I/Os in
151+
columns.
152+
6. The routing architecture is similar to Stratix IV. There are wire segments of L=4
142153
and L=16. And a custom switch pattern (not a standard wilton switch) is used.
143154
144-
##############################
145-
What makes this architecture different from Agilex and Stratix10?
146-
##############################
147-
1. There are no registers on the interconnect/routing wires in this architecture. That is a main
148-
feature in the Stratix10 and Agilex families of Intel FPGA (it's called HyperFlex by Intel)
149-
2. The architecture doesn't have sectors. All blocks are laid out in columns on the entire chip.
150-
3. The IOs are on the perimter, instead of being arranged in columns.
151-
152155
[1] M. Eldafrawy, A. Boutros, S. Yazdanshenas, and V. Betz, "FPGA Logic Block Architectures for
153156
Efficient Deep Learning Inference" in ACM TRETS, 2020
154157
[2] S. Yazdanshenas, and V. Betz, "COFFE 2: Automatic Modelling and Optimization of
@@ -159,6 +162,7 @@
159162
performance from 180 nm to 7 nm" in Integration, the VLSI Journal (2017)
160163
[5] K. E. Murray et al., “Timing-Driven Titan: Enabling Large Benchmarks and Exploring the Gap between
161164
Academic and Commercial CAD,” TRETS 2015.
165+
[6] A. Arora et al., "Tensor Slices to the Rescue: Supercharging ML Acceleration on FPGAs", ISFPGA 2020.
162166
-->
163167

164168
<architecture>

vtr_flow/arch/COFFE_22nm/k6FracN10LB_mem20K_complexDSP_customSB_22nm.coupled.denser.xml

Lines changed: 18 additions & 14 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,8 @@
11
<!--
2-
This is the architecture file for an Agilex-like Architecture, based off
3-
the Stratix-10-like Architecture discussed in [1].
2+
This is the architecture file for a modern Intel FPGA. The blocks (logic, RAM, DSP)
3+
are Agilex-like, but the routing architecture is similar to Stratix IV. It is based
4+
off the Stratix-10-like Architecture discussed in [1], the Agilex-like Architecture
5+
mentioned in [6] and Stratix-IV-like Architecture mentioned in [5].
46
57
The delays and areas of various components in this arch come from COFFE [2]
68
runs using a 22nm technology node [3].
@@ -132,23 +134,24 @@
132134
I/O pads are arranged along the perimeter of the FPGA. No area values provided for the I/Os.
133135
134136
##############################
135-
What makes this architecture Agilex-like?
137+
Comments on similarities and differences with Intel FPGA architecture.
136138
##############################
139+
The main parameters of the logic blocks, DSPs and RAMs are similar to Intel FPGAs. But here are
140+
some important points:
137141
1. The DSP slice supports lower precision modes - int8 (actually 9x9) and 16-bit floating point
138-
(IEEE half-precision and bfloat16).
142+
(IEEE half-precision and bfloat16). These modes are present in Intel Agilex FPGA DSPs.
139143
2. DSPs are chained in vertical direction (currently only chainin-chainout are supported;
140-
in the future, scanin-scanout connections will be added as well)
141-
3. Instead of having wires of just one length in the entire FPGA, there are wire segments of L=4
144+
in the future, scanin-scanout connections will be added as well). This is a common feature
145+
in modern FPGAs.
146+
3. There are no registers on the interconnect/routing wires in this architecture. That is a main
147+
feature in the Stratix10 and Agilex families of Intel FPGA (it's called HyperFlex by Intel).
148+
4. The architecture doesn't have sectors. All blocks are laid out in columns on the entire chip.
149+
Most modern Intel FPGAs have sector based layout.
150+
5. The IOs are on the perimeter, instead of being arranged in columns. Modern FPGAs arrange I/Os in
151+
columns.
152+
6. The routing architecture is similar to Stratix IV. There are wire segments of L=4
142153
and L=16. And a custom switch pattern (not a standard wilton switch) is used.
143154
144-
##############################
145-
What makes this architecture different from Agilex and Stratix10?
146-
##############################
147-
1. There are no registers on the interconnect/routing wires in this architecture. That is a main
148-
feature in the Stratix10 and Agilex families of Intel FPGA (it's called HyperFlex by Intel)
149-
2. The architecture doesn't have sectors. All blocks are laid out in columns on the entire chip.
150-
3. The IOs are on the perimter, instead of being arranged in columns.
151-
152155
[1] M. Eldafrawy, A. Boutros, S. Yazdanshenas, and V. Betz, "FPGA Logic Block Architectures for
153156
Efficient Deep Learning Inference" in ACM TRETS, 2020
154157
[2] S. Yazdanshenas, and V. Betz, "COFFE 2: Automatic Modelling and Optimization of
@@ -159,6 +162,7 @@
159162
performance from 180 nm to 7 nm" in Integration, the VLSI Journal (2017)
160163
[5] K. E. Murray et al., “Timing-Driven Titan: Enabling Large Benchmarks and Exploring the Gap between
161164
Academic and Commercial CAD,” TRETS 2015.
165+
[6] A. Arora et al., "Tensor Slices to the Rescue: Supercharging ML Acceleration on FPGAs", ISFPGA 2020.
162166
-->
163167

164168
<architecture>

vtr_flow/arch/COFFE_22nm/k6FracN10LB_mem20K_complexDSP_customSB_22nm.coupled.densest.xml

Lines changed: 18 additions & 14 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,8 @@
11
<!--
2-
This is the architecture file for an Agilex-like Architecture, based off
3-
the Stratix-10-like Architecture discussed in [1].
2+
This is the architecture file for a modern Intel FPGA. The blocks (logic, RAM, DSP)
3+
are Agilex-like, but the routing architecture is similar to Stratix IV. It is based
4+
off the Stratix-10-like Architecture discussed in [1], the Agilex-like Architecture
5+
mentioned in [6] and Stratix-IV-like Architecture mentioned in [5].
46
57
The delays and areas of various components in this arch come from COFFE [2]
68
runs using a 22nm technology node [3].
@@ -132,23 +134,24 @@
132134
I/O pads are arranged along the perimeter of the FPGA. No area values provided for the I/Os.
133135
134136
##############################
135-
What makes this architecture Agilex-like?
137+
Comments on similarities and differences with Intel FPGA architecture.
136138
##############################
139+
The main parameters of the logic blocks, DSPs and RAMs are similar to Intel FPGAs. But here are
140+
some important points:
137141
1. The DSP slice supports lower precision modes - int8 (actually 9x9) and 16-bit floating point
138-
(IEEE half-precision and bfloat16).
142+
(IEEE half-precision and bfloat16). These modes are present in Intel Agilex FPGA DSPs.
139143
2. DSPs are chained in vertical direction (currently only chainin-chainout are supported;
140-
in the future, scanin-scanout connections will be added as well)
141-
3. Instead of having wires of just one length in the entire FPGA, there are wire segments of L=4
144+
in the future, scanin-scanout connections will be added as well). This is a common feature
145+
in modern FPGAs.
146+
3. There are no registers on the interconnect/routing wires in this architecture. That is a main
147+
feature in the Stratix10 and Agilex families of Intel FPGA (it's called HyperFlex by Intel).
148+
4. The architecture doesn't have sectors. All blocks are laid out in columns on the entire chip.
149+
Most modern Intel FPGAs have sector based layout.
150+
5. The IOs are on the perimeter, instead of being arranged in columns. Modern FPGAs arrange I/Os in
151+
columns.
152+
6. The routing architecture is similar to Stratix IV. There are wire segments of L=4
142153
and L=16. And a custom switch pattern (not a standard wilton switch) is used.
143154
144-
##############################
145-
What makes this architecture different from Agilex and Stratix10?
146-
##############################
147-
1. There are no registers on the interconnect/routing wires in this architecture. That is a main
148-
feature in the Stratix10 and Agilex families of Intel FPGA (it's called HyperFlex by Intel)
149-
2. The architecture doesn't have sectors. All blocks are laid out in columns on the entire chip.
150-
3. The IOs are on the perimter, instead of being arranged in columns.
151-
152155
[1] M. Eldafrawy, A. Boutros, S. Yazdanshenas, and V. Betz, "FPGA Logic Block Architectures for
153156
Efficient Deep Learning Inference" in ACM TRETS, 2020
154157
[2] S. Yazdanshenas, and V. Betz, "COFFE 2: Automatic Modelling and Optimization of
@@ -159,6 +162,7 @@
159162
performance from 180 nm to 7 nm" in Integration, the VLSI Journal (2017)
160163
[5] K. E. Murray et al., “Timing-Driven Titan: Enabling Large Benchmarks and Exploring the Gap between
161164
Academic and Commercial CAD,” TRETS 2015.
165+
[6] A. Arora et al., "Tensor Slices to the Rescue: Supercharging ML Acceleration on FPGAs", ISFPGA 2020.
162166
-->
163167

164168
<architecture>

0 commit comments

Comments
 (0)