从Python循环嵌套的FPGA应用专用加速器自动生成
Automatic Generation Of Application-Specific Accelerators for FPGAs from Python Loop Nests
关键词:重排序;Python循环巢;优化循环体
摘 要:We present Three Fingered Jack, a highly productive approach to mapping vectorizable applications to the FPGA. Our system applies traditional dependence analysis and reordering transformations to a restricted set of Python loop nests. It does this to uncover parallelism and divide computation between multiple parallel processing elements (PEs) that are automatically generated through high-level synthesis of the optimized loop body.