Nov 7 – 10, 2023
SLAC
America/Los_Angeles timezone

Exploration of Resource-efficient Implementations of ML Models Targeting eFPGAs

Nov 8, 2023, 1:50 PM
15m
53/1-1350-A - Trinity-A (SLAC)

53/1-1350-A - Trinity-A

SLAC

65
Oral RDC4: Readout and ASICs RDC4

Speaker

Jyothisraj Johnson (Lawrence Berkeley National Laboratory (LBNL))

Description

Implementing machine learning (ML) models in hardware has received considerable interest over the last several years from the physics community. The Python packages, hls4ml and conifer, has enabled porting models trained using Python ML libraries to register transfer level (RTL) code. Most of the attention, thus far, has been focused on porting ML models to commercial FPGAs or synthesized blocks on ASICs. With the latter, a (physical) area-optimized implementation of a ML model can be integrated on-chip. The reduction in area generally results in reduced costs for chip fabrication. The usual trade-off with an ASIC implementation is the inability to update model architecture post-synthesis. However, updating of biases/weights has been demonstrated at an additional area cost with techniques such as distributed I2C networks. Regardless, recent developments in open-source embedded FPGA (eFPGA) frameworks now provide an alternate and more flexible pathway for implementing ML models in hardware: customized eFPGA fabrics, which can also be integrated as part of an overall chip design. In general, the decision between an ASIC or eFPGA ML implementation will depend on the target application. We explored the design parameter space for eFPGA implementations of fully connected neural network (fc-NN) and boosted decision tree (BDT) models using the classification task of neutron/gamma identification, with a specific focus on resource efficiency. We used training data from an AmBe sealed source incident on a plastic scintillator read out by SiPMs. We studied relevant input features, the required bit-resolution, sampling rate and trade-offs in hyperparameters for both ML models while tracking resource usage and neutron efficiency at a gamma leakage of 10^-3. The results of the study will be used in the specification of an eFPGA fabric, which will be integrated as part of a 130 nm test chip next year.

Early Career Yes

Primary author

Jyothisraj Johnson (Lawrence Berkeley National Laboratory (LBNL))

Co-authors

Dr Billy Boxer (UC Davis) Dr Carl Grace (LBNL) Dr Tarun Prakash (LBNL)

Presentation materials