Accepted Papers | ICCAD 2024

We are thrilled to announce the ICCAD 2024 accepted paper listing!

The formal notification of acceptance will be sent on June 30th as planned.

ID	Title
510	Quantum State Preparation Circuit Optimization Exploiting Don't Cares
514	A Neural-Ordinary-Differential-Equations Based Generic Approach for Process Modeling in DTCO: A Case Study in Chemical-Mechanical Planarization and Copper Plating
520	ATPlace2.5D: Analytical Thermal-Aware Chiplet Placement Framework for Large-Scale 2.5D-IC
522	RapidIR: A Practical Infrastructure for FPGA High-Level Physical Synthesis
523	HeLEM-GR: Heterogeneous Global Routing with Linearized Exponential Multiplier Method
526	HeteroExcept: A CPU-GPU Heterogeneous Algorithm to Accelerate Exception-aware Static Timing Analysis
527	OSCA: End-to-end Serial Stochastic Computing Neural Acceleration with Fine-grained Scaling and Piecewise Activation
529	ZnH2: Augmenting ZNS-based Storage System with Host-managed Heterogeneous Zones
546	Hybrid Power Failure Recovery for Intermittent Computing
550	AyE-Edge: Automated Deployment Space Search Empowering Accuracy yet Efficient Real-Time Object Detection on the Edge
557	One-for-All: An Unified Learning-based Framework for Efficient Cross-Corner Timing Signoff
568	Automatic Generation of Timing Models from RTL for Hardware Accelerators
571	HybriDIFT: Scalable Memory-Aware Dynamic Information Flow Tracking for Hardware
575	ALISE: Accelerating Large Language Model Serving with Speculative Scheduling
594	AESHA: Accelerating Eigen-decomposition-based Sparse Transformer with Hybrid RRAM-SRAM Architecture
596	PrivQuant: Communication-Efficient Private Inference with Quantized Network/Protocol Co-Optimization
598	KirchhoffNet: A Scalable Ultra Fast Analog Neural Network
599	FSMM: An Efficient Matrix Multiplication Accelerator Supporting Flexible Sparsity
602	Joint Placement Optimization for Hierarchical Analog/Mixed-Signal Circuits
609	SeGen: Automatic Topology Generator for Sequencing Elements
614	NAND-Tree: A 3D NAND Flash Based Processing In Memory Accelerator for Tree-Based Models on Large-Scale Tabular Data
615	Edge-BiT: Software-Hardware Co-design for Optimizing Binarized Transformer Networks Inference on Edge FPGA
630	CircuitSeer: RTL Post-PnR Delay Prediction via Coupling Functional and Structural Representation
632	The Power of Graph Signal Processing for Chip Placement Acceleration
650	Fusion of Global Placement and Gate Sizing with Differentiable Optimization
652	R-HLS: An IR for Dynamic High-Level Synthesis and Memory Disambiguation based on Regions and State Edges
656	Beyond the Yield Barrier: Variational Importance Sampling Yield Analysis
658	Residual-INR: Communication Efficient On-Device Learning Using Implicit Neural Representation
662	Robust Implementation of Retrieval-Augmented Generation on Edge-based Computing-in-Memory Architectures
668	GL0AM: GPU Logic Simulation Using 0-Delay and Re-simulation Acceleration Method
670	Physically Aware Synthesis Revisited: Guiding Technology Mapping with Primitive Logic Gate Placement
679	ALISA: An Adaptive Learned Index Structure for Spatial Data on Solid-State Drives
682	On Reducing the Execution Latency of Superconducting Quantum Processors via Quantum Job Scheduling
685	Natural language is not enough: Benchmarking multi-modal generative AI for Verilog generation
687	EasyPart: An Effective and Comprehensive Hypergraph Partitioner for FPGA-based Emulation
693	HG-PIPE: Vision Transformer Acceleration with Hybrid-Grained Pipeline
699	Multi-Objective Software-Hardware Co-Optimization for HD-PIM via Noise-Aware Bayesian Optimization
702	GAT-Steiner: Rectilinear Steiner Minimal Tree Prediction Using GNNs
705	Bayesian-Informed Hyperdimensional Learning for Intelligent and Efficient Data Processing
707	On the Security Vulnerabilities of MRAM-based In-Memory Computing Architectures against Model Extraction Attacks
715	Word-Level Augmentation of Formal Proof by Learning from Simulation Traces
724	RandOhm: Mitigating Impedance Side-channel Attacks using Randomized Circuit Configurations
728	An Effective Analytical Placement Approach to Handle Fence Region Constraint
730	Barber: Balancing Thermal Relaxation Deviations of NISQ Programs by Exploiting Bit-Inverted Circuits
734	SysMix: Mixed-Size Placement for Systolic-Array-Based Hierarchical Designs
739	LSMR: Synergy Randomness in Liquid State Machine and RRAM-based Analog-digital Accelerator
741	LACO: A Latency-Constraint Offline Neural Network Scheduler towards Reliable Self-Driving Perception
753	TSO-Flow: A Topology Synthesis and Optimization Workflow for Operational Amplifiers with Invertible Graph Generative Model
755	AdaPI: Facilitating DNN Model Adaptivity for Efficient Private Inference in Edge Computing
757	ProPD: Dynamic Token Tree Pruning and Generation for LLM Parallel Decoding
763	CAMSHAP: Accelerating Machine Learning Model Explainability with Analog CAM
765	ReCon: Reconfiguring Analog Rydberg Atom Quantum Computers for Quantum Generative Adversarial Networks
777	An FPGA-based Key-Switching Accelerator with Ultra-High Throughput for FHE
782	TAP-CAM: A Tunable Approximate Matching Engine based on Ferroelectric Content Addressable Memory
789	A Sparsity-Aware Autonomous Path Planning Accelerator with Algorithm-Architecture Co-Design
795	DDP-Fsim: Efficient and Scalable Fault Simulation for Deterministic Patterns with Two-Dimensional Parallelism
796	Improving Timing & Power Trade-off in Post-place Optimization Using Multi-agent Reinforcement Learning
799	CellRejuvo: Rescuing the Aging of 3D NAND Flash Cells with Dense-Sparse Cell Reprogramming
802	PACiM: A Sparsity-Centric Hybrid Compute-in-Memory Architecture via Probabilistic Approximation
803	BPINN-EM: Fast Stochastic Analysis of Electromigration Damage using Bayesian Physics-Informed Neural Networks
804	MAXCell: PPA-Directed Multi-Height Cell Layout Routing Optimization using Anytime MaXSAT with Constraint Learning
805	JigsawPlanner: Jigsaw-like Floorplanner for Eliminating Whitespace and Overlap among Complex Rectilinear Modules
806	Layout-level Hardware Trojan Prevention in the Context of Physical Design
808	StarRoute: Adaptive Compute-Efficient FPGA Routing with Pluggable Intra-Connection Bidirectional Exploration
809	An O(m+n)-Space Spatiotemporal Denoising Filter with Cache-Like Memories for Dynamic Vision Sensors
827	Accelerating Quantum Circuit Simulation with Symbolic Execution and Loop Summarization
828	BasisN: Reprogramming-Free RRAM-Based In-Memory-Computing by Basis Combination for Deep Neural Networks
830	ARO: Autoregressive Operator Learning for Transferable and Multi-fidelity 3D-IC Thermal Analysis With Active Learning
838	Voxel-CIM: An Efficient Compute-in-Memory Accelerator for Voxel-based Point Cloud Neural Networks
847	REMNA: Variation-Resilient and Energy-Efficient MLC FeFET Computing-in-Memory Using NAND Flash-Like Read and Adaptive Control
851	A Co-optimization Framework with Multi-layer Constraints for Manufacturability
852	VeriCHERI: Exhaustive Formal Security Verification of CHERI at the RTL
856	GACER: Granularity-Aware ConcurrEncy Regulation for Multi-Tenant Deep Learning
862	A Hardware-Aware Gate Cutting Framework for Practical Quantum Circuit Knitting
867	Multi-phase Coupled CMOS Ring Oscillator based Potts Machine
880	Equivalence Checking for Flow-Based Computing using Iterative SAT Solving
881	SCATTER: Algorithm-Circuit Co-Sparse Photonic Accelerator with Thermal-Tolerant, Power-Efficient In-situ Light Redistribution
885	Efficient Task Transfer for HLS DSE
893	MapFormer: Attention-based multi-DNN manager for throughout & power co-optimization on embedded devices
894	Enhancing DNN Accelerator Integrity via Selective and Permuted Recomputation
907	HDXpose: Harnessing Hyperdimensional Computing's Explainability for Adversarial Attacks
915	SNNGX: Securing Spiking Neural Networks with Genetic XOR Encryption on RRAM-based Neuromorphic Accelerator
923	Evolutionary Approximation of Ternary Neurons for On-sensor Printed Neural Networks
927	Foveated HDR: Efficient HDR Content Generation on Edge Devices Leveraging User's Visual Attention
931	Customized Retrieval Augmented Generation and Benchmarking for EDA Tool Documentation QA
942	Enforcing hard constraints in physics-informed learning for transient TSV electromigration analysis
947	RareLS: Rarity-Reducing Logic Synthesis for Mitigating Hardware Trojan Threats
949	Co-Designing Binarized Transformer and Hardware Accelerator for Efficient End-to-End Edge Deployment
962	DISC: Exploiting Data Parallelism of Non-Stencil Computations on CGRAs via Dynamic Iteration Scheduling
965	Partial Differential Equation Acceleration by Exploiting Value Similarity
966	Revisiting sensitivity-based analog sizing with derivative-aware Bayesian optimization and error-suppressed adjoint analysis
968	FLOP: A Flexible Memory-Optimized Processor for Parallel Graph Mining on FPGA
972	FaStTherm: Fast and Stable Full-Chip Transient Thermal Predictor Considering Nonlinear Effects
973	FlexHE: A flexible Kernel Generation Framework for Homomorphic Encryption-Based Private Inference
974	Hierarchical Power Co-Optimization and Management for LLM Chiplet Designs
977	AdapMoE: Adaptive Sensitivity-based Expert Gating and Management for Efficient MoE Inference
980	OCTS: An Optical Clock Tree Synthesis Methodology for 2.5D Systems
981	FAS-Trans: Fully Exploiting FFN and Attention Sparsity for Transformer on FPGA
985	RABER: Reliability-Aware Bayesian-Optimization-based Control Layer Escape Routing for Flow-based Microfluidics
986	MORPH: More Robust ASIC Placement for Hybrid Region Constraint Management
988	Is Vanilla Bayesian Optimization Enough for High-Dimensional Architecture Design Optimization?
995	MCUBERT: Memory-Efficient BERT Inference on Commodity Microcontrollers
996	DiffSAT: Differential MaxSAT Layer for SAT Solving
998	TransLib: An Extensible Graph-Aware Library Framework for Automated Generation of Transformer Operators on FPGA
999	FabGPT: An Efficient Large Multimodal Model for Complex Wafer Defect Knowledge Queries
1003	ReSCIM: Variation-Resilient High Weight-Loading Bandwidth In-Memory Computation Based on Fine-Grained Hybrid Integration of Multi-Level ReRAM and SRAM Cells
1004	SEM-CLIP: Precise Few-Shot Learning for Nanoscale Defect Detection in Scanning Electron Microscope Image
1005	An Agile Framework for Efficient LLM Accelerator Development and Model Inference
1006	Efficient High-Fidelity Two-Dimensional Warpage Modeling for Advanced Packaging Analysis
1013	Single Instruction Isolation for RISC-V Vector Test Failures
1034	CFIRSTNET: Comprehensive Features for Static IR Drop Estimation with Neural Network
1058	Sustainable High-Performance Instruction Selection for Superscalar Processors
1061	Automatic Verification and Identification of Partial Retention Register Sets for Low-Power Designs
1075	Accelerating Fault Injection for Validating Processor RTL Implementations
1080	Efficient Ultra-Dense 3D IC Power Delivery and Cooling Using 3D Thermal Scaffolding
1082	ChatOPU: An FPGA-based Overlay Processor for Large Language Models with Unstructured Sparsity
1084	MapTune: Advancing ASIC Technology Mapping via Reinforcement Learning Guided Library Tuning
1098	Enabling Robust Inverse Lithography with Rigorous Multi-Objective Optimization
1107	Differentiable Edge-based OPC
1114	An Access Pattern-aware Hybrid Learning-based and Conventional Mapping for Solid-State Drives
1117	ConSmax: Hardware-Friendly Alternative Softmax with Learnable Parameters
1121	Peak Power and Dynamic IR-drop Assessment via Waveform Augmenting
1130	PulseRF: Physics Augmented ML Modeling and Synthesis for High-Frequency RFIC Design
1140	AGC: A Unified Architecture for Accelerating K-Nearest Neighbor Graph Construction in Vector Search
1148	OFT: An accelerator with eager gradient prediction for attention training
1156	A Processing-using-Memory Architecture for Commodity DRAM Devices with Enhanced Compatibility and Reliability
1166	A Physical and Timing Aware Placement Optimization Framework Based on Graph Neural Network
1169	Leda: Leveraging Tiling Dataflow to Accelerate SpMM on HBM-Equipped FPGAs for GNNs
1179	RISCSparse: Point Cloud Inference Engine on RISC-V Processor
1188	RTLRewriter: Methodologies for Large Models aided RTL Code Optimization
1195	Towards Floating Point-Based Attention-Free LLM: Hybrid PIM with Non-Uniform Data Format and Reduced Multiplications
1196	Sustainable Hardware Specialization
1212	LiTformer: Efficient Modeling and Analysis of High-Speed Link Transmitters Using Non-Autoregressive Transformer
1218	EPipe: Pipeline Inference Framework with High-quality Offline Parallelism Planning for Heterogeneous Edge Devices
1220	MatFactory: A Framework for High-performance Matrix Factorization on FPGAs
1222	HLSPilot: LLM-based High-Level Synthesis
1223	RankTuner: When Design Tool Parameter Tuning Meets Preference Bayesian Optimization
1226	Potter: A Parallel Overlap-Tolerant Router for UltraScale FPGAs
1232	Optimal Layout Synthesis of Multi-Row Standard Cells for Advanced Technology Nodes
1236	EI-PIT: A Parallel-in-Time Exponential Integrator Method for Transient Linear Circuit Simulation
1237	APINT: A Full-Stack Framework for Acceleration of Privacy-Preserving Inference of Transformers based on Garbled Circuits
1239	InstantGR: Scalable GPU Parallelization for Global Routing
1240	Balor: HLS Source Code Evaluator Based on Custom Graphs and Hierarchical GNNs
1245	Pseudo Adjoint Optimization: Harnessing the Solution Curve for SPICE Acceleration
1248	FlexInt: A New Number Format for Robust Sub-8-Bit Neural Network Inference
1249	DeepGate3: Towards Scalable Circuit Representation Learning
1263	TReCiM: Lower Power and Temperature-Resilient Multibit 2FeFET-1T Compute-in-Memory Design
1265	UFO-MAC: A Unified Framework for Optimization of High-Performance Multipliers and Multiply-Accumulators
1266	CSP: Comprehensive Sparsification Preconditioning for Nonlinear Circuit Simulation
1278	RL-Fill: Timing-Aware Fill Insertion Using Reinforcement Learning
1280	Fast and Efficient 2-bit LLM Inference on GPU: 2/4/16-bit in a Weight Matrix with Asynchronous Dequantization
1288	A Machine Learning Guided Cut Choices for ASIC Technology Mapping
1293	MARCA: Mamba Accelerator with Reconfigurable Architecture
1296	LAG-Sizer: A Novel Gate Sizer Based on Leak Generative Adversarial Network with Feature Fusion
1314	ShiftCAM: A Time-Domain Content Addressable Memory Utilizing Shifted Hamming Distance for Robust Genome Analysis
1315	MEIC: Re-thinking RTL Debug Automation using LLMs
1323	Minimizing Worst-Case Data Transmission Cycles in Wavelength-Routed Optical NoC through Bandwidth Allocation
1329	Hybrid Modeling and Weighting for Timing-driven Placement with Efficient Calibration
1337	Tiny Deep Ensemble: Uncertainty Estimation in Edge AI Accelerators via Ensembling Normalization Layers with Shared Weights
1338	A Hypergraph Partitioner Utilizing a Novel Graph Generative Model
1349	Towards Uncertainty-Quantifiable Biomedical Intelligence: Mixed-signal Compute-in-Entropy for Bayesian Neural Networks
1350	A Framework for Explainable, Comprehensive, and Customizable Memory-Centric Workloads
1357	DoS-FPGA: Denial of Service on Cloud FPGAs via Coordinated Power Hammering
1361	AMAZE: Accelerated MiMC Hardware Architecture for Zero-Knowledge Applications on the Edge
1386	Multi-Tier 3D SRAM Module Design: Targeting Bit-Line and Word-Line Folding
1387	Detecting Fraudulent Services on Quantum Cloud Platforms via Dynamic Fingerprinting
1397	Neural Architecture Search for Highly Bespoke Robust Printed Neuromorphic Circuits
1403	A Built-In Integrated Rowhammer, Rowpress, and Leakage Detection Sensor for DRAM
1410	FloorSet - a VLSI Floorplanning Dataset with Design Constraints of Real-World SOCs.
1428	Towards Energy-Aware Federated Learning via MARL: A Dual-Selection Approach for Model and Client
1432	ADO-LLM: Analog Design Bayesian Optimization with In-Context Learning of Large Language Models
1436	AI-Driven Evaluation and Optimization of Bump Pitch Effects on Chiplet and Interposer Design Quality
1440	Modern Fixed-Outline Floorplanning with Rectilinear Soft Modules
1444	SMT-based Layout Synthesis for Silicon-based Quantum Computing with Crossbar Architecture
1467	TSB: Tiny Shared Block for Efficient DNN Deployment on NVCIM Accelerators
1472	ASCENT: Amplifying Power Side-Channel Resilience via Learning & Monte-Carlo Tree Search
1492	LaserEscape: Detecting and Mitigating Optical Probing Attacks
1500	Reinforcement Learning-Enhanced Cloud-Based Open Source Analog Circuit Generator for Standard and Cryogenic Temperatures in 130-nm and 180-nm OpenPDKs
1501	An Effective ECO Methodology for Reducing Back-side Design Rule Violations in Double-sided Signal Routing
1512	Three Guides for Efficient Automatic Post-Fabrication Optimization of Modern NAND Flash Memory
1516	Spiking Transformer Hardware Accelerators in 3D Integration
1521	Accurate, Yet Scalable: A SPICE-based Design and Optimization Framework for eNVM based Analog In-memory Computing
1530	Placement Tomography-Based Routing Blockage Generation for DRV Hotspot Mitigation
1534	Analyzing the Impact of FinFET Self-Heating on the Performance of RF Power Amplifiers
1539	OriGen: Enhancing RTL Code Generation with Code-to-Code Augmentation and Self-Reflection
1551	RACI: A Resource-Aware Cooperative Inference Framework on Heterogeneous Edge Devices
1552	Hyena: Optimizing Homomorphically Encrypted Convolution for Private CNN Inference
1554	TopoOrderPart: a Multi-level Scheduling-Driven Partitioning Framework for Processor-Based Emulation
1560	PolarGate: Breaking the Functionality Representation Bottleneck of And-Inverter Graph Neural Network
1561	CoCoA: Algorithm-Hardware Co-Design for Large-Scale GNN Training using Compressed Graph
1578	eXpect: On the Security Implications of Violations in AXI Implementations
1579	TP-DCIM: Transposable Digital SRAM CIM Architecture for Energy-Efficient and High Throughput Transformer Acceleration
1581	μLAM: A LLM-Powered Assistant for Real-Time Micro-architectural Attack Detection and Mitigation
1590	Explainable and Layout-Aware Timing Prediction
1600	Mixed-precision Neural Networks on RISC-V Cores: ISA extensions for Multi-Pumped Soft SIMD Operations