We are thrilled to announce the ICCAD 2024 accepted paper listing!
The formal notification of acceptance will be sent on June 30th as planned.

| ID | Title |
|---|---|
| 510 | Quantum State Preparation Circuit Optimization Exploiting Don't Cares |
| 514 | A Neural-Ordinary-Differential-Equations Based Generic Approach for Process Modeling in DTCO: A Case Study in Chemical-Mechanical Planarization and Copper Plating |
| 520 | ATPlace2.5D: Analytical Thermal-Aware Chiplet Placement Framework for Large-Scale 2.5D-IC |
| 522 | RapidIR: A Practical Infrastructure for FPGA High-Level Physical Synthesis |
| 523 | HeLEM-GR: Heterogeneous Global Routing with Linearized Exponential Multiplier Method |
| 526 | HeteroExcept: A CPU-GPU Heterogeneous Algorithm to Accelerate Exception-aware Static Timing Analysis |
| 527 | OSCA: End-to-end Serial Stochastic Computing Neural Acceleration with Fine-grained Scaling and Piecewise Activation |
| 529 | ZnH2: Augmenting ZNS-based Storage System with Host-managed Heterogeneous Zones |
| 546 | Hybrid Power Failure Recovery for Intermittent Computing |
| 550 | AyE-Edge: Automated Deployment Space Search Empowering Accuracy yet Efficient Real-Time Object Detection on the Edge |
| 557 | One-for-All: An Unified Learning-based Framework for Efficient Cross-Corner Timing Signoff |
| 568 | Automatic Generation of Timing Models from RTL for Hardware Accelerators |
| 571 | HybriDIFT: Scalable Memory-Aware Dynamic Information Flow Tracking for Hardware |
| 575 | ALISE: Accelerating Large Language Model Serving with Speculative Scheduling |
| 594 | AESHA: Accelerating Eigen-decomposition-based Sparse Transformer with Hybrid RRAM-SRAM Architecture |
| 596 | PrivQuant: Communication-Efficient Private Inference with Quantized Network/Protocol Co-Optimization |
| 598 | KirchhoffNet: A Scalable Ultra Fast Analog Neural Network |
| 599 | FSMM: An Efficient Matrix Multiplication Accelerator Supporting Flexible Sparsity |
| 602 | Joint Placement Optimization for Hierarchical Analog/Mixed-Signal Circuits |
| 609 | SeGen: Automatic Topology Generator for Sequencing Elements |
| 614 | NAND-Tree: A 3D NAND Flash Based Processing In Memory Accelerator for Tree-Based Models on Large-Scale Tabular Data |
| 615 | Edge-BiT: Software-Hardware Co-design for Optimizing Binarized Transformer Networks Inference on Edge FPGA |
| 630 | CircuitSeer: RTL Post-PnR Delay Prediction via Coupling Functional and Structural Representation |
| 632 | The Power of Graph Signal Processing for Chip Placement Acceleration |
| 650 | Fusion of Global Placement and Gate Sizing with Differentiable Optimization |
| 652 | R-HLS: An IR for Dynamic High-Level Synthesis and Memory Disambiguation based on Regions and State Edges |
| 656 | Beyond the Yield Barrier: Variational Importance Sampling Yield Analysis |
| 658 | Residual-INR: Communication Efficient On-Device Learning Using Implicit Neural Representation |
| 662 | Robust Implementation of Retrieval-Augmented Generation on Edge-based Computing-in-Memory Architectures |
| 668 | GL0AM: GPU Logic Simulation Using 0-Delay and Re-simulation Acceleration Method |
| 670 | Physically Aware Synthesis Revisited: Guiding Technology Mapping with Primitive Logic Gate Placement |
| 679 | ALISA: An Adaptive Learned Index Structure for Spatial Data on Solid-State Drives |
| 682 | On Reducing the Execution Latency of Superconducting Quantum Processors via Quantum Job Scheduling |
| 685 | Natural language is not enough: Benchmarking multi-modal generative AI for Verilog generation |
| 687 | EasyPart: An Effective and Comprehensive Hypergraph Partitioner for FPGA-based Emulation |
| 693 | HG-PIPE: Vision Transformer Acceleration with Hybrid-Grained Pipeline |
| 699 | Multi-Objective Software-Hardware Co-Optimization for HD-PIM via Noise-Aware Bayesian Optimization |
| 702 | GAT-Steiner: Rectilinear Steiner Minimal Tree Prediction Using GNNs |
| 705 | Bayesian-Informed Hyperdimensional Learning for Intelligent and Efficient Data Processing |
| 707 | On the Security Vulnerabilities of MRAM-based In-Memory Computing Architectures against Model Extraction Attacks |
| 715 | Word-Level Augmentation of Formal Proof by Learning from Simulation Traces |
| 724 | RandOhm: Mitigating Impedance Side-channel Attacks using Randomized Circuit Configurations |
| 728 | An Effective Analytical Placement Approach to Handle Fence Region Constraint |
| 730 | Barber: Balancing Thermal Relaxation Deviations of NISQ Programs by Exploiting Bit-Inverted Circuits |
| 734 | SysMix: Mixed-Size Placement for Systolic-Array-Based Hierarchical Designs |
| 739 | LSMR: Synergy Randomness in Liquid State Machine and RRAM-based Analog-digital Accelerator |
| 741 | LACO: A Latency-Constraint Offline Neural Network Scheduler towards Reliable Self-Driving Perception |
| 753 | TSO-Flow: A Topology Synthesis and Optimization Workflow for Operational Amplifiers with Invertible Graph Generative Model |
| 755 | AdaPI: Facilitating DNN Model Adaptivity for Efficient Private Inference in Edge Computing |
| 757 | ProPD: Dynamic Token Tree Pruning and Generation for LLM Parallel Decoding |
| 763 | CAMSHAP: Accelerating Machine Learning Model Explainability with Analog CAM |
| 765 | ReCon: Reconfiguring Analog Rydberg Atom Quantum Computers for Quantum Generative Adversarial Networks |
| 777 | An FPGA-based Key-Switching Accelerator with Ultra-High Throughput for FHE |
| 782 | TAP-CAM: A Tunable Approximate Matching Engine based on Ferroelectric Content Addressable Memory |
| 789 | A Sparsity-Aware Autonomous Path Planning Accelerator with Algorithm-Architecture Co-Design |
| 795 | DDP-Fsim: Efficient and Scalable Fault Simulation for Deterministic Patterns with Two-Dimensional Parallelism |
| 796 | Improving Timing & Power Trade-off in Post-place Optimization Using Multi-agent Reinforcement Learning |
| 799 | CellRejuvo: Rescuing the Aging of 3D NAND Flash Cells with Dense-Sparse Cell Reprogramming |
| 802 | PACiM: A Sparsity-Centric Hybrid Compute-in-Memory Architecture via Probabilistic Approximation |
| 803 | BPINN-EM: Fast Stochastic Analysis of Electromigration Damage using Bayesian Physics-Informed Neural Networks |
| 804 | MAXCell: PPA-Directed Multi-Height Cell Layout Routing Optimization using Anytime MaXSAT with Constraint Learning |
| 805 | JigsawPlanner: Jigsaw-like Floorplanner for Eliminating Whitespace and Overlap among Complex Rectilinear Modules |
| 806 | Layout-level Hardware Trojan Prevention in the Context of Physical Design |
| 808 | StarRoute: Adaptive Compute-Efficient FPGA Routing with Pluggable Intra-Connection Bidirectional Exploration |
| 809 | An O(m+n)-Space Spatiotemporal Denoising Filter with Cache-Like Memories for Dynamic Vision Sensors |
| 827 | Accelerating Quantum Circuit Simulation with Symbolic Execution and Loop Summarization |
| 828 | BasisN: Reprogramming-Free RRAM-Based In-Memory-Computing by Basis Combination for Deep Neural Networks |
| 830 | ARO: Autoregressive Operator Learning for Transferable and Multi-fidelity 3D-IC Thermal Analysis With Active Learning |
| 838 | Voxel-CIM: An Efficient Compute-in-Memory Accelerator for Voxel-based Point Cloud Neural Networks |
| 847 | REMNA: Variation-Resilient and Energy-Efficient MLC FeFET Computing-in-Memory Using NAND Flash-Like Read and Adaptive Control |
| 851 | A Co-optimization Framework with Multi-layer Constraints for Manufacturability |
| 852 | VeriCHERI: Exhaustive Formal Security Verification of CHERI at the RTL |
| 856 | GACER: Granularity-Aware ConcurrEncy Regulation for Multi-Tenant Deep Learning |
| 862 | A Hardware-Aware Gate Cutting Framework for Practical Quantum Circuit Knitting |
| 867 | Multi-phase Coupled CMOS Ring Oscillator based Potts Machine |
| 880 | Equivalence Checking for Flow-Based Computing using Iterative SAT Solving |
| 881 | SCATTER: Algorithm-Circuit Co-Sparse Photonic Accelerator with Thermal-Tolerant, Power-Efficient In-situ Light Redistribution |
| 885 | Efficient Task Transfer for HLS DSE |
| 893 | MapFormer: Attention-based multi-DNN manager for throughout & power co-optimization on embedded devices |
| 894 | Enhancing DNN Accelerator Integrity via Selective and Permuted Recomputation |
| 907 | HDXpose: Harnessing Hyperdimensional Computing's Explainability for Adversarial Attacks |
| 915 | SNNGX: Securing Spiking Neural Networks with Genetic XOR Encryption on RRAM-based Neuromorphic Accelerator |
| 923 | Evolutionary Approximation of Ternary Neurons for On-sensor Printed Neural Networks |
| 927 | Foveated HDR: Efficient HDR Content Generation on Edge Devices Leveraging User's Visual Attention |
| 931 | Customized Retrieval Augmented Generation and Benchmarking for EDA Tool Documentation QA |
| 942 | Enforcing hard constraints in physics-informed learning for transient TSV electromigration analysis |
| 947 | RareLS: Rarity-Reducing Logic Synthesis for Mitigating Hardware Trojan Threats |
| 949 | Co-Designing Binarized Transformer and Hardware Accelerator for Efficient End-to-End Edge Deployment |
| 962 | DISC: Exploiting Data Parallelism of Non-Stencil Computations on CGRAs via Dynamic Iteration Scheduling |
| 965 | Partial Differential Equation Acceleration by Exploiting Value Similarity |
| 966 | Revisiting sensitivity-based analog sizing with derivative-aware Bayesian optimization and error-suppressed adjoint analysis |
| 968 | FLOP: A Flexible Memory-Optimized Processor for Parallel Graph Mining on FPGA |
| 972 | FaStTherm: Fast and Stable Full-Chip Transient Thermal Predictor Considering Nonlinear Effects |
| 973 | FlexHE: A flexible Kernel Generation Framework for Homomorphic Encryption-Based Private Inference |
| 974 | Hierarchical Power Co-Optimization and Management for LLM Chiplet Designs |
| 977 | AdapMoE: Adaptive Sensitivity-based Expert Gating and Management for Efficient MoE Inference |
| 980 | OCTS: An Optical Clock Tree Synthesis Methodology for 2.5D Systems |
| 981 | FAS-Trans: Fully Exploiting FFN and Attention Sparsity for Transformer on FPGA |
| 985 | RABER: Reliability-Aware Bayesian-Optimization-based Control Layer Escape Routing for Flow-based Microfluidics |
| 986 | MORPH: More Robust ASIC Placement for Hybrid Region Constraint Management |
| 988 | Is Vanilla Bayesian Optimization Enough for High-Dimensional Architecture Design Optimization? |
| 995 | MCUBERT: Memory-Efficient BERT Inference on Commodity Microcontrollers |
| 996 | DiffSAT: Differential MaxSAT Layer for SAT Solving |
| 998 | TransLib: An Extensible Graph-Aware Library Framework for Automated Generation of Transformer Operators on FPGA |
| 999 | FabGPT: An Efficient Large Multimodal Model for Complex Wafer Defect Knowledge Queries |
| 1003 | ReSCIM: Variation-Resilient High Weight-Loading Bandwidth In-Memory Computation Based on Fine-Grained Hybrid Integration of Multi-Level ReRAM and SRAM Cells |
| 1004 | SEM-CLIP: Precise Few-Shot Learning for Nanoscale Defect Detection in Scanning Electron Microscope Image |
| 1005 | An Agile Framework for Efficient LLM Accelerator Development and Model Inference |
| 1006 | Efficient High-Fidelity Two-Dimensional Warpage Modeling for Advanced Packaging Analysis |
| 1013 | Single Instruction Isolation for RISC-V Vector Test Failures |
| 1034 | CFIRSTNET: Comprehensive Features for Static IR Drop Estimation with Neural Network |
| 1058 | Sustainable High-Performance Instruction Selection for Superscalar Processors |
| 1061 | Automatic Verification and Identification of Partial Retention Register Sets for Low-Power Designs |
| 1075 | Accelerating Fault Injection for Validating Processor RTL Implementations |
| 1080 | Efficient Ultra-Dense 3D IC Power Delivery and Cooling Using 3D Thermal Scaffolding |
| 1082 | ChatOPU: An FPGA-based Overlay Processor for Large Language Models with Unstructured Sparsity |
| 1084 | MapTune: Advancing ASIC Technology Mapping via Reinforcement Learning Guided Library Tuning |
| 1098 | Enabling Robust Inverse Lithography with Rigorous Multi-Objective Optimization |
| 1107 | Differentiable Edge-based OPC |
| 1114 | An Access Pattern-aware Hybrid Learning-based and Conventional Mapping for Solid-State Drives |
| 1117 | ConSmax: Hardware-Friendly Alternative Softmax with Learnable Parameters |
| 1121 | Peak Power and Dynamic IR-drop Assessment via Waveform Augmenting |
| 1130 | PulseRF: Physics Augmented ML Modeling and Synthesis for High-Frequency RFIC Design |
| 1140 | AGC: A Unified Architecture for Accelerating K-Nearest Neighbor Graph Construction in Vector Search |
| 1148 | OFT: An accelerator with eager gradient prediction for attention training |
| 1156 | A Processing-using-Memory Architecture for Commodity DRAM Devices with Enhanced Compatibility and Reliability |
| 1166 | A Physical and Timing Aware Placement Optimization Framework Based on Graph Neural Network |
| 1169 | Leda: Leveraging Tiling Dataflow to Accelerate SpMM on HBM-Equipped FPGAs for GNNs |
| 1179 | RISCSparse: Point Cloud Inference Engine on RISC-V Processor |
| 1188 | RTLRewriter: Methodologies for Large Models aided RTL Code Optimization |
| 1195 | Towards Floating Point-Based Attention-Free LLM: Hybrid PIM with Non-Uniform Data Format and Reduced Multiplications |
| 1196 | Sustainable Hardware Specialization |
| 1212 | LiTformer: Efficient Modeling and Analysis of High-Speed Link Transmitters Using Non-Autoregressive Transformer |
| 1218 | EPipe: Pipeline Inference Framework with High-quality Offline Parallelism Planning for Heterogeneous Edge Devices |
| 1220 | MatFactory: A Framework for High-performance Matrix Factorization on FPGAs |
| 1222 | HLSPilot: LLM-based High-Level Synthesis |
| 1223 | RankTuner: When Design Tool Parameter Tuning Meets Preference Bayesian Optimization |
| 1226 | Potter: A Parallel Overlap-Tolerant Router for UltraScale FPGAs |
| 1232 | Optimal Layout Synthesis of Multi-Row Standard Cells for Advanced Technology Nodes |
| 1236 | EI-PIT: A Parallel-in-Time Exponential Integrator Method for Transient Linear Circuit Simulation |
| 1237 | APINT: A Full-Stack Framework for Acceleration of Privacy-Preserving Inference of Transformers based on Garbled Circuits |
| 1239 | InstantGR: Scalable GPU Parallelization for Global Routing |
| 1240 | Balor: HLS Source Code Evaluator Based on Custom Graphs and Hierarchical GNNs |
| 1245 | Pseudo Adjoint Optimization: Harnessing the Solution Curve for SPICE Acceleration |
| 1248 | FlexInt: A New Number Format for Robust Sub-8-Bit Neural Network Inference |
| 1249 | DeepGate3: Towards Scalable Circuit Representation Learning |
| 1263 | TReCiM: Lower Power and Temperature-Resilient Multibit 2FeFET-1T Compute-in-Memory Design |
| 1265 | UFO-MAC: A Unified Framework for Optimization of High-Performance Multipliers and Multiply-Accumulators |
| 1266 | CSP: Comprehensive Sparsification Preconditioning for Nonlinear Circuit Simulation |
| 1278 | RL-Fill: Timing-Aware Fill Insertion Using Reinforcement Learning |
| 1280 | Fast and Efficient 2-bit LLM Inference on GPU: 2/4/16-bit in a Weight Matrix with Asynchronous Dequantization |
| 1288 | A Machine Learning Guided Cut Choices for ASIC Technology Mapping |
| 1293 | MARCA: Mamba Accelerator with Reconfigurable Architecture |
| 1296 | LAG-Sizer: A Novel Gate Sizer Based on Leak Generative Adversarial Network with Feature Fusion |
| 1314 | ShiftCAM: A Time-Domain Content Addressable Memory Utilizing Shifted Hamming Distance for Robust Genome Analysis |
| 1315 | MEIC: Re-thinking RTL Debug Automation using LLMs |
| 1323 | Minimizing Worst-Case Data Transmission Cycles in Wavelength-Routed Optical NoC through Bandwidth Allocation |
| 1329 | Hybrid Modeling and Weighting for Timing-driven Placement with Efficient Calibration |
| 1337 | Tiny Deep Ensemble: Uncertainty Estimation in Edge AI Accelerators via Ensembling Normalization Layers with Shared Weights |
| 1338 | A Hypergraph Partitioner Utilizing a Novel Graph Generative Model |
| 1349 | Towards Uncertainty-Quantifiable Biomedical Intelligence: Mixed-signal Compute-in-Entropy for Bayesian Neural Networks |
| 1350 | A Framework for Explainable, Comprehensive, and Customizable Memory-Centric Workloads |
| 1357 | DoS-FPGA: Denial of Service on Cloud FPGAs via Coordinated Power Hammering |
| 1361 | AMAZE: Accelerated MiMC Hardware Architecture for Zero-Knowledge Applications on the Edge |
| 1386 | Multi-Tier 3D SRAM Module Design: Targeting Bit-Line and Word-Line Folding |
| 1387 | Detecting Fraudulent Services on Quantum Cloud Platforms via Dynamic Fingerprinting |
| 1397 | Neural Architecture Search for Highly Bespoke Robust Printed Neuromorphic Circuits |
| 1403 | A Built-In Integrated Rowhammer, Rowpress, and Leakage Detection Sensor for DRAM |
| 1410 | FloorSet - a VLSI Floorplanning Dataset with Design Constraints of Real-World SOCs |
| 1428 | Towards Energy-Aware Federated Learning via MARL: A Dual-Selection Approach for Model and Client |
| 1432 | ADO-LLM: Analog Design Bayesian Optimization with In-Context Learning of Large Language Models |
| 1436 | AI-Driven Evaluation and Optimization of Bump Pitch Effects on Chiplet and Interposer Design Quality |
| 1440 | Modern Fixed-Outline Floorplanning with Rectilinear Soft Modules |
| 1444 | SMT-based Layout Synthesis for Silicon-based Quantum Computing with Crossbar Architecture |
| 1467 | TSB: Tiny Shared Block for Efficient DNN Deployment on NVCIM Accelerators |
| 1472 | ASCENT: Amplifying Power Side-Channel Resilience via Learning & Monte-Carlo Tree Search |
| 1492 | LaserEscape: Detecting and Mitigating Optical Probing Attacks |
| 1500 | Reinforcement Learning-Enhanced Cloud-Based Open Source Analog Circuit Generator for Standard and Cryogenic Temperatures in 130-nm and 180-nm OpenPDKs |
| 1501 | An Effective ECO Methodology for Reducing Back-side Design Rule Violations in Double-sided Signal Routing |
| 1512 | Three Guides for Efficient Automatic Post-Fabrication Optimization of Modern NAND Flash Memory |
| 1516 | Spiking Transformer Hardware Accelerators in 3D Integration |
| 1521 | Accurate, Yet Scalable: A SPICE-based Design and Optimization Framework for eNVM based Analog In-memory Computing |
| 1530 | Placement Tomography-Based Routing Blockage Generation for DRV Hotspot Mitigation |
| 1534 | Analyzing the Impact of FinFET Self-Heating on the Performance of RF Power Amplifiers |
| 1539 | OriGen: Enhancing RTL Code Generation with Code-to-Code Augmentation and Self-Reflection |
| 1551 | RACI: A Resource-Aware Cooperative Inference Framework on Heterogeneous Edge Devices |
| 1552 | Hyena: Optimizing Homomorphically Encrypted Convolution for Private CNN Inference |
| 1554 | TopoOrderPart: a Multi-level Scheduling-Driven Partitioning Framework for Processor-Based Emulation |
| 1560 | PolarGate: Breaking the Functionality Representation Bottleneck of And-Inverter Graph Neural Network |
| 1561 | CoCoA: Algorithm-Hardware Co-Design for Large-Scale GNN Training using Compressed Graph |
| 1578 | eXpect: On the Security Implications of Violations in AXI Implementations |
| 1579 | TP-DCIM: Transposable Digital SRAM CIM Architecture for Energy-Efficient and High Throughput Transformer Acceleration |
| 1581 | μLAM: A LLM-Powered Assistant for Real-Time Micro-architectural Attack Detection and Mitigation |
| 1590 | Explainable and Layout-Aware Timing Prediction |
| 1600 | Mixed-precision Neural Networks on RISC-V Cores: ISA extensions for Multi-Pumped Soft SIMD Operations |
