Research Programmes

Computing-in-Memory Architecture for Large-scale AI Models

RP1

Tackling the compute, memory, and memory bandwidth scaling problems, particularly focusing on advancing CIM-based chip architecture to enable the ultra-energy-efficient and scalable deployment of large-scale AI models.

Explore
Hardware-Software Co-Design for Energy-Efficient and High-Performance Edge AI

RP2

Advancing the field of algorithm-hardware co-design, addressing new challenges arising from the increasing complexity, scalability, energy efficiency, and latency requirements of embodied large-scale AI applications.

Explore
EDA (Electronic Design Automation) for Large-Scale AI Hardware

RP3

Exploring new frontiers in EDA for AI chip and system design, with the primary focus placed on supporting the chiplet- and 3DIC-based designs, overcoming the limitations of existing EDA tools, and exploring the employment of foundation AI models in various design tasks to enable more agile chip design.

Explore
Cross-layer Optimization and Demonstration for Targeted Applications

RP4

Leveraging the technologies developed in the above 3 research projects to perform cross-layer optimization and develop hardware-accelerated working systems for domain-specific, large-scale AI applications with revolutionary performance.

Explore

RP2 - Hardware-Software Co-Design for Energy-Efficient and High-Performance Edge AI

Advancing the field of algorithm-hardware co-design, addressing new challenges arising from the increasing complexity, scalability, energy efficiency, and latency requirements of embodied large-scale AI applications.

RP2-1:
Hardware/Software Co-Optimization Toward Energy-Minimal Personalized Health-AI Companions

Hardware/software co-optimization toward energy-minimal personalized health-AI companions

This project aims to create ultra-low-power AI companions for chronic health condition monitoring. It focuses on co-optimizing feature extraction and machine learning algorithms to meet the strict energy and performance requirements of edge devices. Novel strategies for in-memory and near-memory computing will be developed to minimize energy consumption while ensuring high-quality real-time processing of bio-signals.

RP2-2:
Energy Efficient Heterogeneous Edge-Cloud Collaborative Learning Paradigm for Large-Scale AI Models with Privacy Preservation

Energy-efficient heterogeneous edge-cloud collaborating learning paradigm for large-scale AI models with privacy preservation.

This project addresses the challenge of deploying large-scale AI models on resource-constrained edge devices without compromising performance or privacy. By leveraging a heterogeneous framework for hardware and neural architecture co-exploration, this project aims to enable efficient and dynamic deployment of large-scale AI models. Additionally, an energy-aware cloud-edge collaborative learning paradigm will be developed to optimize energy consumption and enhance learning efficiency, particularly in federated learning scenarios.

RP2-3:
A Hardware-aware Software Optimization of 3D Neural Rendering Algorithms for Real-time Acceleration in Edge Devices

A hardware-aware software optimization of 3D neural rendering algorithms for real-time acceleration in edge devices

This project targets the optimization of recent Neural Radiance Field (NeRF) algorithms for mobile rendering and simultaneous localization and mapping (SLAM). To address the intrinsic high computational and power overhead, this project proposes a hardware-aware software co-design, optimizing data representation, hierarchical volume sampling, and reusable pixel determination for rendering, thereby achieving significant improvements in latency and energy efficiency.

RP2-4:
Energy-efficient Inference and Fine-tuning of Diffusion Transformers using Dynamic Mixed-precision Arithmetic for Generative AI

Energy-efficient inference and fine-tuning of diffusion transformers using dynamic mixed-precision arithmetic for generative AI.

This project tackles the challenges associated with Transformer-based large-scale AI models by developing a mixed-datatype accelerator that supports both integer and floating-point operations, facilitating efficient inference and model fine-tuning on edge devices. The co-design of hardware and software for style transfer applications will demonstrate the practical benefits of this approach.

RP2 - Hardware-Software Co-Design for Energy-Efficient and High-Performance Edge AI

RP2-1: Hardware/Software Co-Optimization Toward Energy-Minimal Personalized Health-AI Companions

RP2-2: Energy Efficient Heterogeneous Edge-Cloud Collaborative Learning Paradigm for Large-Scale AI Models with Privacy Preservation

RP2-3: A Hardware-aware Software Optimization of 3D Neural Rendering Algorithms for Real-time Acceleration in Edge Devices

RP2-4: Energy-efficient Inference and Fine-tuning of Diffusion Transformers using Dynamic Mixed-precision Arithmetic for Generative AI

RP2-1:
Hardware/Software Co-Optimization Toward Energy-Minimal Personalized Health-AI Companions

RP2-2:
Energy Efficient Heterogeneous Edge-Cloud Collaborative Learning Paradigm for Large-Scale AI Models with Privacy Preservation

RP2-3:
A Hardware-aware Software Optimization of 3D Neural Rendering Algorithms for Real-time Acceleration in Edge Devices

RP2-4:
Energy-efficient Inference and Fine-tuning of Diffusion Transformers using Dynamic Mixed-precision Arithmetic for Generative AI