Selected Research

Highlighted research projects with performance benchmarks

TACO 2025

Accelerating Nearest Neighbor Search in 3D Point Cloud Registration on GPUs

Qiong Chang, Weimin Wang, Jun Miyazaki
A GPU-accelerated method to significantly speed up nearest neighbor search for 3D point cloud registration, enhancing real-time performance in high-density spatial data processing.
GPU 3D Vision Point Cloud
12×
speedup
TECS 2025

Faster than Fast: Accelerating Oriented FAST Feature Detection on Low-end Embedded GPUs

Qiong Chang, Xinyuan Chen, Weimin Wang, Xiang Li, Jun Miyazaki
Two methods to accelerate the most time-consuming steps in Oriented FAST feature detection: FAST feature point detection and Harris corner detection.
Embedded GPU Feature Detection
2.2×
speedup
TACO 2025

3D GNLM: Efficient 3D Non-Local Means Kernel with Nested Reuse Strategies for Embedded GPUs

Xiang Li, Qiong Chang*, Yun Li, Jun Miyazaki
An efficient parallel implementation of the 3D Non-Local Means denoising algorithm on GPU, significantly accelerating performance for high-resolution medical image processing tasks.
GPU Medical Imaging
5.5×
speedup
TACO 2024

An Optimized GPU Implementation for GIST Descriptor

Xiang Li, Qiong Chang*, Aolong Zha, Shijie Chang, Yun Li, Jun Miyazaki
An optimized GPU-based implementation of the GIST descriptor, significantly accelerating image feature extraction for large-scale visual processing tasks.
GPU Feature Extraction
6.4×
speedup
IEEE TSMC 2024

TinyStereo: A Tiny Coarse-to-Fine Framework for Vision-based Depth Estimation on Embedded GPUs

Qiong Chang, Xin Xu, Aolong Zha, Yongqing Sun, Yun Li
A lightweight coarse-to-fine stereo matching framework optimized for embedded GPUs, enabling efficient and accurate depth estimation under constrained resources.
Embedded GPU Stereo Matching Depth Estimation
22
fps on TX2
JPDC 2023

Multi-Directional Sobel Operator Kernel on GPUs

Qiong Chang, Xiang Li, Yun Li, Jun Miyazaki
A GPU-accelerated multi-directional Sobel operator kernel for efficient and parallel edge detection across multiple gradient orientations.
GPU Edge Detection
11×
speedup
JSA 2022

Efficient Stereo Matching on Embedded GPUs with Zero-Means Cross Correlation

Qiong Chang, Aolong Zha, Weimin Wang, Xin Liu, Masaki Onishi, Lei Lei, Tsutomu Maruyama
Fast ZNCC feature matching on embedded GPUs, offering an effective real-time alternative to traditional Census in stereo matching.
Embedded GPU Stereo Matching
speedup