Micro-HPC Related Publications
[Journal]
J9. Faster than Fast: Accelerating Oriented FAST Feature Detection on Low-end Embedded GPUs
Qiong Chang, Xinyuan Chen, Xiang Li, Weimin Wang, Jun Miyazaki
ACM Transactions on Embedded Computing Systems, 2025 [bib|DOI]
J8. Accelerating Nearest Neighbor Search in 3D Point Cloud Registration on GPUs
Qiong Chang, Weimin Wang, Jun Miyazaki
ACM Transactions on Architecture and Code Optimization, 2025 [bib|DOI]
J7. An Optimized GPU Implementation for GIST Descriptor
Xiang Li, Qiong Chang, Aolong Zha, Shijie Chang, Yun Li, Jun Miyazaki
ACM Transactions on Architecture and Code Optimization, 2024 [bib|DOI]
J6. TinyStereo: A Tiny Coarse-to-Fine Framework for Vision-based Depth Estimation on Embedded GPUs
Qiong Chang, Xin Xu, Aolong Zha, Yongqing Sun, Yun Li
IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2024 [bib|DOI]
J5. High-precision plant height measurement by drone with RTK-GNSS and single camera for real-time processing
Yuta Matsuura, Heming Zhang, Kousuke Nakao, Qiong Chang, Firmansyah Iman, Shin Kawai, Yoshiki Yamaguchi, Tsutomu Maruyama, Hisayoshi Hayashi, Hajime Nobuhara
Scientific Reports, 2023 [bib|DOI]
J4. Multi-Directional Sobel Operator Kernel on GPUs
Qiong Chang, Xiang Li, Yun Li, Jun Miyazaki
Journal of Parallel and Distributed Computing, 2023 [bib|DOI]
J3. An Incremental SAT-Based Approach for Solving the Real-Time Taxi-Sharing Service Problem
Aolong Zha, Qiong Chang, Itsuki Noda
Discrete Applied Mathematics, 2023 [bib|DOI]
J2. Efficient Stereo Matching on Embedded GPUs with Zero-Means Cross Correlation
Qiong Chang, Aolong Zha, Weimin Wang, Xin Liu, Masaki Onishi, Lei Lei, Tsutomu Maruyama
Journal of Systems Architecture, 2022 [bib|DOI]
J1. Real-Time Stereo Vision System: A Multi-Block Matching on GPU
Qiong Chang, Tsutomu Maruyama
IEEE Access, 2018 [bib|DOI]
[Conference]
C14. FSAC-IA: A Hierarchical Constructed SAC-IA Algorithm for Point Cloud Alignment Acceleration
Ziyang Yu, Qiong Chang, Jun Miyazaki
The IEEE International Conference on Image Processing [bib|DOI]
C13. Efficient Parallel Implementation of Non-Local Means Algorithm on GPU
Xiang Li, Qiong Chang, Yun Li, Jun Miyazaki
17th Workshop on General Purpose Processing Using GPU (GPGPU2025), 2025 [bib|DOI]
C12. K-way In-place Merge by CPU-GPU Cooperative Processing
Shinya Miura, Qiong Chang, Jun Miyazaki
35th IEEE International Conference on Application-specific Systems, Architectures and Processors (ASAP), 2024 [bib|DOI]
C11. Extension of Parallel Primitives and Their Applications to Large-Scale Data Processing
Masashi Nakano, Qiong Chang, Jun Miyazaki
35th International Conference on Database and Expert Systems Applications (DEXA), 2024 [bib|DOI]
C10. Acceleration of Neural Network Inference for Embedded GPU Systems
Kei Terakura, Qiong Chang, Jun Miyazaki
International Conference on Big Data and Smart Computing (BigComp), 2024 [bib|DOI]
C9. GPU Acceleration of Multi-object Tracking with Motion Vector interpolation and Affine Transformation
Yoshiki Kunimoto, Qiong Chang, Yoshiki Yamaguchi, Tsutomu Maruyama
34th IEEE International Conference on Application-specific Systems, Architectures and Processors (ASAP), 2023 [bib|DOI]
C8. VAN-ICP: GPU-Accelerated Approximate Nearest Neighbor Search for ICP Registration via Voxel Dilation
Weimin Wang, Qiong Chang
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023 [bib|DOI]
C7. StereoVAE: A lightweight stereo-matching system using embedded GPUs
Qiong Chang, Xiang Li, Xun Xi, Xin Liu, Yun Li, Jun Miyazaki
International Conference on Robotics and Automation (ICRA), 2023 [bib|DOI]
C6. Acceleration of video stabilization using embedded GPU
Yuzuki Mimura, Qiong Chang, Tsutomu Maruyama
IEEE 33rd International Conference on Application-specific Systems, Architectures and Processors (ASAP), 2022 [bib|DOI]
C5. Fast SQL/Row Pattern Recognition Query Processing using Parallel Primitives on GPUs
Tsubasa Ohara, Qiong Chang, Jun Miyazaki
32nd International Conference on Database and Expert Systems Applications (DEXA), 2021 [bib|DOI]
C4. Z2-ZNCC: ZigZag Scanning based Zero-means Normalized Cross Correlation for Fast and Accurate Stereo Matching on Embedded GPU
Qiong Chang, Aolong Zha, Weimin Wang, Masaki Onishi, Tsutomu Maruyama
IEEE 38th International Conference on Computer Design (ICCD), 2020 [bib|DOI]
C3. A GPU Accelerator for Domain Transformation-Based Stereo Matching
Qiong Chang, Aolong Zha, Masaki Onishi, Tsutomu Maruyama
Proceedings of the 2nd International Conference on Algorithms, Computing and Artificial Intelligence (ACAI), 2019 [bib|DOI]
C2. Real-Time High-Quality Stereo Matching System on a GPU
Qiong Chang, Tsutomu Maruyama
IEEE 29th International Conference on Application-specific Systems, Architectures and Processors (ASAP), 2018 [bib|DOI]
C1. Fast convolution kernels on Pascal GPU with high memory efficiency
Qiong Chang
Proceedings of the 26th High Performance Computing Symposium (HPC), 2018 (Best Paper Award) [bib|DOI]