PerfXLab 澎峰科技

Discovering Potential. Driving Performance.

About Us

PerfXLab is a heterogeneous computing solutions company. We provide performance products and services for embedded systems, cloud computing, and artificial intelligence applications.

Our team members come from MIT and the Chinese Academy of Sciences, and all are seasoned developers and researchers in performance optimization. We have led the development of OpenBLAS, as well as the OpenCL module in OpenCV.

We created the InferXLite AI software stack for embedded systems. With PerfBox hardware, we offer our partners a turnkey embedded AI solution. For the RISC-V community, we provide cost-effective RVBoards open hardware and software.


PerfBLAS is an optimized BLAS library for deep learning on ARM embedded systems.

PerfBLAS can improve AlexNet model performance by about 100% on ARMv7 and ARMv8 CPU cores.

PerfBLAS supports Linux and Android OS.


InferXLite is a lightweight deep learning inference framework for embedded systems. It supports ARM CPUs, ARM Mali GPUs, AMD APU SoCs, and NVIDIA GPUs.

We provide a tool to convert DNN models, currently from the Caffe and Darknet model formats. We plan to support other model formats in the future.


PerfBox is a 64-bit ARM embedded board. With the InferXLite and PerfBLAS AI software stack, it runs the SqueezeNet v1.1 model in 87 ms on a single CPU core.

PerfBox is used in smart cameras, facial recognition systems, new retail, and more.


Perf-FPGA is an AI computing solution for FPGA embedded systems. It supports deep-learning-based object detection and tracking.

We have already deployed the Perf-FPGA solution in drone, video surveillance, and education applications.

Perf-FPGA includes DL-Quants, DL-Compiler, and DL-Accelerator.


Deep learning applications on the AMD APU solution:

  • High performance: By combining the high-performance InferXLite+PerfDNN stack with an AMD APU SoC, we can achieve real-time deep learning inference.
  • Low power: 15 W TDP, configurable from 12 W to 25 W.
  • Supports various models: YOLO V3/V2 and SSD object detection models.
  • Supports image processing, object detection, and tracking.
  • Supports H.264/H.265 video and cameras.
  • Provides an integrated development board and the InferXLite software stack.


Perf-V is an FPGA demo board designed for the RISC-V open-source community. It integrates various peripheral chips and offers many interfaces.

Perf-V is highly flexible and supports porting multiple architectures. We provide a range of materials to help you learn our product, and offer an ideal experimental platform for designing RISC-V and FPGA products. It is well suited for studying, developing programs, and building demos.

Where to buy:

High-Performance Libraries and Software Stack Services

Our services include:

  • Deploying and optimizing OpenBLAS for custom hardware and applications.
  • Developing high-performance libraries, including BLAS, LAPACK, FFT, IPP, and computer vision libraries.
  • Developing deep learning software stacks for custom hardware and applications.
  • Performance tuning.

Join Us

Job openings: HPC software developer (C/C++/performance optimization), embedded software developer, FPGA engineer, computer vision algorithm engineer.

Full-time and internship positions are available. Please send your CV to [email protected].

Stay In Touch: