top of page

ROBORACTION.AI

​"I am Seasoned Expert Scientist/Engineer for Computer Vision/Machine Learning/Autonomous Driving/LLM-based Embodied Agent."

"​I am Seasoned Scientist/Engineer for Informarion Retrieval, Natural Language Processing, Speech Recognition, LLM-based chatbot and Embodied Agent."

COMBG.jpeg
Services

Experts

Apply AI for Different Data/Agent Domains

Computer Vision

Images, Videos

Machine Learning

Deep Learning

Autonomous Driving

Perception, Mapping, Localization, Prediction, Planning, Simulation

LLM, Visual-Language Model, Large Scale Multi-modality Model, LLM-based Embodied Agent

Large Foundation Model

Blue Flowers

I have worked in industry field for more than 20 years

30

Years of Experience in Computer Vision

20

Years of Experience in Machine Learning

7

Years of Experience in Auttonomous Driving

5

Years of Experience in VR/AR/Visualization

Some Projects being Led or Involved

image/video processing for noise removal and quality enhancement;

object detection, tracking and segmentation;

sensor calibration and fusion by deep learning;

virtual reality and augmented reality;

vision-based localization with HD map;

HD map building with cameras;

Simulation of autonomous driving with Carla;

trajectory prediction with interaction modeling;

sampling-based trajectory planning with neural models;

data closed loop of autonomous driving with smart mile selection and automatic annotation;

V2X collaboration from a hierarchical data embedding framework by deep learning.

About

Technologies

Our Experience

"I am seasoned expert in computer vision/machine learning/Autonomous driving/LLM-based Embodied Agents. I have worked in various companies in semi-conductor, tele-communication, internet, multimedia and manufacture etc. I once was a VP of research in a autonomous driving chip startup company, a chief scientist of autonomous driving in a OEM's software subsidiary, a senior architect in an internet public company, a senior staff architect of computer vision and deep learning in a world top-3 semi-conductor company, a senior staff R&D engieer in a world top-3 semi-conductor company, and senior researcher in a world top-3 telecommunication company.

I joined many projects in research and produc developments, from different areas, like computer vision, VR, AR, multimedia and security etc."

​

"I am a seasoned scientist and engineer in speech recognition, information retrieval, natural language processing, chatbot and LLM-based embodied agent.  I got the ECE Ph D in top 5 major rank of USA."

Technologies

Computer Vision, VR/AR and Human Machine Interaction

  1. Stereo Vision

  2. Multi View Stereo (MVS)

  3. Visual Odometry

  4. Visual SLAM

  5. Inertial-based

  6. GPS-based

  7. BEV network

  8. Occupancy Network

  9. Data driven prediction and planning

​

  1.  Spherical View Rendering

  2. Image-based Rendering

  3. Depth Image-based Rendering

  4. Camera Pose Estimation and Tracking

  5. Image-based Relocalization

  6. Volume Rendering

​

  1. Speech Recognition

  2. Speech Synthesize

  3. Natural Language Processing (NLP)

  4. Chatbot

  5. Face Detection

  6. Face Recognition

  7. Facial Landmarks Detection

  8. Facial Expression/Emotion Classification

  9. Hand Gesture Tracking

  10. Body Pose Estimation and Tracking.

Technologies

Autonomoy and LLM-based Embodied Agent

  1. Image Classification and Search/Retrieval

  2. Visual Object Detection and Tracking

  3. Visual Scene Segmentation

  4. Sensor Calibration and Fusion

  5. Driving Behavior modeling and prediction

  6. Pedestrian/Cyclist behavior modeling and prediction

  7. HD map generation and localization

  8. Simulation of traffic scenes with road network

  9.  Data closed loop with smart data selection and automatic annotation

  10. BEV /Occupancy perception network

​

  1. LLM (chatGPT, GPT-4.0);

  2. Visual language model (CLIP, DALL-E);

  3.  Multi-modality model (PaLM-E,  GPT-4V);

  4.  Embodied AI for LLM-based agents (RT-X);

  5. Fine-tuning (adapter tuning, prefix-tuning, instruct-tuning, prompt tuning);

  6. Emergence (in-context learning);

  7. Human preference alignment (RLHF);

  8. Hauccination and interpretivity;

  9.  Knowlege graph and Reasoning on graph (RoG);

  10.  Search engine and Retrieval augmented generation (RAG).

Citrus Fruits
Testimonials

Consulting Staffs

Yu Huang, CEO and Chief Scientist
​

Roboraction.AI

Clients

Contact

Contact

CONTACT

Company Address

3309 N Mississippi Ave #202,

Portland, OR 97227 

  • LinkedIn
  • Facebook
  • Twitter

Thanks for submitting!

bottom of page