ROBORACTION.AI
"I am a seasoned expert scientist/engineer in computer vision, machine learning, autonomous driving, and LLM-based embodied agents."
"I am a seasoned scientist/engineer in information retrieval, natural language processing, speech recognition, LLM-based chatbots, and embodied agents."
Experts
Applying AI to Different Data/Agent Domains
Computer Vision
Images, Videos
Machine Learning
Deep Learning
Autonomous Driving
Perception, Mapping, Localization, Prediction, Planning, Simulation
LLM, Visual-Language Model, Large Scale Multi-modality Model, LLM-based Embodied Agent
Large Foundation Model
I have worked in industry for more than 20 years
30
Years of Experience in Computer Vision
20
Years of Experience in Machine Learning
7
Years of Experience in Autonomous Driving
5
Years of Experience in VR/AR/Visualization
Selected Projects Led or Contributed To
image/video processing for noise removal and quality enhancement;
object detection, tracking and segmentation;
sensor calibration and fusion by deep learning;
virtual reality and augmented reality;
vision-based localization with HD map;
HD map building with cameras;
simulation of autonomous driving with CARLA;
trajectory prediction with interaction modeling;
sampling-based trajectory planning with neural models;
data closed loop of autonomous driving with smart mile selection and automatic annotation;
V2X collaboration via a hierarchical data embedding framework based on deep learning.
Technologies
Our Experience
"I am a seasoned expert in computer vision, machine learning, autonomous driving, and LLM-based embodied agents. I have worked at companies in the semiconductor, telecommunication, internet, multimedia, and manufacturing industries. I was previously VP of research at an autonomous driving chip startup, chief scientist of autonomous driving at an OEM's software subsidiary, senior architect at a public internet company, senior staff architect of computer vision and deep learning at a world top-3 semiconductor company, senior staff R&D engineer at a world top-3 semiconductor company, and senior researcher at a world top-3 telecommunication company.
I have contributed to many research and product development projects across areas such as computer vision, VR, AR, multimedia, and security."

"I am a seasoned scientist and engineer in speech recognition, information retrieval, natural language processing, chatbots, and LLM-based embodied agents. I earned my Ph.D. in ECE from a U.S. program ranked in the top 5 in its field."
Technologies
Computer Vision, VR/AR and Human Machine Interaction
- GPS-based
- BEV network
- Occupancy Network
- Data driven prediction and planning
​
- Spherical View Rendering
- Image-based Rendering
- Depth Image-based Rendering
- Camera Pose Estimation and Tracking
- Image-based Relocalization
- Volume Rendering
​
Technologies
Autonomy and LLM-based Embodied Agent
- Image Classification and Search/Retrieval
- Visual Object Detection and Tracking
- Visual Scene Segmentation
- Sensor Calibration and Fusion
- Driving Behavior modeling and prediction
- Pedestrian/Cyclist behavior modeling and prediction
- HD map generation and localization
- Simulation of traffic scenes with road network
- Data closed loop with smart data selection and automatic annotation
- BEV/Occupancy perception network
​
- LLM (ChatGPT, GPT-4);
- Visual language model (CLIP, DALL-E);
- Multi-modality model (PaLM-E, GPT-4V);
- Embodied AI for LLM-based agents (RT-X);
- Fine-tuning (adapter tuning, prefix-tuning, instruction tuning, prompt tuning);
- Emergence (in-context learning);
- Human preference alignment (RLHF);
- Hallucination and interpretability;
- Knowledge graph and Reasoning on Graphs (RoG);
- Search engine and Retrieval-augmented generation (RAG).