Autonomous Object Tracking and Obstacle Avoidance on a Gobilda Robot

Gobilda Robot Setup

Abstract

This project developed an autonomous control system for a Gobilda robot equipped with an OAK-D Pro camera and RPLIDAR A1 sensor. It uses a finite state machine (FSM) approach to allow the robot to track people while avoiding obstacles. The system integrates computer vision and LiDAR for real-time navigation using ROS2 middleware. The robot demonstrates smooth behavioral transitions and performance suitable for dynamic environments, validating its capability for applications requiring both environment awareness and target following.

Introduction

Software Architecture

Built on ROS2 and a node-based structure, the Object Tracking Node subscribes to:

Velocity commands are published to cmd_vel.
Neural Network: MobileNet-SSD selected for fast and reliable person detection.

ROS2 Node Diagram
Figure 1: ROS2 Node Architecture

Finite State Machine Design

Finite State Machine Diagram
Figure 2: Finite State Machine Diagram

Implementation Details

Object Detection & Tracking

The camera uses MobileNet-SSD for real-time person detection. Upon detection, positional data (x, y, z) is extracted and processed to determine proximity and direction relative to the robot.

Object Detection Overlay
Figure 3: MobileNet Detection Overlay

Obstacle Avoidance

LiDAR data (ranges 340–800) monitors the area ahead. Obstacles within 0.6 m trigger avoidance states. If no LiDAR data, the robot enters recovery.

Motion Control

Proportional control adjusts linear and angular velocities based on the detected target’s vector position. Separate gains are used for right and left turns to counteract hardware asymmetry. The logic reduces speed as the robot approaches its target to minimize oscillations for smooth tracking.

Challenges & Solutions

Demonstration Results

Future Improvements

Conclusion

This project validated a multi-modal sensor fusion approach for real-time autonomous object tracking and obstacle avoidance, leveraging effective state logic and proportional control to reliably navigate dynamic environments.


References