Positioning algorithm for AGV autonomous driving platform based on artificial neural networks

Bałazy, Patryk; Gut, Paweł; Knap, Paweł

doi:10.21595/rsa.2021.22109

Robotic Systems and Applications

Browse Journal

Submit article

Published: 11 August 2021

Check for updates

Positioning algorithm for AGV autonomous driving platform based on artificial neural networks

Patryk Bałazy¹

Paweł Gut²

Paweł Knap³

^{1, 2, 3}Faculty of Mechanical Engineering and Robotics, AGH University of Science and Technology, Kraków, Poland

Corresponding Author:

Patryk Bałazy

Cite the article Download PDF

Downloads 817

CrossRef Citations 10

Abstract

This paper presents an artificial intelligence algorithm responsible for the autonomy of a platform. The proposed algorithm allows the platform to move from an initial position to a set one without human intervention and with understanding and response to the dynamic environment. The implementation of such a task is possible by using a combination of a camera identifying the environment with a laser LIDAR sensor and a vision system. The signals from the sensors are analysed through convolutional neural networks. Based on AI inference, the platform makes decisions, including determining the optimal path for itself. A transfer learning method will be used to teach the neural network. This article presents the results of learning the applied neural algorithm.

Positioning algorithm for AGV autonomous driving platform based on artificial neural networks

Highlights

The result of the work is a properly working algorithm that enables the mobile platform to move autonomously to a given destination.
The DDPG agent was used as the first choice in environments with continuous action spaces and observation.
The trained DDPG agent was able to drive the platform correctly.

1. Introduction

The everyday life faced by people with mobility impairments is a real challenge for them. They have problems getting from A to B. Of course, they are helped by people such as carers or nurses. However, they are also people who sooner or later get tired and who have their own daily needs. Electric wheelchairs are another way of helping people with disabilities to get around. These take the strain off the arm muscles of people who have mobility problems or complete leg inertia. However, the biggest problem is the movement of people who are not even able to steer the previously mentioned electric wheelchair. An AGV autonomous driving platform equipped with a neural positioning algorithm comes to the aid of people with disabilities.

An autonomous AGV equipped with a positioning algorithm is an innovative solution that can make everyday life easier for people with mobility impairments. The AGV is connected to the cloud, where data on the location of a given place is stored, as well as who, when, how and why it was prepared. An algorithm controls the platform in such a way that it moves the disabled person between two points, choosing the fastest possible route.

The issue of AGV control is the subject of extensive university research [1]-[3]. These are based on image analysis networks [4]-[6], position analysis networks [7], [8], and fuzzy logic [9]-[13]. This paper describes the study of a platform autonomy support algorithm using a reinforcement learning technique in combination with the use of artificial neural networks. The aim of this method is to train an agent to perform a task in an uncertain environment. The agent receives observations from the LIDAR sensor and a reward for its actions from the environment and in response sends actions (called actions) to the agent. The reward is a measure of the success of the action with respect to the achievement of the task goal, defined by the reward function. The agent's task is to accumulate the reward, i.e. to learn how to interact with the environment in the most beneficial way.

2. Environment model

The mobile platform for which the algorithm is developed is a four-wheeled mobile robot equipped with a two-dimensional LIDAR environment scanner. Simulink software was used to simulate the operation of the robot and the sensors. An environment model (FIG1) was created containing the following factors: kinematics model, LIDAR sensor operation simulator, collision signalling, adding noise to LIDAR readings and reward function. The mobile robot uses kinematics based on a differential mechanism. The modelled environment was then used in a further scheme to link it to the agent's actions. It uses the output of the environment, these are observations, collision information and reward value. The agent uses this data both during learning and during simulation.

Fig. 1Block diagram of the environment model

For the model created, the actions are the linear and angular velocity preset and the observations are the readings from the LIDAR scanner. These are normalised and given a range of –1 to 1, where the first value represents clockwise rotation at maximum speed or backward travel at maximum speed, and the second value represents counterclockwise rotation at maximum speed or forward travel at maximum speed.

3. Reward function

A very important element of reinforcement learning is the reward function. It is based on this function that the agent decides which actions to perform and for which it receives the greatest reward. It was used, among others, to ask the final location of our robot. For this purpose, in the course of research work, two reward functions were designed, which allow to reach two different locations on the map. This is defined as a three-dimensional function that has the coordinates of the robot on the $x$ and $y$ axes, while the reward value is defined on the $z$ axis. Each function has one maximum for the coordinates of the robot's final destination. It was also decided that the reward function should only take negative values, this prevents the reward from accumulating while the robot stands still. The designed functions are presented in three-dimensional space (Fig. 2, Fig. 3) and in two-dimensional space superimposed on a room map (Fig. 4, Fig. 5).

Fig. 2Reward function for the room in the bottom right corner of the map

Fig. 3Reward function for the room in the top right corner of the map

Fig. 4Room map with superimposed reward function 1

Fig. 5Room map with superimposed reward function 2

The higher the intensity of the colours in the graph, the higher the value of the reward function, which makes the robot more likely to head towards higher values of the reward function. With the help of the reward function it is possible to define the final destination that the robot should go to. As can be seen in Fig. 4, the most intense colours are in the room at the bottom right of the map and this is where the robot will head. Fig. 5 shows the reward function for which the robot will be directed to the room located in the upper right corner of the map.

4. Training algorithm/result

Once the environment model has been created, and the reward function designed, the process of teaching the agent can proceed. This is a time-consuming process, as each episode involves simulating the agent in the Simulink environment. It also requires a lot of computing power. Learning is completed when the average reward is reached. Fig. 6 shows the progress of agent learning after three hours. The DDPG agent was used because it is often proposed as the first choice in environments with continuous action spaces and observation. The actor and critic functions are deep neural networks. DDPG agent is an actor-critic, model-free algorithm based on the deterministic policy gradient.

Fig. 6Progress of the agent’s learning process

During the learning of the agent we received many failures, this was mainly related to an inadequately developed reward function. The platform instead of avoiding obstacles collided with a wall. In the end we managed to train the agent so that the platform drives correctly. The simulation of the trained algorithm is presented in Fig. 7. As can be seen, the platform spontaneously selects a path to the destination, avoiding on its way an obstacle in the form of a pillar. During the training of the agent in this case the reward function was used, directing the platform to a room located in the lower right corner of the map.

Fig. 7Driving simulation of an autonomous platform

5. Conclusions

The result of the work is a properly working algorithm that enables the mobile platform to move autonomously to a given destination. It allows for the optimal trajectory to be determined, avoiding obstacles encountered on the way. Ultimately, the algorithm will be implemented in a real AGV robot. The results of work on the hardware implementation of the created algorithm and field tests will be the subject of future publications.

The use of reinforcement learning in system control tasks works very well and significantly facilitates the implementation of such solutions. It also allows to obtain better behaviour of the robot in an unknown environment.

During the work, the learning time of the algorithm, despite the use of parallel learning, proved to be extremely problematic. This meant that even the smallest changes to the structure of the reward, actor or critic functions, or a change in the learning parameters required re-training and waiting for hours for results. This significantly affected the development time of the algorithm.

References

F. Gul, W. Rahiman, and S. S. Nazli Alhady, “A comprehensive study for robot navigation techniques,” Cogent Engineering, Vol. 6, No. 1, p. 1632046, Jan. 2019, https://doi.org/10.1080/23311916.2019.1632046

Publisher
M. Soylu, N. E. Özdemirel, and S. Kayaligil, “A self-organizing neural network approach for the single AGV routing problem,” European Journal of Operational Research, Vol. 121, No. 1, pp. 124–137, Feb. 2000, https://doi.org/10.1016/s0377-2217(99)00032-6

Publisher
J. Keighobadi, K. A. Fazeli, and M. S. Shahidi, “Self-constructing neural network modeling and control of an AGV,” Positioning, Vol. 4, No. 2, pp. 160–168, 2013, https://doi.org/10.4236/pos.2013.42016

Publisher
J. Varagul and T. Ito, “Simulation of detecting function object for AGV using computer vision with neural network,” Procedia Computer Science, Vol. 96, pp. 159–168, 2016, https://doi.org/10.1016/j.procs.2016.08.122

Publisher
Y. Bengio, “Learning deep architectures for AI,” Foundations and Trends® in Machine Learning, Vol. 2, No. 1, pp. 1–127, 2009, https://doi.org/10.1561/2200000006

Publisher
M. Gupta, “Deep learning,” in International Series in Operations Research and Management Science, Cham: Springer International Publishing, 2019, pp. 569–595, https://doi.org/10.1007/978-3-319-68837-4_17

Publisher
S. Ito, M. Soga, S. Hiratsuka, H. Matsubara, and M. Ogawa, “Quality index of supervised data for convolutional neural network-based localization,” Applied Sciences, Vol. 9, No. 10, p. 1983, May 2019, https://doi.org/10.3390/app9101983

Publisher
Z. Yan, B. Ouyang, D. Li, H. Liu, and Y. Wang, “Network intelligence empowered industrial robot control in the F-RAN environment,” IEEE Wireless Communications, Vol. 27, No. 2, pp. 58–64, Apr. 2020, https://doi.org/10.1109/mwc.001.1900346

Publisher
I. Dominik, “Advanced controlling of the prototype of SMA linear actuator,” Solid State Phenomena, Vol. 177, No. 1, pp. 93–101, Jul. 2011, https://doi.org/10.4028/www.scientific.net/ssp.177.93

Publisher
I. Dominik, “Implementation of the Type-2 fuzzy controller in PLC,” Solid State Phenomena, Vol. 164, pp. 95–98, Jun. 2010, https://doi.org/10.4028/www.scientific.net/ssp.164.95

Publisher
W. Huang, C. Yang, J. Ji, N. Chen, and K. Xu, “Study on obstacle avoidance of AGV based on fuzzy neural network,” in 2019 IEEE 3rd Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC), Oct. 2019, https://doi.org/10.1109/imcec46724.2019.8983996

Publisher
C.-L. Hwang, C.-C. Yang, and J. Y. Hung, “Path tracking of an autonomous ground vehicle with different payloads by hierarchical improved fuzzy dynamic sliding-mode control,” IEEE Transactions on Fuzzy Systems, Vol. 26, No. 2, pp. 899–914, Apr. 2018, https://doi.org/10.1109/tfuzz.2017.2698370

Publisher
V. F. Caridá, O. Morandin, and C. C. M. Tuma, “Approaches of fuzzy systems applied to an AGV dispatching system in a FMS,” The International Journal of Advanced Manufacturing Technology, Vol. 79, No. 1-4, pp. 615–625, Jul. 2015, https://doi.org/10.1007/s00170-015-6833-8

Publisher

Cited by

A Comprehensive Review of Recent Advances in Automated Guided Vehicle Technologies: Dynamic Obstacle Avoidance in Complex Environment Toward Autonomous Capability

Muhammad Aizat | Nurakasyah Qistina | Wan Rahiman

(2024)

Cognitive intelligence in industrial robots and manufacturing

Avishek Mukherjee | A.B. Divya | M. Sivvani | Surjya Kanta Pal

(2024)

Mechatronics – Industry-Inspired Advances

Patryk Bałazy | Paweł Knap | Szymon Podlasek

(2024)

2024 25th International Carpathian Control Conference (ICCC)

Patryk Balazy | Kamil Pieprzycki | Paweł Knap

(2024)

2024 25th International Carpathian Control Conference (ICCC)

Patryk Balazy | Szymon Podlasek

(2024)

Construction of Probability Graph Optimization Model on Basis of Localization Algorithm

Song Li | Qiuming Jiang

(2024)

Energy Consumption Analysis of the Selected Navigation Algorithms for Wheeled Mobile Robots

Adam Rapalski | Sebastian Dudzik

(2023)

Artificial intelligence, machine learning and deep learning in advanced robotics, a review

Mohsen Soori | Behrooz Arezoo | Roza Dastres

(2023)

Artificial Intelligence in Smart Logistics Cyber-Physical Systems: State-of-The-Arts and Potential Applications

Yang Liu | Xin Tao | Xin Li | Armando Walter Colombo | Shiyan Hu

(2023)

Accuracy Analysis of the Indoor Location System Based on Bluetooth Low-Energy RSSI Measurements

(2022)

About this article

Received

07 June 2021

Accepted

29 June 2021

Published

11 August 2021

DOI

https://doi.org/10.21595/rsa.2021.22109

Keywords

autonomous systems

smart sensors

neural networks

environmental understanding

This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Previous article in issue Previous Next article in issue Next

Research article

2024 03 02

A precise localization algorithm for unmanned aerial vehicles integrating visual-internal odometry and cartographer

Jiaqi Xu, Zhou Chen, Jie Chen, Jingyan Zhou, Xiaofei Du

Review article

2024 02 17

A review on positioning techniques of mobile robots

Jakub Semborski, Adam Idzkowski

Research article

2023 08 01

Challenges and opportunities for enhanced patient care with mobile robots in healthcare

Sushil Kumar Sahoo, Bibhuti Bhusan Choudhury

Research article

2023 06 25

A review on path planning ai techniques for mobile robots

Shrinivas Deshpande, Abhishek Kumar Kashyap, Bhumeshwar K. Patle

P. Bałazy, P. Gut, and P. Knap, “Positioning algorithm for AGV autonomous driving platform based on artificial neural networks,” Robotic Systems and Applications, Vol. 1, No. 2, pp. 41–45, Aug. 2021, https://doi.org/10.21595/rsa.2021.22109

Copy Extrica

Copied to clipboard!

TY  - JOUR
DO  - 10.21595/rsa.2021.22109
UR  - https://doi.org/10.21595/rsa.2021.22109
TI  - Positioning algorithm for AGV autonomous driving platform based on artificial neural networks
T2  - Robotic Systems and Applications
AU  - Bałazy, Patryk
AU  - Gut, Paweł
AU  - Knap, Paweł
PY  - 2021
DA  - 2021/08/11
PB  - JVE International Ltd.
SP  - 41-45
IS  - 2
VL  - 1
SN  - 2669-2473
ER  - 

Copy Ris

Copied to clipboard!

@article{Ba_azy_2021,
	doi = {10.21595/rsa.2021.22109},
	url = {https://doi.org/10.21595/rsa.2021.22109},
	year = 2021,
	month = {aug},
	publisher = {{JVE} International Ltd.},
	volume = {1},
	number = {2},
	pages = {41--45},
	author = {Patryk Ba{\l}azy and Pawe{\l} Gut and Pawe{\l} Knap},
	title = {Positioning algorithm for {AGV} autonomous driving platform based on artificial neural networks},
	journal = {Robotic Systems and Applications}
}

Copy Bibtex

Copied to clipboard!

[1]P. Bałazy, P. Gut, and P. Knap, “Positioning algorithm for AGV autonomous driving platform based on artificial neural networks,” Robotic Systems and Applications, vol. 1, no. 2, pp. 41–45, Aug. 2021, doi: 10.21595/rsa.2021.22109.

Copy IEEE

Copied to clipboard!

Bałazy, Patryk, Paweł Gut, and Paweł Knap. “Positioning Algorithm for AGV Autonomous Driving Platform Based on Artificial Neural Networks.” Robotic Systems and Applications 1, no. 2 (August 11, 2021): 41–45. https://doi.org/10.21595/rsa.2021.22109.

Copy Chicago

Copied to clipboard!

Positioning algorithm for AGV autonomous driving platform based on artificial neural networks

Abstract

Highlights

1. Introduction

2. Environment model

3. Reward function

4. Training algorithm/result

5. Conclusions

References

Cited by

About this article

Related Articles