Mobile QR Code QR CODE

2025

Reject Ratio

81.5%

Main Menu

※ The user interface design of www.ieiespc.org has been recently revised and updated. Please contact inter@theieie.org for any inquiries regarding paper submission.

Journal Search

IEIESPC(IEIE Transactions on Smart Processing and Computing)

IEIESPC Vol. 15, No. 3, p.426-437

ISSN (online) :

2287-5255

Received : 26 December 2023Revised : 15 August 2024Revised : 13 October 2024Accepted : 12 May 2025

DOI :

10.5573/IEIESPC.2026.15.3.426

Regular Paper

Electric Power Steering System Identification Using Artificial Neural Network for Autonomous Vehicles

(Rodi Hartono) ^1,³ (Hyun Rok Cha) ² (Hee Tae Chung) ¹ (Kyoo Jae Shin) ^1,^*

(Department of Artificial Intelligence Convergence, Busan University of Foreign Studies, Busan 46234, South Korea. rodihartonoo@gmail.com, {htchung, kyoojae}@bufs.ac.kr)
(Automotive Materials and Components Research and Development Group, Korea Institute of Industrial Technology, Gwangju 61012, South Korea. hrcha@kitech.re.kr)
(Department of Engineering and Computer Science, Universitas Komputer Indonesia, Bandung 40132, Indonesia)

^*Corresponding Author : Kyoo Jae Shin, kyoojae@bufs.ac.kr

License :

This is an Open-Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.(www.theieie.org).

Abstract

The steering system is crucial for autonomous vehicles to accurately convert angle inputs into motion trajectories. Among the types of steering systems, electric power steering (EPS) has gained widespread adoption due to its superior reliability, safety, and efficiency compared to hydraulic systems. Precise steering control necessitates a deep understanding of EPS dynamics, typically achieved through mathematical modeling or system identification (SI). However, traditional modeling methods often fall short in capturing the complex nonlinear behavior of EPS. To address this limitation, we propose an artificial neural network (ANN) model trained using backpropagation (BP) to represent the dynamic characteristics of EPS. The ANN is trained on real-world data collected from a physical EPS system. Extensive testing on diverse datasets demonstrates the exceptional performance of our proposed model, achieving a remarkable fit of over 99.6% with measured data. This significant improvement over conventional methods highlights the potential of the ANN-based BP model as a superior approach for SI in EPS.

Keywords

Electric power steering, System identification, Artificial neural network, Backpropagation, Nonlinear behavior

1. Introduction

The role of the steering system in autonomous vehicles is important to translate the angle input from the sensing navigation segment into desired motion trajectories to ensure road traffic safety and a dynamic environment. In recent years, electric power steering (EPS) systems have gained popularity in modern and autonomous vehicles due to their superior attributes including reliability, safety, efficiency, and easier to control compared to hydraulic systems ^[1, ^2]. An EPS system is driven by an electric motor that provides torque. Commonly, a brushless direct current motor (BLDCM) or a permanent magnet synchronous motor (PMSM) is used as an EPS actuator that is connected to the vehicle wheels by the reduction gear and the mechanical steering rack ^[3].

In order to regulate and ensure the EPS of the vehicle responds as intended, we need to apply a control to the EPS system. To accomplish that, a controller requires knowledge about the dynamics of the EPS system and this knowledge can be captured in the form of a mathematical model ^[4, ^5]. Developing a model can be done using physics and the first principles where the mathematical equations are written out based on the understanding of the system dynamics, or it can be done by using data and fitting a model to that data with a process called system identification (SI) ^[5]. Accurate SI in EPS domains enables the estimation of key parameters such as mechanical stiffness, damping, and motor characteristics, which are essential for ensuring precise and reliable steering responses. Due to the non-linearity of the EPS system, a nonlinear SI approach needs to take place to find a model structure. Consequently, the traditional linear parameter system identifier that only handles measurable and bounded nonlinearities will cause poor dynamic performance or system instability ^[6, ^7].

With the development of modern SI and control theory technology, many machine learning and artificial intelligence (AI) approaches have been applied to bridge this complex nonlinear behavior. One of the methods by using an artificial neural network (ANN). The ANN is a method based on a biological prototype of the human brain and a type of adaptive and data-driven technique that can effectively determine the relationships between the input factors and the plant outputs. ANN can establish both linear and nonlinear relationships without making assumptions based on the activation function’s generalization capabilities. ^[8- ^16].

Despite the potential and ability of ANN as an identifier, in this study, we proposed the ANN-based backpropagation (BP) algorithm to model the dynamic of the EPS system with its nonlinearities behavior and analyze Its effectiveness. The SI datasets for training, validation and tests were obtained from a car’s steering rack with a 3-phase 600W BLDCM. Six-step trapezoidal commutation algorithm is used to rotate the motor to specific positions. Some input signals were given and the EPS response output was recorded.

This paper follows a structured approach to comprehensively explore EPS SI using ANN. In Section 2, we introduce the basic EPS system setup and problem formulations. This includes an explanation of how the EPS system is configured and an in-depth exploration of the ANN method’s concept, which is integral to this study. Section 3 focuses on the crucial step of identifying the EPS system using an ANN-based BP algorithm. Here, we explain the methodology and the proposed model design. In Section 4, we bridge theory and practice by detailing the implementation of the proposed EPS ANN SI. We discuss the steps involved in preparing EPS data for acquisition, simplifying data preprocessing, outlining the training procedures, and explaining how we evaluate the model’s performance. Moving to Section 5, we analyze the results obtained from the EPS ANN SI presented in this research. This section provides a closer look at the findings, interpreting and explaining the performance and behavior of the identified system straightforwardly. In this section, we also present a linear EPS TFE SI as a comparison. Finally, Section 6 summarizes the conclusions.

2. Basic Configurations and Problem Formulations

2.1. The Configuration of EPS System

An autonomous vehicle (AV) system utilizes a combination of sensors, including cameras, lidar, radar, and other perception systems. These sensors gather real-time data about the surrounding environment, road conditions, and potential obstacles. This sensor data is processed by the vehicle control unit segment and used to generate a comprehensive understanding of the vehicle’s position, motion, and desired trajectory. Based on the navigation unit sensors and a pre-determined navigation plan, advanced control algorithms are employed to calculate the optimal steering commands. In the EPS system unit after received a command, the steering control unit (SCU) calculates the required torque for the electric motor, which directly applies torque to the steering rack. The structure of the EPS system and mechanism can be seen in Fig. 1.

Fig. 1. The EPS system and mechanism in an AV.

2.2. ANN Method Concept

Characterizing and identifying systems is a fundamental task in systems theory, where the former involves establishing the mathematical representation of a system. This means the main challenge in SI entails determining an appropriate model structure of the experimental data to be a representation of the real system. Creating a model that can capture the dynamic behavior of the system can be done through an understanding of the essential elements of the system. And since we do not know the essential element or parameter of the physical system, we can’t write the model directly using the first principle and physics. In this situation, a black-box approach can be used. The black-box SI approach is basically constructed as a suitable identification model as shown in Fig. 2, which is subjected to the same input $u(t)$ as the plant, produce an output $\hat{y}(t)$ that approximates $y(t)$ according to desire sense, and $e(t)$ is the error between desire sense and the approximates output.

Fig. 2. General SI scheme.

The formulation of the SI model can be categorized into linear and nonlinear. In nonlinear SI, the relationship between the input $u(t)$ and the output $y(t)$ of the system can be expressed using nonlinear equations. The general form of a nonlinear SI can be written as

(1)

$ y(t) = f(u(t),\theta) +e(t), $

where $y(t)$ is the system output at time $t$, $u(t)$ denotes the input to the system at time $t$, $\theta$ represents the vector of unknown parameters that need to be estimated, $f$ is a known set of functions that relate the input and parameters to the output, and $e(t)$ represents the model error at time $t$.

This study encountered EPS with its nonlinearities behavior and proposed a nonlinear method in SI of its dynamic. The objective is to achieve a realization or approximation of the underlying dynamics $f$ by using ANN. To accomplish this, let’s introduce the input vector $v(t)$ for the network which is a confined number of the past inputs and the output as

(2)

$ v(t) = [y(t -1)\cdots y(t -n_x)u(t -1)\cdots u(t -n_y)]^T . $

In practical applications, these decisions are often influenced by the availability of prior information pertaining or training set to the plant under identification that can be written as

(3)

$ D^N = \{[y(t), v(t)] \mid t = 1,...,N\}. $

Based on the data, we deduce the correlation or connection as

(4)

$ \hat{y}(t) = f(v(t)), $

where $f$ is a function deduced from Eq. (2). In the presence of unknown physical parameters within the system, we can examine the relationship between $y$ and $v$.

The equation representation in Eq. (1) is simplified of a general non-linear system of the NARMAX model as

(5)

$ y(t) = f(y(t -1),~y(t -2),...,~y(t -n),\nonumber\\ \hskip 4pc u(t -1),~u(t -2),...,~u(t -m),\nonumber\\ \hskip 4pc e(t -1),~e(t -2),...,~e(t - p),~\theta), $

where $y(t -1), y(t -2), ..., y(t -n)$ denotes the past output values, $u(t -1), u(t -2), ..., u(t -m)$ represents the past input values, and $e(t -1), e(t -2), ..., e(t - p)$ denotes the past error or residual values. The output $y(t)$ on Eq. (1) is a function of the current input $u(t)$, the unknown parameters $\theta$, and an error term $e(t)$. This equation suggests a direct relationship between the current input and output, potentially with some error or noise. On the other hand, Eq. (5) represents a more complex nonlinear model that the output $y(t)$ depends not only on the current input $u(t)$ and the unknown parameters $\theta$ but also on the past outputs $y(t -1)$, $y(t -2), ..., y(t -n)$, and the past inputs and errors. This equation incorporates a memory or feedback mechanism, allowing for the consideration of the system’s history and potentially capturing dynamic or temporal dependencies.

In the context of SI, an ANN is used to approximate the underlying dynamics or relationship between input and output variables of a system based on observed data. It serves as a flexible and adaptive model that can capture nonlinearities and complex interactions in the system’s behavior. Learning processes in ANN refer to the mechanisms by which ANN adapts and improves their performance based on the input data and desired outputs. In Fig. 3, we can observe an illustration depicting the process of learning and error correction in an ANN.

Fig. 3. Illustrating the learning process and error correction: (a) An ANN with one output, (b) neuron output signal flow graph, and (c) flow on forward and backward propagation.

Assuming the network depicted in Fig. 3(a) comprises a single hidden layer, with inputs represented by the vector $x(z)$, and an output layer containing a neuron labeled $k$. This neuron in the output layer is driven by a signal vector $h(z)$ generated by the previous layer, and $y_k(z)$ corresponds to the predicted output associated with this configuration. In this context, the variable $z$ signifies a discrete time step, representing the iterative stages involved in adjusting the interconnection weights of the neurons within the network.

Now, let’s examine a simpler scenario involving a neuron-$k$ located in the output layer, serving as the only computational neuron, with a designated signal vector $h(z)$ as shown in detail in Fig. 3(b). After the signals have undergone forward propagation, the output of the ANN is compared to a target output represented by $g_k(z)$. This comparison leads to the derivation of an error signal, labeled as $e_k(z)$. By definition, this error signal quantifies the difference between the ANN’s output and the desired target output. Consequently, the error output signal at the output neuron-$k$ during the $z$th iteration or time step is formally expressed as

(6)

$ e_k(z) = g_k(z)-y_k(z). $

To ensure that errors in both positive and negative directions are treated equally, the error needs to be square, we thus have

(7)

$ E_k(z) = e_k^2 . $

The learning process entails iteratively modifying the weights of the network’s connections. The term iteration is defined as a complete cycle of calculations (forward and backward passes) as illustrated in Fig. 3(c). This objective is to achieve a minimize of error criterion by minimizing a cost function. Normalizing the sum of squared error with respect to the set size $N$, we get

(8)

$ \delta_{ave} = \frac{1}{N} \sum_{k=1}^N e_k^2(z). $

According to the BP algorithm, applied a weight’s correction $\Delta W_{kN}(z)$ to the weight $W_{kN}(z)$ which is proportional to the partial derivative $E_k(z)$ with respect to $W_{kN}(z)$, we get

(9)

$ \frac{\partial E_k(z)}{\partial W_{kN}(z)} = \frac{\partial E_k(z)}{\partial y_k(z)} \frac{\partial y_k(z)}{\partial u_k(z)} \frac{\partial u_k(z)}{\partial W_{kN}(z)} . $

The $W_{kN}(z)$ is the weight of the neurons in between the output layer and hidden layer (see Fig. 3(b)) and $\Delta W_{kN}(z)$ is the tuning of the weight $W_{kN}$ at time step $z$. The correction $\Delta W_{kN}(z)$ applied to $W_{kN}(z)$ is defined by

(10)

$ \Delta W_{kN}(z) = -\eta \frac{\partial E_k(z)}{\partial W_{kN}(z)} , $

where $\eta$ is learning rate. Having computed the weight adjustment $\Delta W_{kN}(z)$, the updated value of the weight $W_{kN}(z)$ is defined by

(11)

$ W_{kN}(z+1) = W_{kN}(z) +\Delta W_{kN}(z), $

where $W_{kn}(z+1)$ described a new weight.

3. Proposed SI of EPS using ANN-based BP algorithm

The process flow for characterizing the EPS system using SI through ANN and the proposed ANN structure for the EPS SI in this research are describes in Figs. 4 and 5, respectively.

Fig. 4. Process of SI of the EPS system using ANN.

The approach begins with collecting data from the EPS physical hardware system. The data includes the variables steering angle input and the EPS output response. The collected data is then preprocessed to reduce noise, remove inconsistencies, and normalize the data to a consistent range. The next step is to choose the ANN architecture that is suitable for the system’s complexity. This includes considering the number of input nodes, hidden layers, and neurons. The ANN was trained using BP algorithms to adjust the weights and biases. The training is performed iteratively using the training dataset. To evaluate its performance a separate validation dataset was used. The validation dataset is distinct from the training data and helps assess how well the model generalizes to new data by comparing the predictions with the actual system responses. Generalization testing is performed with new data to ensure that the model is applicable in real-world contexts.

Fig. 5. Proposed ANN structure for EPS SI .

Fig. 5 shows the output of the networks becomes a function of past inputs and past outputs. We can see the multi-inputs $r(z), r(z-1), ..., r(z-n)$ and $y(z-1), y(z-2), ..., y(z-m)$, where $r(z), r(z-1), ..., r(z-n)$ is the main neurons input, $y(z-1), y(z-2), ..., y(z-m)$ is the inputs from feedback’s output plant. The output $y_k(z)$ is the desired output for the ANN identifier output $\hat{y}_k(z)$. The notation $i$, $j$, and $o$ in $W_{ji}^1$, $W_{jl}^1$ and $W_{oj}^2$ represents different weight within the network. The flow of signals in the network occurs from left to right, with neuron $j$ positioned in a layer to the right of neuron $i$ and $l$, and neuron $o$ positioned in a layer to the right of neuron $j$. Specifically, neuron $j$ corresponds to a neuron in the hidden layer, while neuron $o$ represents the single neuron of the output layer.

The feedforward process in the network begins by multiplying the input of each neuron $i$ and $l$ with their corresponding weights $W_{ji}^1$ and $W_{jl}^1$, respectively. The products are then propagated to each neuron $j$ in the hidden layer. Within each neuron $j$, the calculated values from all inputs are combined through summation, where each input is multiplied by its corresponding weight. The summation result in each neuron $j$ is then subjected to an activation function, denoted as $f$. The output of the $f$ from each neuron $j$ is then multiplied by its corresponding weight of $W_{oj}^2$, and the summation of all resulting products yields the network’s output $\hat{y}_k(z)$.

Mathematically, we can write the output $\hat{y}_k(z)$ as

(12)

$ \hat{y}_k(z) = \left( \sum_{j=1}^J W_{oj}^2 h_j(z) \right), $

where

(13)

$ h_j(z) = f \left( \sum_{i=0}^n W_{ji}^1 r(z-i) + \sum_{l=1}^m W_{jl}^1 y(z-l) \right). $

In a standard ANN configuration, each neuron is associated with weights and biases. Biases help introduce flexibility in the model by shifting the activation function and allowing the model to capture nonlinear relationships in the data. Biases also enable the model to learn an intercept term, which is particularly useful when there is a non-zero mean or bias in the data. However, some data may not require biases to be modeled effectively, especially if the data is already centered around zero. This study does not use biases to model the data because the data has a natural centering around zero. Removing biases also has the benefit of reducing the number of parameters in the model, leading to a simpler network with fewer computations.

Fig. 6. Proposed training algorithm of the SI.

To explore various possibilities and find the best ANN structure for SI, we conducted several tests. We varied the number of samples and types, the number of input variables ($n$th and $m$th order), and the number of neurons in the hidden layer. The number of hidden layers were limited to one, and the activation function used was the tanh activation function. The details of the test configurations and their results are provided in Table 2, and Fig. 6 describes the proposed training algorithm of the SI.

The ANN first performs a feedforward calculation to generate the output. Then, the loss is calculated between the predicted output and the target output. The BP algorithm is used to update the weights and biases of the ANN. This process is repeated iteratively until the model achieves the desired level of performance on the training data.

Mathematically, we can define the loss function using Eq. (7). Next, to define the partial derivatives of $\frac{\partial E_k(z)}{\partial W_{oj}^2(z)}$, $\frac{\partial E_k(z)}{\partial W_{ji}^1(z)}$, $\frac{\partial E_k(z)}{\partial W_{jl}^1(z)}$ are as follows:

(14)

$ \frac{\partial E_k(z)}{\partial W_{oj}^2(z)} = \frac{\partial E_k(z)}{\partial \hat{y}_k(z)} \frac{\partial \hat{y}_k(z)}{\partial W_{oj}^2(z)} , $

(15)

$ \frac{\partial E_k(z)}{\partial W_{ji}^1(z)} = \frac{\partial E_k(z)}{\partial \hat{y}_k(z)} \frac{\partial \hat{y}_k(z)}{\partial h_j(z)} \frac{\partial h_j(z)}{\partial W_{ji}^1(z)} , $

(16)

$ \frac{\partial E_k(z)}{\partial W_{jl}^1(z)} = \frac{\partial E_k(z)}{\partial \hat{y}_k(z)} \frac{\partial \hat{y}_k(z)}{\partial h_j(z)} \frac{\partial h_j(z)}{\partial W_{jl}^1(z)} , $

and to updates the weights are as follows:

(17)

$ W_{oj}^2(z+1) = W_{oj}^2(z)-\eta \frac{\partial E_k(z)}{\partial W_{oj}^2(z)} , $

(18)

$ W_{ji}^1(z+1) = W_{ji}^1(z)-\eta \frac{\partial E_k(z)}{\partial W_{ji}^1(z)} , $

(19)

$ W_{jl}^1(z+1) = W_{jl}^1(z)-\eta \frac{\partial E_k(z)}{\partial W_{jl}^1(z)} . $

4. Implementation of the Proposed EPS ANN SI

4.1. EPS Data Acquisition Preparation

In order to provide datasets for EPS SI using ANN, the commutation code that has been selected and its sequence algorithm that had been uploaded to a microcontroller is performed. The microcontroller is connected to an upper computer through serial communication to record the motor response based on some input signal given. For the code generation, we use MATLAB/Simulink integrated with NXP model-based design toolbox (MBDT). To control the motor speed is done by a PWM technique. The 12VDC of a maximum input voltage is given to rotate the motor due to the motor selection that meets the vehicle voltage environment. The actual motor angle position as feedback is measured by a magnetic position sensor AS5247U through the index pulse output-A/B index (ABI) interface. The EPS steering input command is given by the upper computer and at the same time the data acquisition is recorded every 0.032 seconds. This experimental data collection scheme can be seen in Fig. 7.

Fig. 7. Experimental data collection scheme.

Fig. 8 illustrates the experimental setup, including all hardware and components involved in data recording. The upper computer handles data recording via UART communication using FreeMASTER software. A 50A power supply provides power to the microcontroller, power inverter, and BLDCM. Additionally, Fig. 9 shows the EPS MATLAB/Simulink design block code, which reflects the implementation of the data acquisition scheme for recording.

Fig. 8. Experimental hardware and components setup for data recording.

Fig. 9. EPS MATLAB/Simulink design block code for EPS data recording.

4.2. EPS Data Acquisition Results and Data Preprocessing

The datasets for training, validation and test are recorded from the real EPS as described previously, the form of step variations, sine wave, square wave, and triangular wave were recorded for the EPS SI in this research. These datasets configurations are shown in Table 1.

Table 1. The training, validation, and test datasets for SI.

Input signal	Record’s duration [sec]	Number of data samples	Remark
Step variations	63.42	1982	For training data
Step variations-2	38.08	1190	For validation data
Sine wave	56.77	1774	For testing data
Square wave	79.65	2489	For testing data
Triangular wave	59.81	1869	For testing data

In the further step, the datasets that had been taken above are then normalized with min-max scaling in the $\pm 1$ range of scale. It is determined that the data in range from 0 to $+1$ is the clockwise direction data and the data range below 0 to $-1$ is the counterclockwise direction data. Data normalization is performed to help improve the convergence and stability of optimization algorithms. The choice of normalization method in this research mathematically can be written as

(20)

$ x_{out} = (b-a) \frac{x-\min x}{\max x-\min x} +a, $

where $a$ and $b$ are the desired range which is $a$ is $-1$ and $b$ is 1, $x$ is the scalar that needs to be scaled, $\min x$ or $\max x$ is the minimum and maximum of the vectors data.

4.3. Training Procedures and Model Evaluation

In this research, MATLAB Editor was used to create the code for training, validating, and testing the ANN model. MATLAB Editor is a powerful scientific computing tool with advanced debugging capabilities. To explore various ANN structure possibilities and understand the complex performance trade-offs, several tests were conducted. The details of the test configurations are provided in Table 2.

Table 1 shows that the first dataset was used for training, the second dataset was used for validation, and the remaining three datasets were used for testing. We conducted two sets of learning iterations, each with a different number of times that the optimization algorithm updated the model’s parameters. We also selected two different values of the learning rate, which determines the step size or the rate at which the algorithm updates the model’s parameters in the direction of the steepest descent of the loss function.

Introduced the term "model order", which refers to the number of recurrent units or time steps used in this proposed network architecture. For example, in a simple ANN with a model order of 1, the network only considers the current input and the information from the previous time step to predict the output at the next time step. In contrast, if the model order is set to 5, the ANN will take into account the current input and the information from the past five steps to make predictions. Ultimately, three sets of experiments with different numbers of neurons in the hidden layer to investigate the effect on generalization capabilities were conducted. These experiments were designed to assess the performance and generalization capabilities of the proposed ANN structure.

The ANN was trained using the data collection of step variations. During the training phase, the model learned from the training set to make predictions and minimize prediction errors using BP techniques. this process helps to obtain global optimal network parameters. After training the ANN to produce a model based on the training dataset, it is essential to evaluate its performance on a separate validation dataset. The validation dataset is distinct from the training data and helps assess how well the model generalizes to new data. Next, the model is tested on some unseen data, known as the test datasets. This final evaluation provides a realistic estimation of how well the model will perform in real-world scenarios. The different types of datasets are described in detail in Table 1 and the final matrices of the weighs after the training process are summarized as follows:

(21)

$ W_{ji}^1 = \begin{bmatrix} W_{11}^1 & W_{12}^1 & W_{13}^1 & W_{14}^1 & W_{15}^1 \\ W_{21}^1 & W_{22}^1 & W_{23}^1 & W_{24}^1 & W_{25}^1 \\ W_{31}^1 & W_{32}^1 & W_{33}^1 & W_{34}^1 & W_{35}^1 \end{bmatrix}\nonumber\\ = \begin{bmatrix} -0.3765 & -0.2851 & -0.3549 & -1.7821 & -0.8568 \\ 0.1864 & 0.0866 & -0.2532 & -0.2622 & 0.5646 \\ -0.0581 & 0.0363 & 0.0363 & -0.4767 & -0.7341 \end{bmatrix}, $

(22)

$ W_{jl}^1 = \begin{bmatrix} W_{11}^1 & W_{12}^1 & W_{13}^1 & W_{14}^1 & W_{15}^1 \\ W_{21}^1 & W_{22}^1 & W_{23}^1 & W_{24}^1 & W_{25}^1 \end{bmatrix}\nonumber\\ = \begin{bmatrix} 0.5453 & 0.8190 & -0.5027 & 0.7813 & -0.1826 \\ 0.5756 & 0.0507 & -0.4942 & -1.9341 & 0.2997 \end{bmatrix}, $

(23)

$ W_{oj}^2 = \begin{bmatrix} W_{11}^2 \\ W_{21}^2 \\ W_{31}^2 \\ W_{41}^2 \\ W_{51}^2 \end{bmatrix} = \begin{bmatrix} 0.7149 \\ 0.7032 \\ -0.7104 \\ 0.1242 \\ -0.4050 \end{bmatrix}. $

Next, after receiving the predicted model results, we can calculate the percentage of the model’s best fits, also known as the goodness of fit, between the EPS model’s predictions and the EPS actual measured data. This can be done using the MSE concept. In the context of SI, the model’s predicted output values are compared to the actual measured output values, and the MSE is computed to quantify the differences between the two.

Fig. 10. Performance evaluation algorithm for a predictive model.

Lower values of MSE indicate a better fit, meaning the model’s predictions closely match the measured data. The percentage of best fits can be calculated based on the ratio of the minimized error MSE for the model to the total variance of the measured output data. The percentage indicates how well the model captures the variation in the data. A higher percentage indicates a better fit and a closer match between the model and the actual data. This performance evaluation algorithm for the predictive model is shown in Fig. 10.

5. Experimental Results

5.1. Analysis of the EPS TFE System

One common linear method used for SI is the TFE. This method is used to model and estimate the dynamics of a linear time-invariant (LTI) system, where the relationship between inputs and outputs remains constant over time and is unaffected by changes in the system’s operating conditions. TFE provides a mathematical representation of the system’s input $U(s)$-output $Y(s)$ relationship in the frequency domain.

(24)

$ H(s) = \frac{Y(s)}{U(s)} = \frac{\omega_n^2}{s^2 +2\zeta\omega_ns+\omega_n^2} , $

where $\omega_n$ is the natural frequency and $\zeta$ is the damping ratio.

The same provided angle position observations data as described in Table 1 in this research are given to validate, and test the model produced by the TFE. To measure the model’s best fits, we also use the MSE method. The percentage of best fits are calculated based on the ratio of the minimized error MSE for the model to the total variance of the measured output data. This TFE resulted 87.6%, 92.1%, and 87.3% of best fits input-output model of given input signals of sine wave, square wave, and triangular wave, respectively. Fig. 11 shows the performance results between EPS actual observe outputs and the transfer function model’s estimation.

Fig. 11. The time response results for original EPS and simplified 2nd order TFE model (with structure: number of poles is 2, number of zero is 0, search method through gradient descent with maximum iterations of 100 and tolerance of 0.1): (a) Sine wave response, (b) square wave response, and (c) triangular response.

5.2. Analysis of the EPS ANN Results

Selecting an ANN structure requires careful consideration of factors such as task complexity, data characteristics, overfitting, computational resources, training time and generalization capabilities. All configurations described in Table 2 have a fast-training time process, but the setting structure number 11 is a good construction because it strikes a balance between model complexity and the problem’s requirements. It has a fast-training time and is able to achieve good generalization performance. The test model’s result for setting structure number 11 at learning rate of 0.1 is shown in Fig. 12.

Fig. 12. The time response results for original EPS and identified ANN model (network topology 5-5-1, learning iteration=100, and $\eta = 0.1$): (a) Sine wave response, (b) square wave response, and (c) triangular response.

Based on the test results, the best-fit model in this SI study was achieved using an artificial neural network (ANN) with design configuration number 11, as shown in Fig. 12. This configuration, detailed in Table 2, features a network topology of 5-5-1 and a learning rate of 0.1. At both learning rates of 0.1 and 0.01, the model achieved an optimal fit, with predictions matching the actual measured data by over 99.6% across all testing data.

The data in Table 2 also highlight the importance of selecting an appropriate model order, as it significantly impacts the ANN’s ability to capture temporal dependencies and make accurate predictions. A higher model order allows the ANN to consider longer-term dependencies in the data, but it also increases computational complexity and the risk of overfitting, where the model memorizes noise rather than learning general patterns. Configuration number 11 uses a model order of 2, which strikes a good balance between capturing dependencies and avoiding overfitting.

The selection of the number of iterations is also crucial. It controls the computational process and balances the trade-offs between computation time and solution accuracy. For example, comparing data from configurations 1 and 10, we see that at 100 iterations, the optimization algorithm had sufficient time to update the model’s parameters and enhance its performance. Finally, choosing an appropriate learning rate is essential. A higher learning rate allows the optimization algorithm to make larger updates to the model’s parameters, potentially leading to faster convergence during training. However, if the learning rate is too high, the optimization process may overshoot the optimal parameters and fail to converge. Conversely, a very low learning rate may result in slow convergence and extended training times.

Table 2. Performance evaluation of predicted outputs of the EPS ANN SI under various configuration.

No	Learning iterations	System order	Neurons in hidden layer	Test data
				Sine wave		Square wave		Tiangular wave
				Best-fit at $\eta$ 0.1 (%)	Best-fit at $\eta$ 0.01 (%)	Best-fit at $\eta$ 0.1 (%)	Best-fit at $\eta$ 0.01 (%)	Best-fit at $\eta$ 0.1 (%)	Best-fit at $\eta$ 0.01 (%)
1	10	2	3	94.9	97.8	96.0	98.1	95.4	98.0
2			5	98.7	97.4	99.7	97.7	98.9	97.8
3			9	64.1	97.9	74.4	99.1	65.9	98.1
4		3	3	95.5	94.7	96.3	95.4	15.3	77.1
5			5	86.0	98.4	89.6	99.1	67.5	-615.4
6			9	98.4	98.9	98.8	98.5	14.5	-233.6
7		5	3	58.3	90.5	68.0	89.7	67.4	-63.6
8			5	96.5	92.3	96.2	93.0	84.3	55.4
9			9	83.6	96.1	96.2	95.3	31.0	-219.1
10	100	2	3	99.4	99.3	99.6	99.8	99.5	99.4
11			5	99.7	99.7	99.9	99.9	99.8	99.8
12			9	99.3	99.5	99.4	99.9	99.4	99.7
13		3	3	99.2	99.3	99.5	99.8	78.1	-164.8
14			5	99.6	99.5	99.9	99.8	78.1	-107.3
15			9	99.6	99.5	99.7	99.9	-135.9	45.5
16		5	3	98.9	97.9	99.3	98.2	57.3	46.2
17			5	98.3	99.0	99.3	99.3	-8.6	0.8
18			9	96.9	99.6	96.8	99.6	7.7	-666.6

6. Conclusion

The research emphasizes the effectiveness of employing ANN-based BP algorithms for modeling EPS system dynamics. Through meticulous analysis, it demonstrates the ANN’s superior performance in capturing nonlinear behavior compared to traditional linear methods and alternative modeling approaches like TFE. The chosen ANN architecture strikes a balance between model complexity and accuracy, achieving over 99.6% fit with measured data. This significantly contrasts with the TFE, which, for the same input signals, yielded significantly lower accuracies: 87.6%, 92.1%, and 87.3%. This comparison underscores the substantial advantage of ANN-based methods for accurately modeling the intricate dynamics of EPS systems.

Acknowledgment

This research was supported by the MSIT (Ministry of Science and ICT), Korea, under the ICAN (ICT Challenge and Advanced Network of HRD) program (IITP-2023-2020-0-01825) supervised by the IITP (Institute of Information & Communications Technology Planning & Evaluation).

References

B. Paden , M. Cáp , S. Z. Yong , D. Yershov , E. Frazzoli , A survey of motion planning and control techniques for self-driving urban vehicles, IEEE Transactions on Intelligent Vehicles, Vol. 1, No. 1, pp. 33-55, 2016

M. Yildirim , M. Polat , H. Kürüm , A survey on comparison of electric motor types and drives used for electric vehicles, Proc. of 2014 16th International Power Electronics and Motion Control Conference and Exposition, pp. 218-223, 2014

A.-M. Nicorici , M. Ruba , C. S. Marţiş , L. Szabó , Z. Máthé , Comparative analysis of permanent magnet synchronous machines designed for electric power steering applications, 2020 XI International Conference on Electrical Power Drive Systems, pp. 1-6, 2020

J. A. K. Suykens , J. Vandewalle , , Non-linear Modelling: Advanced Black-box Techniques, pp. 209, 1998

L. Ljung , , System Identification: Theory for the User, pp. 1, 1999

K. S. Narendra , K. Parthasarathy , Identification and control of dynamical systems using neural networks, IEEE Transactions on Neural Networks, Vol. 1, No. 1, pp. 4-27, 1990

T. Yamada , T. Yabuta , Dynamic system identification using neural networks, IEEE Transactions on Systems, Man, and Cybernetics, Vol. 23, No. 1, pp. 204-211, 1993

A. Haddoun , M. E. H. Benbouzid , D. Diallo , R. Abdessemed , J. Ghouili , K. Srairi , Comparative analysis of estimation techniques of SFOC induction motor for electric vehicles, Proc. of 2008 18th International Conference on Electrical Machines, pp. 1-6, 2008

L. Fu , P. Li , The research survey of system identification method, Proc. of 2013 5th International Conference on Intelligent Human-Machine Systems and Cybernetics, pp. 397-401, 2013

S. Chen , S. A. Billings , Neural networks for non-linear dynamic system modelling and identification, International Journal of Control, Vol. 56, No. 2, pp. 319-346, 1992

A. Yazdizadeh , K. Khorasani , Identification of a class of nonlinear systems using dynamic neural network structures, Proc. of the International Conference on Neural Networks, Vol. 1, pp. 194-198, 1997

K. Chernyshov , A non-linear MIMO system identification approach based on the multiple maximal correlation technique, Proc. of 2021 International Russian Automation Conference, pp. 950-955, 2021

M. M. Quamar , M. Aldhaifallah , Instrumental variable system identification for time-delayed system with non-integer time-delay, Proc. of 2020 17th International Multi-Conference on Systems, Signals and Devices, pp. 97-102, 2020

Y. Naung , A. Schagin , H. L. Oo , K. Z. Ye , Z. M. Khaing , Implementation of data driven control system of DC motor by using system identification process, Proc. of 2018 IEEE Conference of Russian Young Researchers in Electrical and Electronic Engineering, pp. 1801-1804, 2018

Data-driven control and learning systems, IEEE Transactions on Industrial Electronics, Vol. 64, No. 5, pp. 4070-4075, 2017

C. Wang , M. Wang , T. Liu , D. J. Hill , Learning from ISS-modular adaptive NN control of nonlinear strict-feedback systems, IEEE Transactions on Neural Networks and Learning Systems, Vol. 23, No. 10, pp. 1539-1550, 2012

Rodi Hartono

Rodi Hartono received his B.S. degree in electrical engineering from Universitas Komputer Indonesia, Indonesia, in 2010 and his M.S. degree in electrical engineering from Institut Teknologi Bandung, Indonesia, in 2014. Since 2021, he has been pursuing a Ph.D. degree at the Department of Artificial Intelligence Convergence, Busan University of Foreign Studies, Republic of Korea. (E-mail: rodihartonoo@gmail.com)

Hyun Rok Cha

Hyun Rok Cha received his Ph.D. degree in physics from the Tokyo Institute of Technology, Tokyo, Japan, in 2009. He was with the Samsung Electronics Research Center, Gwangju, for a period of four years. Since 2004, he has been a Senior Researcher with the Automotive Materials & Components R&D Group, Korea Institute of Industrial Technology. He is also a current executive managing director of the Seonam division in the Korea Institute of Industrial Technology and a professor at the University of Science and Technology. His research interests include E-mobility, electric vehicle platforms, smart vehicle control, hybrid-powered drones, and power electronics. (E-mail: hrcha@kitech.re.kr)

Hee Tae Chung

Hee Tae Chung received his M.S. and Ph.D. degrees in electronic engineering from Kyungpook National University, Daegu, Korea, in 1988 and 1996, respectively. Between 1996 and 1997, he worked as a Patent Examiner at the Korean Industrial Property Office. Currently, he is a professor of electronic and robot engineering at Busan University of Foreign Studies, Korea. His current research interests include the application of intelligent control to robot systems, adaptive control, and deep learning with neural networks. (E-mail: htchung@bufs.ac.kr)

Kyoo Jae Shin

Kyoo Jae Shin is a professor of intelligent robot science at Busan University of Foreign Studies, South Korea. He is the director of Future Creative Science Research Institute at BUFS. He received his B.S. degree in electronics engineering in 1985, his M.S. degree in electrical engineering from Cheonbuk National University in 1988, and his Ph.D. degree in electrical science from Pusan National University in 2009. His research interests include intelligent robots, image signal processing application systems, smart farms, and aquariums using new energy and IoT technology. (E-mail: kyoojae@bufs.ac.kr)

IEIE SPC IEIE Transactions on Smart Processing & Computing

Journal Search

Journal XML

Journal Information

Electric Power Steering System Identification Using Artificial Neural Network for Autonomous Vehicles

Abstract

Keywords

1. Introduction

2. Basic Configurations and Problem Formulations

2.1. The Configuration of EPS System

2.2. ANN Method Concept

(1)

(2)

(3)

(4)

(5)

(6)

(7)

(8)

(9)

(10)

(11)

3. Proposed SI of EPS using ANN-based BP algorithm

(12)

(13)

(14)

(15)

(16)

(17)

(18)

(19)

4. Implementation of the Proposed EPS ANN SI

4.1. EPS Data Acquisition Preparation

4.2. EPS Data Acquisition Results and Data Preprocessing

(20)

4.3. Training Procedures and Model Evaluation

(21)

(22)

(23)

5. Experimental Results

5.1. Analysis of the EPS TFE System

(24)

5.2. Analysis of the EPS ANN Results

6. Conclusion

Acknowledgment

References

Rodi Hartono

Hyun Rok Cha

Hee Tae Chung

Kyoo Jae Shin

Article Information (continued)

Keywords

IEIE SPC

IEIE Transactions on Smart Processing & Computing