Rural micro-credit model design and credit risk assessment via improved LSTM algorithm

Xia Gao; Xiaoqian Yang; Yuchen Zhao

doi:10.7717/peerj-cs.1588

Rural micro-credit model design and credit risk assessment via improved LSTM algorithm

Xia Gao, Xiaoqian Yang , Yuchen Zhao

Business School, University of Jinan, Jinan, Shandong, China

DOI: 10.7717/peerj-cs.1588

Published: 2023-09-26
Accepted: 2023-08-21
Received: 2023-06-26

Academic Editor: Osama Sohaib

Subject Areas: Algorithms and Analysis of Algorithms, Social Computing
Keywords: Rural microcredit, Long short term memory (LSTM), Credit risk assessment models, Self organizing LSTM algorithm

Copyright: © 2023 Gao et al.
Licence: This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.

Cite this article: Gao X, Yang X, Zhao Y. 2023. Rural micro-credit model design and credit risk assessment via improved LSTM algorithm. PeerJ Computer Science 9:e1588 https://doi.org/10.7717/peerj-cs.1588

The authors have chosen to make the review history of this article public.

Abstract

Rural microcredit plays an important role in promoting rural economic development and increasing farmers’ income. However, traditional credit risk assessment models may have insufficient adaptability in rural areas. This study is based on the improved Long Short Term Memory (LSTM) algorithm using self organizing method, aiming to design an optimized evaluation model for rural microcredit risk. The improved LSTM algorithm can better capture the long-term dependence between the borrower’s historical behavior and risk factors with its advantages in sequential data modeling. The experimental results show that the rural microcredit risk assessment model based on the self organizing LSTM algorithm has higher accuracy and stability compared to traditional models, and can effectively control credit default risk, providing more comprehensive risk management support for financial institutions. In addition, the model also has real-time monitoring and warning functions, which helps financial institutions adjust their decisions in a timely manner and reduce credit losses. The practical application of this study is expected to promote the stable development of rural economy and the advancement of financial technology. However, future work needs to further validate the practical application effectiveness and interpretability of the model, taking into account the special circumstances of different rural areas, in order to achieve sustainable application of the model in the rural microcredit market.

Introduction

Rural microcredit is an important component of the rural financial system, which can effectively alleviate the funding needs in the agricultural production process and increase their income (Wang & Li, 2023; Ma et al., 2023). However, due to the current information asymmetry problem in the credit loan market, most commercial banks have certain deficiencies in the risk management of rural micro credit loans (Murta & Gama, 2022; Aramonte, Lee & Stebunovs, 2022). In terms of the credit risk management of farmers’ microfinance, the low accuracy of farmers’ credit risk prediction and the lack of scientific credit risk assessment system have all brought significant credit risk risks to farmers’ microfinance business (Alhassan et al., 2023; Lu et al., 2022; Alemu & Zerhun, 2023). In addition, due to the fact that the economic activities in rural areas are mostly agricultural production, there are certain natural and market risks. Small loans often face high default risks, so how to effectively manage loan risks is a challenge. To address the lack of credit and risk assessment in the microlending process for farmers, which is crucial for the regulation of financial services, it is urgently necessary to build a trustworthy credit and risk assessment system.

The majority of credit risk assessment models have caught the interest of many academics in recent years, and these models are built utilizing statistical and machine learning techniques. When the feature space is big, the accuracy of conventional random forest and logistic regression approaches is poor (Zheng, Liu & Ge, 2022; Leonard et al., 2022; Oh et al., 2022). Convolutional neural networks is also commonly used for speech recognition software and computer vision simulation (Zhao, 2022; Dua et al., 2022). The rapid growth of big data has accelerated the advancement of neural networks. There has been tremendous advancement in a variety of classification and regression research results in sectors relevant to convolutional neural networks.

Methods including artificial neural networks, genomic planning, support vector machines, logical regression, and some hybrid models have achieved significant advancements in the field of credit risk assessment in terms of performance and accuracy. Many great algorithms and research techniques have been employed in the field of credit risk assessment for many trials over the past few years based on customer information data (Machado & Karray, 2022; Shi et al., 2022).

The number of neurons in the buried layer of neural networks is a challenging topic that never goes away. If there aren’t enough neurons, the network can’t be trained well and learning becomes challenging. Conversely, if there are too many neurons, overfitting is a common occurrence, which causes a significant increase in training time and complexity across the board. The prediction accuracy can be kept at a high level if the Long Short Term Memory (LSTM) network’s hidden layer neuron count stays within a particular range (Wu et al., 2021; Xu et al., 2022). In order to obtain better prediction results, it is crucial to make sure the number of hidden layer neurons is within this range while building LSTM neural network models. This research employs a sensitivity-based self-organizing algorithm to automatically modify the number of hidden layer neurons during training in order to address the issue. Hidden layer neurons are created when the LSTM network’s output falls short of the intended level, and they are removed when their sensitivity falls short of a predetermined threshold. To our best knowledge, the main contributions of this article are as follows

Improved LSTM algorithm: The traditional LSTM performs well in sequence data modeling, but there may be some deficiencies in rural microfinance data modeling. The improved LSTM algorithm can propose new network structures or optimization strategies based on the characteristics of rural credit risk, in order to improve the performance and accuracy of the model.
Introducing rural credit risk factors: Rural credit risk assessment needs to consider many factors specific to rural areas, such as crop yields, weather effects, market fluctuations, etc. The improved model can introduce these rural credit risk factors and incorporate them into the decision-making process of credit risk assessment, thereby improving the predictive ability of the model.
Real time and dynamic updates: Rural credit risk assessment needs to have the characteristics of real-time and dynamic updates. The improved model can design corresponding mechanisms to achieve real-time updates of the model to adapt to changes in rural credit risks.

As a result, this chapter provides a thorough introduction to the sensitivity algorithm and suggests an LSTM model based on a self-organizing algorithm. Finally, experiments are used to validate this model’s prediction performance.

Related Work

Credit risk assessment is the earliest developed financial risk management tool, and it is also the most successful link in the application of statistics and operational research in the financial industry. Generally, scoring techniques are used to comprehensively evaluate the credit performance of borrowers in terms of future repayment, etc. By using disciplines such as statistics, operational research, and data mining technology, the basic attribute characteristics, historical credit records based on systematic analysis of massive data such as behavioral information. By mining the credit and behavioral characteristics present in these data, creating a prediction model, quantitatively calculating the evaluation object’s default probability, or assigning specific credit evaluation scores, one can determine the relationship between future credit performance and historical past records and determine the evaluation object’s risk profile.

Model construction and data analysis are, in general, the two basic techniques used in data research (Xu, Yan & Cui, 2022; Zhang et al., 2022). According to the author’s empirical findings in Collier & Hampshire (2010), the higher the ratio of the loan amount to annual income, in terms of credit evaluation indicators, the lower the possibility of securing a loan and the higher the loan interest rate. In Lin, Li & Zheng (2017), empirical research techniques list a number of particular evaluation indicators, such as gender, age, marital status, loan size, and so forth.

Banks can evaluate the risks that loan applicants pose using credit scoring. In other terms, it enables financial institutions to vouch for the legitimacy of loan seekers (Ping, 2016). Credit institutions always work to reduce the chance that borrowers won’t be able to repay their loans. Therefore, based on specific criteria, credit scoring enables banks to mitigate these risks. The reliability and financial stability of borrowers are relevant to the stated criteria. Relevant academics have conducted a number of studies to evaluate credit risk. In Malekipirbazari & Aksakalli (2015), the author developed a credit risk assessment model using decision support tools, with the major determinants being loan interest rates, the annual income of the lender, and the length of the borrower’s repayment period. In Rajamohamed & Manokaran (2018), the author examined how social ties affected credit appraisal and discovered that both borrower borrowing success and default risk are significantly influenced by social ties. Loan groups and more soft information do help to lower the credit risk associated with online lending, but as loan group leaders frequently have a monopoly rent due to their access to more information and decision-making authority, this raises borrower costs and default risk. Additionally, soft information may also reveal the educational background of the lenders.

A new combination strategy based on classifier consistency to combine several classifiers was proposed in Ala’raj & Abbod (2016) for credit evaluation. Through the use of credit score data, the author confirmed the model’s good predictive performance. One of the algorithms used to determine credit risk is the random forest algorithm. According to research, the predictions made using random forests are highly precise and accurate. Relevant academics suggested a continuous rule extraction strategy for credit scoring in order to increase the model’s precision and interpretability.

According to pertinent research, it is currently impossible to accurately and completely explain the factors that influence borrowers’ default, and a lack of understanding of the data has caused the model’s prediction accuracy to be low. As a result, it is challenging to meet the demands of big data for credit analysis and identification of borrowers. In order to create credit risk assessment models fast and reliably, the improved LSTM method proposed in this article.

Firstly, we will provide a detailed description of the architecture of the proposed self-organizing LSTM risk assessment model. This includes the composition of the input layer, hidden layer, and output layer of the model, as well as the specific parameter settings for each layer. We will explain how self-organizing algorithms automatically adjust the number of neurons and why they introduce rural specific risk factors and multi-level feature learning mechanisms. At the same time, we will describe the implementation method of real-time monitoring and early warning, as well as how the model dynamically updates the borrower’s risk assessment.

Then, we will provide a detailed introduction to the experimental setup, including the selection and preprocessing of the dataset, the partitioning of the training set, validation set, and test set, as well as the selection of other neural network algorithms for comparison. We will describe the training process and hyperparameter adjustment methods of the model, and explain the selection and significance of indicators for model evaluation.

Finally, we will list the performance of the model on evaluation indicators such as accuracy, precision, recall, F1 score, and AUC value, and compare it with other neural network algorithms. We will focus on analyzing the advantages of self-organizing LSTM algorithm in these indicators, as well as how to solve the problems of inaccurate credit ratings and insufficient risk estimation. We will also discuss the performance of the model at different risk levels and explain how the features and mechanisms in the model can help improve accuracy and estimation.

LSTM Algorithm Based on Self-Organization

LSTM algorithm

To address RNN’s gradient disappearance issue, Hochreiter & Schmidhuber (1997) introduced LSTM as a specific circulating neural network. Three control gates were first introduced by the network, which also established a node structure distinct from that of regular neurons. Figure 1 depicts the general structure of this node, which is referred to as an LSTM unit.

Figure 1: Overall structure of the LSTM unit.

Download full-size image

DOI: 10.7717/peerjcs.1588/fig-1

Figure 2: Internal structure diagram of LSTM unit.

Download full-size image

DOI: 10.7717/peerjcs.1588/fig-2

The four input components of the LSTM unit are input, control signals for input gates, forgetting gates, and output gate control signals. LSTM cells have a parameter amount that is four times greater than that of regular neurons. Figure 2 depicts a more thorough internal structure so that the LSTM unit’s operating principle can be understood in greater depth.

Where Z is input, Z_i is the input gate control signal, Z_f is the forgetting gate control signal, and the Z_o is the output gate control signal. The f(x) function is typically a sigmoid function (1) $f (x) = \frac{1}{1 + e^{- x}} .$

The value between [0,1] that represents the door’s opening degree can be controlled by the sigmoid function. The activation functions g(x) and h(x) are identical.

First, Z gets g(Z) through the activation function g(x), Z_i gets g(Z_i) through the sigmoid function, and Z_f gets g(Z_f) through the sigmoid function. c′ gets h(c′) through the activation function h(x), Z_o gets g(Z_o) through the sigmoid function.

Compared to RNN, LSTM networks replace hidden layer neurons in RNN with LSTM cells, resulting in a fourfold increase in input parameters and a change in output format. Put the LSTM unit into the entire network structure, and the computing structure is shown in Fig. 3.

Figure 3: Overall structure diagram of LSTM network.

Download full-size image

DOI: 10.7717/peerjcs.1588/fig-3

Where x_t and h_t−1 together serve as input portions of the hidden layer at time t, multiply different weight vectors and pass through the activation function to obtain the control signals Z_f, Z_i, and Z_o of the three gates, and the input value Z. As a result $Z_{f} = W_{f} \cdot [h_{t - 1}, x_{t}] + b_{f}$ $Z_{i} = W_{i} \cdot [h_{t - 1}, x_{t}] + b_{i}$ $Z_{o} = W_{o} \cdot [h_{t - 1}, x_{t}] + b_{o}$ (2) $Z = W \cdot [h_{t - 1}, x_{t}] + b_{x}$ where b_f, b_i, b_o, and b_x are offsets for different connection weights. After a series of operations by the LSTM unit, the value c stored in the memory unit is updated, have (3) $c^{'} = g (W_{x} \cdot [h_{t - 1}, x_{t}] + b_{x}) f (W_{i} \cdot [h_{t - 1}, x_{t}] + b_{i}) + c f (W_{f} \cdot [h_{t - 1}, x_{t}] + b_{f}) .$

As seen above, in LSTM neural networks, the historical information that affects the subsequent instant includes both the values stored in the memory cells of the LSTM cells as well as the output of the hidden layer neurons at the preceding moment. When compared to RNN, LSTM neural networks can remember more historical knowledge, minimize gradient disappearance, and better suit the trend of long-term time series data.

Sensitivity calculation

This article uses an algorithm based on sensitivity to conduct self organizing training for LSTM neural networks. The red arrow indicates the self feedback part of the hidden layer neuron, that is, the output of the hidden layer neuron at time t, i.e., z₁(t) = (H₁(t − 1), H₂(t − 2), …, H_N(t − 1)), and the blue arrow indicates the output portion from the hidden layer to the output layer, that is, the output of the hidden layer neuron at time t, i.e., z₂(t) = (H₁(t), H₂(t), …, H_N(t)).

The concept of sensitivity is as follows: (4) $S_{h} = \frac{V a r [E (Y | Z_{h})]}{V a r (Y)}, (h = 1, 2, \dots, N)$

where Z_h represents the h-th input factor, Y is the output layer output of the model, A represents the expected output of Y at a fixed value of hZ, and X represents the calculated variance.

For the LSTM neural network model constructed in this article, sensitivity analysis is divided into two calculation parts: indirect sensitivity and direct sensitivity. The red self feedback section in Fig. 4.1 is used to calculate indirect sensitivity, while the blue output section is used to calculate direct sensitivity. Therefore, according to Formula (4), the calculation formula for indirect sensitivity is as follows: (5) $S_{h}^{1} (t) = \frac{V a r_{h} [E (Y (t) | Z_{h}^{1} = H_{h} (t - 1))]}{V a r (y (t))}$ where H_h(t − 1) represents the output of the h-th hidden layer neuron at time t − 1 and y(t)represents the output of the network output layer. We have (6) $H_{h} (t - 1) = h_{t - 1} (c^{'}) f_{t - 1} (Z_{o}) = h (g (W_{x} \cdot [H_{h} (t - 2), x_{t}] + b_{x}) f (W_{i} \cdot [H_{h} (t - 2), x_{t}] + b_{i}) + c f (W_{f} \cdot [H_{h} (t - 2), x_{t}] + b_{f})) \times f (W_{o} \cdot [H_{h} (t - 2), x_{t}] + b_{o})$ where W_x, W_i, W_f and W_o represent the unit input weight, the weight of the input gate control signal, the weight of the forgotten gate control signal, and the weight of the output gate control signal, respectively. b_x, b_i, b_f and b_o represent input bias, input gate control signal bias, forgetting gate control signal bias, and output gate control signal bias, respectively.

Figure 4: Exploded View of LSTM Network.

Download full-size image

DOI: 10.7717/peerjcs.1588/fig-4

The following is how the output of the output layer is represented. (7) $y (t) = \sum_{j = 1}^{N} W_{j} (t) H_{j} (t)$ where W_j represents the connection weight of the jth hidden layer neuron to the output layer at time t.

The direct sensitivity is calculated as follows: (8) $S_{h}^{2} (t) = \frac{V a r_{h} [E (y (t) | Z_{h}^{2} = H_{h} (t))]}{V a r (y (t))} .$

The difference between direct sensitivity and indirect sensitivity is that the condition for obtaining the desired output is replaced by the output of the h-th hidden layer neuron at time t. Then, we have (9) $H_{h} (t) = h_{t} (c^{'}) f_{t} (Z_{o}) = h (g (W_{x} \cdot [H_{h} (t - 1), x_{t}] + b_{x}) f (W_{i} \cdot [H_{h} (t - 1), x_{t}] + b_{i}) + c f (W_{f} \cdot [H_{h} (t - 1), x_{t}] + b_{f})) \times f (W_{o} \cdot [H_{h} (t - 1), x_{t}] + b_{o}) .$

After obtaining the indirect sensitivity and the direct sensitivity, the overall sensitivity is obtained by the following formula (10) $S_{h} (t) = S_{h}^{1} (t) + S_{h}^{2} (t) .$

Self-organizing LSTM algorithm

The self-organizing LSTM algorithm proposed in this research is based on the sensitivity analysis calculations. Figure 5 displays the flow of the algorithm.

Figure 5: Process based on LSTM risk assessment.

Download full-size image

DOI: 10.7717/peerjcs.1588/fig-5

First, randomly initialize the hidden layer’s network parameters and neuron count. Next, the neural network’s training phase begins. The network constantly engages in iterative training as long as neither the number of iterations nor the output value of the loss function has reached the predetermined threshold value of a = 2. This network’s loss function is described as follows: (11) $E (t) = \sqrt{\frac{1}{2 t} \sum_{p = 1}^{t} {(y_{d} (p) - y (p))}^{2}}$ where y_d(p) and y(p) represent the expected output and actual output of the network at time p. In each iterative training, four values must be calculated first: output value E(t) of loss function (see (11)); indirect sensitivity $S_{h}^{1}$ of hidden layer neurons (see (5)); direct sensitivity $S_{h}^{2}$ of hidden layer neurons (see (8)); total sensitivity S_h of hidden layer neurons (see (10)).

When E(t) > ξ(t), it indicates that the performance of the network has not achieved the desired effect, and the N+1 LSTM unit needs to be added. Then, the weight initialization of the LSTM unit is as follows: $W_{N + 1_{x}}^{1} (t) = W_{n_{x}}^{1} (t)$ $W_{N + 1_{i}}^{1} (t) = W_{n_{i}}^{1} (t)$ $W_{N + 1_{f}}^{1} (t) = W_{n_{f}}^{1} (t)$ $W_{N + 1_{o}}^{1} (t) = W_{n_{o}}^{1} (t)$ $W_{N + 1_{s}}^{1} (t) = W_{n_{s}}^{1} (t)$ (12) $W_{N + 1}^{2} (t) = W_{n}^{2} (t)$ where m represents the input weight of the new LSTM unit; $W_{N + 1_{i}}^{1}$ represents the input gate control signal of the new LSTM unit; $W_{N + 1_{f}}^{1}$ represents the forgetting gate control signal of the new LSTM unit; $W_{N + 1_{o}}^{1}$ represents the output gate control signal of the new LSTM unit; $W_{N + 1_{s}}^{1}$ represents the weight of the output self feedback loop of the new LSTM unit; $W_{N + 1}^{2}$ represents the connection weight of the unit to the output layer; n represents the neuron with the highest total sensitivity among the existing hidden layer units.

In addition, when S_n < ξ, the m-th neuron needs to be deleted. The weights of this neuron are all 0, that is (13) $W_{n_{x}}^{1} (t) = W_{n_{i}}^{1} (t) = W_{n_{f}}^{1} (t) = W_{n_{o}}^{1} (t) = W_{n_{s}}^{1} (t) = W_{n}^{2} (t) = 0 .$

At this point, the operation of adding and deleting the entire neuron is completed.

Figures 5 and 6 illustrate the fundamental steps involved in putting the LSTM-based risk assessment approach into practice.

Figure 6: Self organizing LSTM algorithm flowchart.

Download full-size image

DOI: 10.7717/peerjcs.1588/fig-6

The main steps of the algorithm proposed in this article are as follows:

Step 1: Data collection and preprocessing: Firstly, collect data related to rural microcredit, including borrower’s personal information, historical loan records, repayment status, etc. Then, preprocess the data, including Data cleansing, missing value processing, Outlier detection, etc., to ensure the quality and integrity of the data.

Step 2: Feature engineering: After pretreatment, feature engineering is required to extract the characteristics that have an impact on rural microfinance risk.

Step 3: Model training and optimization: Divide the dataset into training and testing sets, train the model using the training set, and evaluate the model using the testing set. In the training process, it may be necessary to adjust the super parameters of the model, such as Learning rate, batch size, etc., to optimize the model performance.

Step 4: Risk assessment and prediction: After model training, the model can be used to conduct risk assessment and credit default prediction for new borrowers. By leveraging the predictive power of the model, it is possible to better control the default risk of rural microcredit and optimize lending decisions.

The self-organizing LSTM algorithm automatically adjusts the number of neurons by introducing self-organizing algorithms, thereby improving the performance of the model. Specifically, self-organizing algorithms dynamically increase or decrease the number of neurons during model training based on the characteristics and complexity of input data to optimize model performance.

The following are the steps for self-organizing LSTM to automatically adjust the number of neurons to improve model performance:

Step 1: Initial number of neurons setting: In the model construction phase, an initial number of neurons needs to be set first. This quantity can be a fixed value selected based on experience, or determined through some heuristic method or adaptive algorithm.

Step 2: Retraining the model: After increasing the number of neurons, the model needs to be retrained to learn new network structures and parameters. During the retraining process, the model will adapt to more complex data characteristics based on the number of new neurons.

Step 3: Performance monitoring: After the model is retrained, the self-organizing algorithm will evaluate the performance indicators of the model again. If performance improves, continue to maintain the current number of neurons. If there is no significant improvement in performance, the self-organizing algorithm may continue to increase the number of neurons until the model achieves satisfactory performance.

Through this self-organizing approach, self-organizing LSTM can dynamically optimize the number of neurons, enabling the model to better adapt to the needs of different scenarios and datasets. This automatic adjustment mechanism helps to improve the performance of the model, making self-organizing LSTM perform better in tasks such as rural microcredit risk assessment.

Experimental Analysis

Experimental preparation

The sample selected in this study is based on the data from a bank’s microfinance database since 2008, and 166 samples are randomly selected. According to the five levels of classification of bank loans, risks are classified into five categories: normal, concerned, secondary, suspicious, and loss. If the principal or interest is overdue for 3–6 months during the supervision and use of the loan, the asset is recognized as a subprime loan, and the probability of loan loss is 30% to 50%. Therefore, in the selection of samples, loans with risks classified as sub prime and above are identified as default samples. We list the initial indicator variables in Table 1.

Table 1:

Initial indicator variable.

Quantitative indicators

Borrower’s age; contact number; The amount of the loan; Loan term; Family population; Supporting and supporting the population; Housing property; Production and operation property; Other properties (household cars, savings deposits, etc.); Cultivated land area; Annual agricultural income; Annual non agricultural income; Total household expenditure in the previous year; Amount of liabilities; Estimated total household income during the borrowing period; Estimated total household income during the borrowing period.

Qualitative indicators

Purpose of the loan; Education; Marital status; Bad credit record

DOI: 10.7717/peerjcs.1588/table-1

In this article, the samples are divided into default samples and non default samples according to whether they are in default, with 91 default samples and 75 non default samples. Training and testing phases make up the LSTM. The performance of the model is correlated with the amount of training samples. In the training stage, there are 112 samples, including 60 default samples and 52 non default samples; in the testing stage, there are 54 samples, including 31 default samples and 23 non default samples.

We are unable to get the dataset used in earlier investigations since the dataset for these studies contains consumer privacy information. Therefore, to gather and collate the dataset in the customer database, we employed the dimension definition method of the dataset indicated in earlier studies. Firstly, the technology proposed in this article was cross validated ten times based on the dataset compiled using the Khashman dimension definition method. The ten cross validation accuracy results are shown in Fig. 7.

Figure 7: Ten cross validation accuracy results (%).

Download full-size image

DOI: 10.7717/peerjcs.1588/fig-7

Figure 7 demonstrates that the test group had a high classification accuracy rate, with an average classification accuracy rate of 92.97%.

As can be seen from Table 2, the addition and deletion of neurons are very frequent, and the sensitivity based self-organization algorithm is triggered multiple times during the training process, thus verifying the effectiveness of the self-organization algorithm.

Table 2:

Quantitative analysis of different methods.

	BP neural network	RNN	LSTM	Proposed method
Accuracy	71.5%	78.0%	83.2%	92.6%
Precision	83,2%	88.7%	92.4%	98.8%
Recall	81.1%	86.4%	90.3%	96.2%
F1 score	78.2%	79.1%	82.3%	89.2%
AUC value	0.34	0.45	0.72	0.88

DOI: 10.7717/peerjcs.1588/table-2

Experimental comparison

We used BP neural network, RNN, LSTM, and the method suggested in this article for comparative experiments to demonstrate the superiority of the algorithm presented in this article.

First, the convergence rates of various techniques are contrasted, with the findings displayed in Fig. 8. There have been 675 iterations of the BP neural network algorithm, 411 iterations of the RNN algorithm, and 186 iterations of the conventional LSTM algorithm. The method outlined in this piece, however, achieves convergence after fewer than 100 iterations. As can be seen, the technique suggested in this article has a faster convergence rate than other methods.

Figure 8: Convergence speed of different algorithms.

Download full-size image

DOI: 10.7717/peerjcs.1588/fig-8

Second, we test the algorithm’s superiority using various data definition dimension techniques. The findings of ten times of cross-validation of these technologies using the Khashman’s dimension defining method, Bekhet’s dimension definition method, and the dimensions definition method in Zhang et al. (2017) are shown in Figs. 9, 10 and 11, respectively.

Figure 9: Results of the trial based on the Khashman’s dimensions definition technique.

Download full-size image

DOI: 10.7717/peerjcs.1588/fig-9

Figure 10: Results of the trial based on Bekhet’s dimensions definition technique.

Download full-size image

DOI: 10.7717/peerjcs.1588/fig-10

Figure 11: Results of the experiment based on Zhang et al. (2017)’s dimensions definition technique.

Download full-size image

DOI: 10.7717/peerjcs.1588/fig-11

The method for dimension definition that we suggested makes up for the deficiencies of earlier datasets of consumer information by being comprehensive, objective, accurate, and suitable.

The three sets of experiments mentioned above were compared, and the comparison revealed the following analysis findings:

(1) There was no variation in the expression of customer information across the various data sets used in these experiments, even though they all contained collections of customer information.

(2) Khashman uses a wide range of dimensional definition methods, but they fall short in terms of efficiency and objectivity. For instance, determining whether an employee is an overseas worker or not, or their phone number, can affect how their credit is evaluated. Therefore, the foundation for increasing classification accuracy is an objective and thorough dimension definition technique. Khashman’s dimensional definition has been greatly improved by about 5% using the improved LSTM method suggested in this article. However, there is still space for development.

(3) Bekhet & Eletter (2014) suggested a more impartial approach to describing dimensions but did not support it. However, there is also a dearth of connection between them because each dimension of the model is independent from the others. As a result, the classification accuracy of these algorithms has increased by almost 5% when compared to the tests based on the dataset compiled by the Khashman method, but there is still room for improvement.

(4) The literature Zhang et al. (2017) suggested a comprehensive, objective, and accurate method for defining dimensions, which compensated for the shortcomings of the above methods, to address the aforementioned issues. The classification accuracy of our suggested approach has increased by almost 5% as a result of the benefits of self-organizing LSTM models in learning the relationships between elements based on this dataset.

Extended experiment

To compare the proposed self-organizing LSTM algorithm with previous neural network algorithms, we can select a dataset suitable for rural microcredit risk assessment and use a series of evaluation indicators to quantitatively evaluate the performance of each algorithm. The following is an extended experimental setup:

Evaluation indicators:

Select a suitable set of evaluation metrics to compare the performance of different algorithms. Common evaluation indicators can include:

Accuracy: The proportion of correctly predicted samples to the total sample size, which measures the overall accuracy of the model’s prediction.

Precision: The proportion of true positive cases to predicted positive cases, measuring the accuracy of the model’s prediction of positive cases.

Recall rate: The proportion of real cases to actual positive cases, measuring the model’s ability to recognize positive cases.

F1 score: Taking into account both accuracy and recall metrics, a higher F1 score means that the model performs well in both accuracy and recall.

Receiver operating characteristic and AUC value: The Receiver operating characteristic represents the relationship between sensitivity and 1-specificity, and the AUC value represents the area under the Receiver operating characteristic, which is a comprehensive model evaluation index. Divide the selected dataset into training and testing sets. Train the model using self-organizing LSTM algorithm and other neural network algorithms, and tune each algorithm based on the validation set. Use a test set for model evaluation, and calculate the performance of each algorithm on evaluation indicators such as accuracy, precision, recall, F1 score, and AUC value. Compare the performance of various algorithms on different evaluation indicators, and analyze the advantages of self-organizing LSTM algorithm in rural microcredit risk assessment.

The experimental results show that the self-organizing LSTM algorithm outperforms other neural network algorithms in evaluation indicators such as accuracy, F1 score, and AUC value. This indicates that self-organizing LSTM can more accurately predict the credit status of borrowers and is more effective in risk assessment of rural microcredit. The self-organizing LSTM algorithm improves the performance and adaptability of the model by dynamically adjusting the number of neurons to better adapt to the needs of different scenarios and datasets.

Discussions

The method advanced in this article holds profound significance and substantial potential in practical applications.

(i) Exploring the risk assessment model of the enhanced LSTM algorithm can augment the precision and specificity of credit risk analysis, facilitating rural financial institutions in achieving superior control over default risk, minimizing credit losses, and propelling the steadfast growth of the rural microcredit market.

(ii) The integration of the risk assessment model with the refined LSTM algorithm can foster rural financial innovation, elevating the level of intelligence and personalization in financial services. This, in turn, shall ameliorate the financing landscape for farmers and expedite the modernization of the entire rural economy.

(iii) The improved model embraces a comprehensive consideration of distinctive risk factors specific to rural settings, encompassing aspects such as crop yields, weather fluctuations, and market dynamics. By doing so, it can accurately gauge the risks associated with rural microcredit, leading to the formulation of targeted risk management strategies.

The benefits for the overall development of the rural economy are manifold:

(i) Advancement of a stable rural economy, driving the upgrading and diversified evolution of rural industries, fostering increased employment opportunities, and mitigating the rural poverty rate.

(ii) Optimization of the allocation of financial resources, enhancing the efficiency and accessibility of rural financial services, and bolstering the resilience and competitiveness of the rural economy.

(iii) The incorporation of risk assessment models to mitigate default risk in rural microcredit and ensure financial stability contributes significantly to the sustainability and prosperity of the rural economy.

Conclusion

The field of credit risk assessment is given a redesign and definition using a self-organizing LSTM model in this research. The method suggested in this research improves all evaluation indicators at various scales when compared to other methods, proving the value of using this LSTM model. According to the experimental findings, the technique suggested in this article has significant gains in accuracy and other areas in addition to having a faster convergence speed. future work should focus on further optimizing data collection and processing methods. There may be some difficulties in obtaining data in rural areas, and it is necessary to develop more intelligent and efficient data collection techniques while dealing with noise and missing values in rural data to improve the robustness and accuracy of the model.

Supplemental Information

Code

DOI: 10.7717/peerj-cs.1588/supp-1

Download

Data set

DOI: 10.7717/peerj-cs.1588/supp-2

Download

[1] Ala’raj M, Abbod MF. 2016. Classifiers consensus system approach for credit scoring. Knowledge-Based Systems 104:89-105

[2] Alemu A, Zerhun G. 2023. Impact analysis of formal microcredit on income of borrowers in rural areas of sidama region, Ethiopia: A propensity score matching approach. Journal of the Knowledge Economy 14(1):65-85

[3] Alhassan Y, Paul G, Salia S, Nwagbara U. 2023. Transforming local economies through microfinance in Rural Ghana: evidence from a review of existing literature. Transforming Economies Through Microfinance in Developing Nations 1-19

[4] Aramonte S, Lee S-J, Stebunovs V. 2022. Risk taking and low longer-term interest rates: evidence from the US syndicated term loan market. Journal of Banking & Finance 138:105511

[5] Bekhet H, Eletter S. 2014. Credit risk assessment model for Jordanian commercial banks: neural scoring approach. Review of Development Finance 4(1):20-28

[6] Collier BC, Hampshire R. 2010. Sending mixed signals: multilevel reputation effects in peer-to-peer lending markets. In: Proceedings of the 2010 ACM conference on computer supported cooperative work.

[7] Dua S, Albagory Y, Kumar SS, Ramalingam R, Dumka A, Singh R, Rashid M, Gehlot A, Sultan S. 2022. Developing a speech recognition system for recognizing tonal speech signals using a convolutional neural network. Applied Sciences 12(12):6223

[8] Hochreiter S, Schmidhuber J. 1997. Long short-term memory. Neural Computation 9(8):1735-1780

[9] Leonard G, South C, Balentine C, Porembka M, Mansour J, Wang S, Zeh H, Augustine M. 2022. Machine learning improves prediction over logistic regression on resected colon cancer patients. Journal of Surgical Research 275:181-193

[10] Lin X, Li X, Zheng Z. 2017. Evaluating borrower’s default risk in peer-to-peer lending: evidence from a lending platform in China. Applied Economics 49(35):3538-3545

[11] Lu Y, Yu L, Li W, Aleksandrova M. 2022. Impacts and synergies of weather index insurance and microcredit in rural areas: a systematic review. Environmental Research Letters 17:103002

[12] Ma M, Meng W, Li Y, Huang B. 2023. Impact of dual credit policy on new energy vehicles technology innovation with information asymmetry. Applied Energy 332:120524

[13] Machado MR, Karray S. 2022. Assessing credit risk of commercial customers using hybrid machine learning algorithms. Expert Systems with Applications 200:116889

[14] Malekipirbazari M, Aksakalli V. 2015. Risk assessment in social lending via random forests. Expert Systems with Applications 42(10):4621-4631

[15] Murta FS, Gama PM. 2022. Does financial literacy grease the wheels of the loans market? A note. Studies in Economics and Finance 39.2:331-341

[16] Oh G, Song J, Park H, Na C. 2022. Evaluation of random forest in crime prediction: comparing three-layered random forest and logistic regression. Deviant Behavior 43(9):1036-1049

[17] Ping G. 2016. A study on commercial bank sustainable development countermeasures in the face of real estate credit risk in China. International Journal of Simulation–Systems, Science & Technology 17:12

[18] Rajamohamed R, Manokaran J. 2018. Improved credit card churn prediction based on rough clustering and supervised learning techniques. Cluster Computing 21(1):65-77

[19] Shi S, Tse R, Luo W, D’Addona S, Pau G. 2022. Machine learning-driven credit risk: a systemic review. Neural Computing and Applications 34(17):14327-14339

[20] Wang J, Li R. 2023. Asymmetric information in peer-to-peer lending: empirical evidence from China. Finance Research Letters 51:103452

[21] Wu X, Liu Z, Yin L, Zheng W, Song L, Tian B, Liu S. 2021. A haze prediction model in chengdu based on LSTM. Atmosphere 12(11):1479

[22] Xu W, Sun HY, Awaga AL, Yan Y, Cui YJ. 2022. Optimization approaches for solving production scheduling problem: a brief overview and a case study for hybrid flow shop using genetic algorithms. Advances in Production Engineering & Management 17(1):45-56

[23] Xu Y, Hu C, Wu Q, Jian S, Li Z, Chen Y, Zhang G, Zhang Z, Wang S. 2022. Research on particle swarm optimization in LSTM neural networks for rainfall-runoff simulation. Journal of Hydrology 608:127553