Adapting Altman's bankruptcy prediction model to the compositional data methodology

Adapti ng A ltman’ s bankruptcy pre dictio n model to the co mp os it i on a l d at a met h odo log y Fatemeh Keivani , Germ à C o enders * , Ge ò rgia Escaram ís Univer sitat de Gir ona, department of economics, Girona, Spa in March 2026 Abstract : Using standar d financial rati os as vari ab les in stati stical analy ses has been relat ed to several serious problems , suc h as extreme outliers, asymmetry , non - normali ty , and non- linearity . The compositional - data method ology has been successfully applied to solve thes e problems and has always yielded substantially different results when compar ed to standard finan cial rat ios. An under - researched area i s the use of fin ancial log- ratios computed with the compositional - data methodology to predict bankruptcy or the related terms o f bus iness de f ault, insolvenc y or failur e. Anothe r under - r esearched area is the use of machine learning methods i n combination with c ompositional log- ratios. The present art icle adapt s t he cla ssical Altma n bankruptcy pr ediction model and some of its extensions to the compos itional methodology with pairwis e log - ratios and thr ee common statis tical and machin e learning too ls : logistic r eg ression mode ls , k- nearest neighbours , and random forests , and compa res the results wit h standard financial ratios . Data f rom the sector in the Spanish e conomy with the larges t number of bankrupt fi rms according to the first two d igits of the NACE code (46XX “wholes ale tr ade, except of m otor vehicle s and mot orcycl es”) were o btained f ro m the I berian Balance sh eet Anal ysis System . The sample s ize (31,131 f irms, of w hich 9 7 were bankr upt) was d ivided into a training and a validation d atas et. The training data se t was downs ampled to one healthy fir m to each bankrupt fir m . No outliers were removed. Focusing on predictive performance, t he resu lt s show that compos itional methods are better than standard ratios in te rms of se nsitivity , with mixed res ul ts regarding specificity , compos itional random for ests a nd compositional logistic regression beha ving the best. Keyword s : Artificia l in telligenc e, da ta mining, big data, accounting ratios, log it mo dels, credit scoring, CoDa , SABI. JEL codes: C25, C45, C46, G32, G33, M41 . MSC cod es: 62H30, 62J12, 62P05 , 68 T 99. * Corresponding author: ge rma.coenders@udg.edu Keiv ani el al. ( 2026 ) , arX iv . 2 Intr oduction Using s tandard financial ratios as variables in statistic al models h as been relat ed to sever e violations of the models’ assumptions , such as extr em e outli ers , asymmet ry , connect ed to it sever e non -normality of the distributions, non- li nearity of the relationships, and even dependence of the results on the arbitrary decisio n regarding which accounting figure appears in the nu merator and which in the deno minator of the ratio ( Carre ras - Simó & Coenders, 2021; Coenders & Arimany - Se rrat, in press ; Durana et al . , 2025; I otti et al., 2024; L inar es - Mustarós e t al., 2018; 2022; Magrini , 2025 ) . The results of many sta tistical and machine learning analyses are in valid when all or some of these problems occur and the r esults and conclusions of said analyses can be affected to a great extent. T h e well - known Compositional Data ( CoDa) methodol ogy using log - ratio tr ans f ormations (Aitchison, 198 2 ; 1986; Coenders et al., 2023 a ; Gre en acre et al., 2023 ; Pawlowsky -Glahn et al., 2015 ) has been successf ul ly applied in connection with financial ratios over the l ast 10 years, to solve these problems ( Coenders, 2025; Coe nders & Arimany - Serrat, 2025; i n pr ess). This methodology was developed to deal with fixed - sum data in chemical and geological analys is, but the fixed sum is by no m eans ess ential. CoDa are contemp orar ily defined as arrays o f strictly positive numbers for which ratios between them a re considered to be relevant ( E gozcue & Pawlowsky - G lahn, 2019) , which pe r fectly fits the notion of f inancia l state ment analysis . Far from be ing a statistical re finement, the CoDa methodology leads to very subs tantial differ ences in the analysis resul ts whenever it has been compar ed wit h s t andard fin ancial ratios (Arimany - S errat et al., 2022; Carrer as - S im ó & Coenders, 2021; Coende rs & Arimany - Serrat, 2025 ; i n pre ss; Coenders et al ., 2023b; Creixans - T enas et al . , 2019; Dao et al., 2024; Escar amís & Arbussà , 2025; Her nandez - Romero & Coende r s, in press; Jo fre - Campuzano & Coenders, 2022; L inares - Must arós et al. , 2018; 2022; Magrini, 2025 ; Mola s - Colomer et al., 2024 ). An under - researched ar ea is that of the use of f inanci al - statement ratio s to predict outcom e variab les (Coenders & Ar imany - Serrat, 2 025 ; Molas - Colomer et al. , 2026 ) such as bankruptcy or the related terms of business defaul t , insolven cy or failur e . The only known ins tance of application o f the C oDa methodology here is that by Ma gr ini (2025 ) both using logistic r egr ession models , LASSO r egularization , an d random fo r ests . Magrini’ s work is also the first to c ombine a f inancial application of the CoDa methodology with machine learning . In bankruptc y prediction studies, p redictive accuracy bec omes a key pa rameter , beside s fulfilment or violation of the as sumptions. Magrini did not find subs tantial differences in a ccuracy betwee n standard and compositional methods. Since the seminal work by Altman (1968) , the proble m of using financial ratios to p redict bankruptcy has never ceased to attract interest ( Hanun & F erdiani, 2025; Insani et al. , 2024; Magrini, 2025; Martin - Melero et al ., 2025; Nugroho & Dewa yanto, 2025; Soukal et al., 2024 ; Staňková & Hampel, 2023; T aoushianis , 2025; V eganzones & S é verin , 2021 ; W iller do Prado et al. , 2016 ). The o riginal Altman’ s model and s ome popular vari ants Keiv ani el al. ( 2026 ) , arX iv . 3 continue to be used a nd a dapted to new machine le arning tools , a. k.a . statis tical learning , data mining , a n d artific ial intelligen ce tools (M arti n - Meler o et al., 202 5). T he literature on bankruptcy prediction with machine learning methods is growing apace ( Akusta & Gün, 2025; Antar et al., 2025; Anta r & T aya chi, 2025; Asif et al., 2025; Bal zano & Magrini, 2025; Chan et al . , 2025; Chen et a l., 20 25; Chohan et al., 2026; Cojocaru, & Ionescu, 2025; Dam & Nghiem, 2026; Dumitresc u et al., 2025; Gabrielli e t al., 2026; Gajdosikova et a l., 2026; Gnip et al., 2025; Gwalani et al . , 2025; Hussain et al., 2026; Kanojia & Arora , 2025; Khotimah & W anti W idodo, 2026; Kristanti et al., 2025; Lin et al., 2025; Liu , 2025 a ; L i u et al., 2025; L iu, 202 5b ; Martin - Melero et al., 2025 ; Mat suma ru & Katagiri, 2025; T anaka e t a l., 2025a; 2025b ; T hot a et al., 2026 ; V eganzones et al ., 2026; W ang et al., 2026; X u, 2026; Yu a n , 2025; Zhao, 2025. Zhu et al., 2026 ) . O f ten , eliminating a vast amount o f outliers ( Balza no & Magrini, 202 5 ; Magr i ni, 2025 ) , ev en in t he r ange o f one third o f the or iginal s ample (Martin - Melero et al., 2025) , is the p rice to be paid for using standa rd financial ratios , as some machine learning methods a r e relative ly immune to outliers, but no t all o f them . Machine lear ning t ools can be adapted to the CoDa methodology ( T olosana- Delgad o et al., 2019; T r an, 2025). The present ar ticle compares Altman’ s classical bankruptcy prediction model and some of its extensions with the CoDa methodology an d three common s tatistic al and machine le arning tools : lo gistic reg ress ion models , random fores ts , and k- near est neighbours . Ra ther than focus ing on the violation of the ass umptions, we focus on comparing predictive performance. Our approach complements that of Magrini (2025) , firstly , by using the full s ample without tri mming outlie rs, which is poss i ble in the Co Da methodology , as i t tends t o produce few or no outliers ( L in ar e s - Mustarós et al., 2018 ) . Random forests are relative immune to outliers (Hia et al ., 2023), but logistic regression , being a parametric s tatistical mode l, and k - nearest neighbours , being a distance - based method, are no t . Secondly , by downsampling of healthy firm s , which is known to improve pr edicti ve performance when ther e are f ew bankrupt fir ms ( Antar & T ayachi, 2025; Cojoca ru & I onesc u, 2025; Gabrielli et al . , 2026; Gnip et al., 2025; Kanojia & Aror a, 2025; Liu et al., 2025; Ma rtin - Meler o et al., 2025; T anaka et al., 2025a ; 2025b; V eganzones et al., 2026). T o this end , we randomly select one healthy firm for each bankrupt fi r m in the training sample. T hird l y , by using an alternative log - ratio transformation te rme d pairwis e log -ratios (henceforth plr ; Greenacre, 201 9 ). Materi als and m ethods Standar d and CoDa r epr esentation of Alt man’ s mo del for bankruptcy pr ediction According to Altman (1968 ), the following five s tandard f inancial ratios are good predictors of bankruptcy: working capital ratio: (CA - CL)/ (N CA+CA) , retai ned earni ngs over assets: RE/( NCA+CA), re tur n o n assets: ( OR - OE)/( NCA+CA), (1) net worth ove r liab ilitie s: (NCA+CA - NCL - CL )/( NCL+CL ), turnover: O R /(NCA+CA), Keiv ani el al. ( 2026 ) , arX iv . 4 where NCA stands for non - cur ren t assets, CA f or cur rent assets, RE f or retained earnin gs, NCL for non - current liabilities , CL for current l iabilities, OR f or operating revenue and OE for operating expenses . In the origin al Altman’ s model, the c ompany marke t value is used inste ad of the book value of net worth ( NCA+CA - NCL - CL). The book value is used in this article as commonly d one, being available f or all c om panies, be they traded in the stock excha nge or not. Altman’ s model has been re for mulated m any times. T he mos t popul ar s uch reformul ations are arguably those by Amat el al. (2008) ; G r over and Lavin (2001); Springate (1978) ; and Zmijewski (1983). These formulations add or substitute five more standard finan cial ratios: p rofit ove r curren t liabili ties: (O R- OE) /CL , c urrent ratio: CA/CL, i nver ted lever age ratio : (NCA +CA - NCL - CL)/(NC A+CA), (2) return on equity : (O R- OE )/( NCA+CA - N CL - CL) , indebtedness : ( NCL +CL) /( NCA+CA ). The starting point of the CoDa methodology is not the ratios , but a composition of the D positive accountin g fig ures needed to compute them, which are call ed par ts or components. Positiveness is ess ential. For instanc e, one cannot use operating pro fit (O R- OE) which may be nega t ive, but the always positive operating revenue (O R ) and operat ing exp enses (OE) separatel y (Cr eixans - T enas et al., 2019). In o ur case the D =7 parts in the composition are NCA, CA, RE , NCL, CL , O R and OE. The D p art s ar e then subject to log - r atio transformatio ns that c ontain all the rel ative information among the m (L inar es - Mustarós e t al., 2018; Coenders & Arim any - Serr at, in pr ess). The simples t approach to the C oDa methodology in financial an alysis is transforming the accounting figures by means of plr ( Creix ans - T en as et al., 2019; Coenders & Arimany - Serr at, in press; Farre ras - Nogu er et al . , i n p r e ss ; Greenacre, 20 19 ; Mulet - Fort eza et al., in pr ess ) . This involves d efining D - 1 l o g - ratios be tween only two accounting figures, so that each acc ount ing figure can be connected to any ot her (exhaustiveness) in only one way (non- r edundancy) thr ough the log - ratios. The D - 1=6 plr below follow both rules and can be interpreted according to c ommon financial concepts: ass et tangibility: log(NCA/C A), curr ent - asset turnover: log(O R /CA), margin: log(O R /OE) , current ratio: log(CA/CL), (3) debt maturity: log(NCL/CL), retained earnings over non - c ur rent liabilities: log (RE/NCL ). As recommended by Greena cre (2019), if we d raw a graph in which the accounting figures are the ve rtices (nodes) and the pl r a re the c onnections (edges), it is eas y to see that a n y accounting figure can be joined to any o ther through the plr in only one way , so that t he graph is connected and acyclic. For instance, NCA and O R are connected to each Keiv ani el al. ( 2026 ) , arX iv . 5 other only through the log ( NCA/CA) and log ( OR /CA ) p l r. Edges can be drawn as a rr ows without affecting the graph pr operties, the arrows pointing at the numerator of the plr for clarification purposes. Figure 1. Connected a cyclic graph among the D =7 acc ounting figures D - 1 plr can routinely be us ed as a se t of predicto rs in s tatistical model s , including logistic regression models (Coe nders & Greenacre, 2023; Coenders & Pawlows ky - Glahn, 2020). Furthermor e , the predictions are invar iant to the particu lar cho ice of D - 1 plr , as long as each acc ount ing figure can be c onnected to any other in only one way (Coenders & Pawlowsky - Glahn, 2020). More th an D -1 pl r would le ad to pe r fect collinearity . The distances among companies do depend on the plr choice. Rather th an selecting D - 1 plr , using all the possible D ( D -1)/2 plr is a better option for methods base d on di stance s, such as the k - nearest - neighbour method, because Euclidean distances computed on all the possible D ( D - 1)/2 plr ar e equivalent t o the stan dard d istance measur e for CoDa, term ed Aitchison’s distanc e ( Aitchison, 1983; Aitchis on et al., 2000) . M e thods which have the ability to s elect th e best predictors from a large ava ilable set , such as random forests , will also benefit from us ing all the pos sible D ( D - 1)/2 plr. Besi des, with random forests a nd k - nearest neighbours , t he predictions depend on the plr choice when using only of D - 1 of them , which makes results arbitrary and non - r eplicab le. Which accounting figure goes to the n umerator or the denominator of the plr does not change th e results. In our case, the set of D ( D -1)/2 =21 possible plr is: NCL OR CA NCA log( NCA/CA) RE OE CL log( RE / NCL ) log( NCL / CL ) log( CA / CL ) log(OR/ OE ) log( OR / CA ) Keiv ani el al. ( 2026 ) , arX iv . 6 log(NCA/CA), log (NCA /RE ), log(NCA/NCL ), log (NCA /CL ), log(NCA/O R ), log(NCA/OE ), log (CA/ RE) , log (CA/ NCL ), log (CA/ CL) , log(OR/ CA ), log(CA/OE ), (4) log (RE /NCL ), log( RE /CL ), log(RE/OR ), log(RE/OE), log (NCL /CL ), log(NCL/OR ), log( NCL/OE ), log(CL/OR ), log(CL/OE), log(OR /OE) . Thus, the 10 predictors in Equation s ( 1) and (2) w e re used for the standard - ratio approach under all three prediction methods. The predictors in Equation ( 3) w er e used for logistic regression under the compositional approac h, an d the predictors in Equation (4) were used for random fo r ests and k - nearest neighbours under the compositional approach. Statistic al analys is The logistic regres sion model was fi tted with the gl m function in t he basi s R sof twa re (R core team, 2025). The k - n earest - neighbour class ification was carried out with the knn function, which is part of the class R library ( V enables & Ripley , 2002) by tuning the best k value. T h e r andom- for est classificat ion used t he randomForest function in the homonymous R l ibrary (Breiman, 2001), by gener ating 100 trees a nd randomly selecting 4 predictors in the standard - ratio approac h and 7 in the compositional approach, which is about one third of the 10 standa rd and 21 compos it ional ratios. T he sampl e was randomly divided into a training subset ( 70 % of the sample) and a validation subset to as sess predictive q uality (30 % of the sample) . The training subse t was downs ampled and one hea lthy firm wa s randomly selecte d for each bankrupt firm. The validation subset wa s kept entirel y . Thr ee commo n precision measur es computed from the validation subset are: 1. p r ed ictive accu racy (over all per centage o f correct predi ctions) , 2. sens itivity ( percent age of t r ue bankr upt companies predicted as so), 3. spec ificity ( percen tage of tr ue heal thy com panies predicted as so) , 4. balanced accur acy (arit hmeti c average of sensitiv ity and s pecificity) . Keiv ani el al. ( 2026 ) , arX iv . 7 Data collection and p re -pr ocessing W e def in ed a company as being bankrupt if under cr edit ors’ meeting ( “concurso de acreedor es” in Sp ain). To predict i f a company was bankr upt in the last av ailable y ear , the standard and compos itional ratio values of the year immediat ely before wer e u sed . Data gathering and p r e -processing were as follows : 1. Data fr om the sector in the Spanish economy w ith t he largest number of firms under creditors’ meetings in 2024 accordin g to t he first two digits o f the NACE code (4 6 XX “w holesale t rade, except of mot or vehicles and m otor cycles ” ) were selected. Filt ers incl uded having 5 or more empl oyees and data availabl e until at least 2023 . The data w ere obtained on 6/10/2025 fr o m t h e Iberian Balance sheet Ana lysi s Sy stem ( SA BI , access i ble at ht tps://sabi. bvdinf o. com/) database, developed by INFORMA D&B in collaboration wi th Bureau V an Dijk. 2. The sample size was 31,131 f irms, of which 97 we r e bankrupt. 3. No outliers w ere be remov ed, only inactiv e f ir ms having either zero total assets (NCA+CA) , zer o OR , an d zero OE duri ng the last available year or t he year befor e the last . 4. The r emaini ng zero s w ere imputed with the l og - ratio EM method ( Palar ea - Albalade jo & Martín - Fer nández, 2008) avai lable in the freewar e Co DaPack (Co mas - Cufí and Th ió - Henestrosa, 2 011; downloa dable at https://ima.udg. edu/codapac k/ ). The per centages of zero s were 1. 38 % for NCA , 0.01 % for CA , 5.54 % for RE, 22.58 % for NCL, and 0.05 % f or CL . 5. The sample was randomly divided into a training set (70 % of cases ) and a validation set (30 % of cas es). The tr aining set had 21,791 fi rms, of which 69 wer e bankrupt. T he validation s et had 9, 340 firms, of which 2 8 were bankrupt. T he downsampled training set thus contained 69 healthy and 69 bankrupt firms, and the validation set 9,312 healthy and 28 bankrupt fi r ms. Th e same s et s were used for all three methods bo t h for the stand ard and co mpositional approaches. T able 1 shows the sk ewness and ku rto sis statistics of the s tandard ratios in Equations (1) and (2) , and the plr in Equation (4) f or the full samp le. Sk ewness serves as a measure of asymmetry , and kurtosis as a mea sure of heavy distribution tails, in other words , out liers. A normal distribution uncontaminated by outliers has z ero skewnes s and kurtosis. All standard ratios but one have ve r y high s kewness and extr eme kurtosis , making them o n the paper unfit fo r statis tical modelling and non - robus t machine learning m ethods except after pruning outliers . Pr un ing outliers following the standard criterion ( e. g. , Ma rtin - Melero et al., 2025) of values above the thir d quar tile plus 1.5×( inter quartile range ) or below the firs t quartile minus 1.5×( inte rquartile r ange ) would r esult in dropping 9,544 cas es from the full s ample, which is 3 0. 7 % of the companies , thus s er iously impai ring the sampl e repr esentativen ess. In t he case of p lr , only the margin r atio log(OR /O E) has high kurtosis, but to a much l esse r extent than the standard ratios. The rest of the plr behave w ell. In contrast wi th Magrini (2025), w e analyse the full s ample without pruning outliers. Keiv ani el al. ( 2026 ) , arX iv . 8 T able 1. Skewness and kurt osis of sta ndard ratios and plr in the full sample Sk ewne ss Kurtosis (CA - CL) /( NCA+CA ) -30.3 2220.3 RE/ (NCA +CA) 1.3 4.9 (O R- OE)/( NCA+CA) -23.9 1843.6 (NCA +CA - NCL - CL ) /(N CL+CL ) 129.9 19809.4 OR /(NCA+CA) 63.9 621 1.5 (O R- OE) /CL -167.6 29201.4 CA/CL 167.5 28935.9 (NCA +CA - NCL - CL ) /(N CA+CA) -44.7 3617.9 (O R- OE)/( NCA+CA - NCL - CL) -161.2 27301.5 (NCL +CL )/ (NCA +CA) 44.7 3617.9 log(NCA/CA) -0.9 2.8 log (NCA /RE ) 0.8 2.8 log(NCA/NCL ) 0.7 0.2 log (NCA /CL ) -0.9 2.9 log(NCA/O R) -0.9 3.0 log(NCA/OE ) -0.9 3.1 log( CA/ RE) 1.8 3.4 log (CA/ NCL ) 0.8 -0.4 log (CA/ CL) 0.9 6.3 log(OR/ CA ) -0.2 5.7 log(CA/OE ) 0.1 4.7 log (RE /NCL ) 0.3 0.1 log( RE /CL ) -1.2 2.0 log(RE/OR) -1.4 2.5 log(RE/OE) -1.4 2.5 log (NCL /CL ) -0.8 -0.3 log(NCL/OR) -0.8 -0.3 log(NCL /OE) -0.8 -0.3 log(CL/OR) -0.2 5.2 log(CL/OE) -0.3 6.1 log(OR /OE) -3.8 277.2 R esul ts According to the logistic r egression model with standard ratios , the relevan t predictor s (5% signi ficance) are ( NCA+CA - NCL - CL)/ (NCL +CL), (O R- OE) /C L , and CA/CL but, due to outliers , some p redicted probabilities are numer ically equal to 0 and 1, whic h af fects the r eliability o f pr ecisely the si gnif icance measures . W ith compositio nal ratios , no such problems are enc ount ered, and the rele vant predictors are l o g( OR/ CA) and log (RE /NCL ). T he k - nearest - neighbour method do es not produce any mea sure of variable impor tance. 5 nearest neighbours yie ld the highe st accuracy . The random - fo rest method can assess the im port ance of a predict or by th e mean decrease of Gini’ s impurity if the predictor is removed. The be st standard ratio s are (NCL+ CL)/(N CA+C A), (NCA+CA - NCL - CL) /( NCL +CL) , an d (NCA +CA - NCL - CL) /( NCA+CA), all connected to the br oad concept of long - ter m indeb tedness and lever age, and the best c ompositional ratios are Keiv ani el al. ( 2026 ) , arX iv . 9 log (RE /NCL ), log(CA/NCL ) , and log(NCL/O R) , a l so connec t ed to non - current liabilities and thus to long - term indebtedness. Figure 2. Te n most important predictor s for the random - forest method with standa r d (le ft) and compositional (right) r atios T able 2 . Pr ecision mea sures of the logistic regress i on model, 5- nearest ne ighbours, and the r andom forests with standard and compositional ratios Prediction method Accu r acy Sens itivi ty Spe c if ici ty Bal anced ac cura cy Logistic regressi on (sta ndard ) 66 % 82 % 66 % 74 % Logistic regressi on (c omposi ti o nal) 67 % 86 % 67 % 76 % k- nearest n eighbours (s ta n dard) 70 % 75 % 70 % 73 % k- nearest n eighbours ( c o mpositional ) 58 % 89 % 58 % 74 % Random fores t s (standard) 62 % 75 % 62 % 68 % Random fores t s (composit i on al) 65 % 89 % 65 % 77 % T able 2 contains the precision meas ur es for all app roaches computed from the va l idation set. Compositional methods a lways have highe r sensitivity that thei r standard counterpa rts. Random forests be at them also in spe cificity . Compos itional and standard logistic regression ha ve about the same specificity . Compos itional k - nearest neighbours perform worst in terms of s pecificity . T he overall a ss essment by the balanced accu racy i s that compositional random forests and compositional logistic regres sion perform b est. Disc ussi on Bankruptcy prediction is one of t he most commo n uses of financial ratios and yet a n under- researched area in the CoDa me thodology . This article adapt s Altman’ s prediction model and some of its adaptations to this methodology w ith t hr ee common pr ediction methods, one being a statis t ical model and the remaining two machine - learning method s, and compares the performance of standard and co mpositional financial ratios, computed a s p l r. Keiv ani el al. ( 2026 ) , arX iv . 10 As reported in the literature, the C oDa methodology improves on the skewness and kurtosis of the analysis var iables and outlier pruning, which can improve the fulfilment of statis tical ass umptions bu t can al so affect th e sample rep resentati vity , i s avoided . Unlik e research d one so far with compositional bankruptcy prediction ( Magrini , 2025) , we have us ed the full s ample . The results show that be yond the fu lfilment o r not of sta tistical ass umptions, the comparison between the standard and compositional approach is also a matter of predi ctive per forman ce. C ompos itional methods a r e better in terms of sens itivity , with mixed results regarding specificity , compositiona l random forests and compositional logistic regression beha ving the best. Magrini ( 2025) did not f ind any substa ntial difference betwe en standard and compos itional rat ios for logistic r egression and random fo rest s, but this a uthor pruned the extreme outliers. In spite of their known sens i tivity to outli er s, when it comes to predi ctive per formance, log istic r egression and k nearest neighbours are just mar gi nally wor se fo r the outlier - r ich standard r atios. In spite of their known robustness t o outliers, when it comes to predictive performance, random fo r ests be have much worse for standard ratios. W e ar e of course awar e that th ese findi ngs are no t generalizable beyond this pa r ticular sector of the Spanis h economy an d the par ticul ar accounting figures and r atios chos en . The possibility o f using other log - r atio trans formations than plr deserves a comment . Centred log - ratios a.k.a. clr (Aitchison, 1983) a re coherent with Aitchison’ s distances and therefore can be used f or t h e k - near est - neighbour method with identica l results as the D ( D - 1)/2 plr . The family o f isometric log - rati o transfor matio ns a.k. a. ilr (Egozcue et al., 2003) has the same property : a ny ilr trans formation can be used for the k - n earest - neighbour method with identical res ults as the clr and the D ( D - 1)/2 plr . Any ilr transformation can also be used as p redictor in a lo gistic regres sion model with iden tical for ecasts as the D - 1 plr ( Coenders & Pawlows ky - Glahn, 2020). Tr e e - based methods in general and r andom forests in particula r cons tit ute another story . While t h e i lr family of transformations is a respect able choice (Magrini, 2025) , it m ust be borne in mind that the forecasts depe nd on the particular ilr selected within the fam ily ( T empl & Gonzalez - Rodriguez, 2024 ) , and that any of these forecasts als o differ from those obtained with the D ( D - 1)/2 plr . T he main advantage of plr i s th at t her e is only one way o f selecting all possible D ( D - 1)/2 plr , which makes t he r esults le ss a rbitrar y and always repl icable ( E ngle & Chaput, 2021 ; T olosana- Delgado et al., 2019 ) . O n the contrary , using all possible ilr is generally u nfeasib le. W ith ju st D =7 accou nting fig ures ther e are 1 . 141 distinct balanc es for the ilr transformation and it is easy to imagine what can happen when D is lar ge. clr have also be en used for random forests ( Nathwani et al., 2022), being als o r eplicable, but they yield different r esults from both ilr and plr . T olos ana- Delgado et al. ( 2019) reported su peri or perfo rmanc e of p lr ove r ilr o r clr . Fur ther r esearch can include adapting other machine - learning methods li ke neural networks and deep learning , naï ve B ayes classifi er , partial- lea st - squar es disc riminant analysis (PLS - DA) , s upport vector machines (S VM) , natural gr adi ent boosting (NGBoost), adaptative boosting ( AdaBoos t), cate gorical boosting (CatB oostc lassifier), Keiv ani el al. ( 2026 ) , arX iv . 11 light gradient boosting (lightGBM ) , and extreme gradient boosting (XGBoost) . Th e so - called hybrid models , ens emble methods , or m eta models combine more than one machine - learning approach and are also worth explori ng (Akusta & Gün , 2025; A si f et al., 2025 ; Choha n et al . , 2026; Huss ain et al., 2026 ; Liu , 202 5b ; V eganzones e t al., 2026 ). Each m achine lear ning method can have a differ ent set of compa ti bl e log - ratio transformations within the CoDa methodology an d t his has to be e xpl o r ed in the first place. Besides , each machine lea rning method may be robust to outliers to a dif f erent extent, if at all . Additional metrics to compare and int erpret the re sults of the methods can also be used (Antar et al., 2025; Chan et al., 2025 ; Thota et al., 2026 ) , like the positive and negative pr edicti ve values , Cohen’ s kappa , th e F 1 s c o re , and the ar ea under the r eceiver operating ch ara cteri stic (RO C) curve . The ch oice of the bes t method may be based not only on predictive quality but also on convenience and interpretability ( Antar et al., 20 25; Chan et al., 2025; Gwalani et al., 2025 ; Hussain et al., 2026; L in e t al., 2025; Molnar , 2020; 2025 ) . Some methods provide an explic it an d interpr etable decision rule and s ome do not, some methods make it possible to s elect the be st predictor subset a nd some do not, some are we ll known to a wide pr actiti oner au dience and som e are not , or not to th e same extent. When machin e learning methods are not inte r pretable ante - hoc, i nterpreta bility of machine learning result s can be enhanced post-hoc by means of the so - call ed explaina ble artific ial intelligenc e (X AI) methods, Shapley additive expl anations - SHAP - ( Antar et al., 2025; Chan et al. , 2026; Hus sain et al., 2026 ; Lin e t al. , 2025 ) an d l ocal in t erpr etable model - agnostic explanations - LI ME - (Antar et al., 2025; Lin et al., 2025) being the most popular . A strong limitation of our s tudy is that o ur sample had a very small number of bankrupt firms. Using a larger portion of the country’ s economy than the 2 - digit NACE clas sificatio n is a po ssibility , although predictio n s m ay be industry - specific ( Balzano & Magrini, 2025). Acknow ledgments This work was supported by the Spanish Ministry o f Science, Innovation a nd Universities MCIN/AEI/10.13039/501 10001 1033 and ERDF - a wa y of making Europe (grant number PID2021-123833OB- I 00); the Spanish Ministry of Health ( grant number CIBERCB06/02/1002); and AGAUR and the Dep ar tment of C limate Action, Food a nd Rural Agenda of Generalitat de Catalunya (grant number 2023 - CLI MA - 00037). Conf lict s of interest The au thors decl are no confl ict of interest. Data availability The cu rated dat a are available fro m the correspond ing author upon reas onable request. Keiv ani el al. ( 2026 ) , arX iv . 12 References Aitchison, J. ( 1982). T he statistica l analysis of compos itional data (with discuss ion). Journal of the R oyal Sta tistical Society Ser i es B ( Statistical Methodology ), 4 4(2 ), 139-177. https ://doi.org/10.1 1 1 1/j. 2517 -6161.1982.tb01 195. x Aitchison, J. (1983). Principal component analysis of compositional data. Biometrik a , 70(1), 57 - 6 5. ht tps://doi .org/10.1093/biomet/70 . 1.57 Aitchison, J. (1986). T he statistical analysis of compositional data. Monographs on statistics and applied pr obabili ty . London, UK : Ch apman and Hall. Aitchi son, J. , Bar celó - V idal, C., Martín - Fernán dez, J. A., & Pawlowsky - Glahn V . (2000). Logratio analysis and compositional dista nces. Mathe matical Geology , 32(3 ), 271 - 275. ht tps://doi .org/10 .1023/A:1007529726302 Akusta, A., & Gün, M. (2025). Predicting f inan cial distress in the textile industry: a comparative analysis o f meta models and s ingle cla ss ifiers. International Journal of Economics, Business and Politics , 9(1), 20 -36. https://doi.org/10.29216/ueip.1599431 Altman, E. I. (1968). Financial ratios, discrimina nt analysis and the prediction of corporate bankruptcy . The Journal of Finance , 23 ( 4), 589 - 609. https://doi.org/10.2307/2978933 Ama t, J. M ., M artín ez, J . I. & R oure , J. (2008). T ransformarse o de sapare cer: estrategias de la empr esa familiar para c ompetir en el s igl o XXI . Barcel ona, ES: Ediciones Deusto. Antar , M., Hassan, R., & B arka, D. (2025). From black box to clarity: m achine learning and agnostic techniques in cr edit r isk management. Journal of Corpor at e Accounting & Finance . Epub ahe ad of p rint. https: //doi.org/10.1002/jcaf.70020 Antar, M., & T ayachi , T . (2 025). Parti al depen dence analy sis of fi nancial rati os in predicting company defaults : r andom forest vs XGBoost models. Digital Finance , 7, 997 -1012 . https://doi.org/10 . 1007/s42521-025-00135-6 Arimany - Serr at, N., Farr eras -Noguer , M. À ., & Coenders, G. (2022). New developments in financial s tatement analy sis . Liquidity in the winery sec tor . Accounting , 8(3), 355-366. https ://doi.org/10.5267/j.ac.2021.10.002 Asif, S., Hemalatha, T ., & V i vekananda R eddy , D. ( 2025). Advancing co r porate bankruptcy forecasting: a multi - model approa ch with enhanced imbalance handling. I nternational Journal for Res earch in A pplied Science & Engineering T echnology ( IJ RASE T ), 13(XII), 183 -793. https://doi.org/10.22214/ijraset.2025.76171 Keiv ani el al. ( 2026 ) , arX iv . 13 Balzano, M., & M agrini, A . (2025). No eas y way out: disse cting firm heterogeneity t o enhance default risk prediction. S INERGI E , 43, 161 -183. https://dx.doi. org /10. 7433/s128.2025 .07 Breiman, L. (2001). Random forests. Machine Lear ni ng , 45(1), 5 - 3 2. https://doi.org/10.1023/A:1010933404324 Carreras Simó, M., & C oenders, G . (2021). T he rela tionship between as set and capital structure: a compositional appr oach wi th panel vector autoregr ess ive models. Quantitative Finance and Econo mics , 5(4), 571 -590. https://doi.org/10.3934/QFE.2021025 Ch an, C . P ., T sai , C. H ., T an g, F . K., & Y ang , J. H. (2 025) . A SHA P - based compar ative analys is of machine lea r ning model inte r pretabili ty in financ ial clas sification tas ks. Journal of A pp lied Economic Sc iences , 3(89), 385 – 400. https://doi.org/10.57017/jaes .v20. 3(89).03 Chen, Y ., L i, R., & W ang, X. (2025). Ear ly warning of financial distress in listed compani es: a machine l earni ng appr oach . Fr ontiers in Busines s and Finance, 2 (2), 229-244. https ://doi.org/10.71465/fbf378 Chohan, M. A., Li , T . , Abr ar , M., & Butt, S. ( 2026). Deep hybrid CNN - LS TM - GRU model fo r a financia l ris k early wa rning sys tem. Ri sks , 14( 1), 14 . https://doi.org/10.3390/risks14010014 Coenders, G. (2025). Application aux ratios finan ciers. I n F. Bertrand , A. Gégout- Petit, & C. Thomas- Agnan (Eds.), Données de composition , (pp. 227 - 244) . Pa ri s, FR: Éditions TECHNIP . Coenders, G., & A r imany - Serrat, N. (2025). An álisis es tadístico de la info rmac ión financiera. Metodología composicional . Girona, ES: Documenta Universitaria. https://doi.org/10.331 15/b/ 9788499847016 Coenders, G., & Arimany - Serrat, N. (in p ress). T en years of compositional analysis of financial ratios . Review , guidelines and f uture dir ections. Europe an A ccounting and Management Review . Coenders, G., E gozcue, J. J., Fačevicov á, K., Navar ro - Ló pez, C., Palar ea - Albal adejo, J., Pawlowsky - Glahn, V ., & T olosana - Delgado, R. (2023a). 40 years after Aitchison’ s arti cle “The st atistical analysi s of com positi onal data”. Wher e we are and w here we are headi ng. SORT . Statistics and Operations Rese arch T rans actions , 47(2), 207 - 228. ht tps://doi .org/10 .57645/20. 8080 .02.6 Coenders, G., & Gr eenacre, M. (2023) . Three approaches to supervised learning for compos itional data with pai rwise logratios . Journ al of Applied S tatistics , 5 0 (16 ), 3272-3293. https ://doi.org/10.1080/02664763 .2022.2108007 Keiv ani el al. ( 2026 ) , arX iv . 14 Coenders, G., & Pawlowsky - Glahn, V . (2020). On interpretations of tests and ef fect sizes in regression models with a c ompositional predictor . SORT . Statistic s and Operations Res earch T ransactions , 44( 1), 201 -220. https://doi.org/10.2436/20.8080.02.100 Coenders, G., Sgo r la, A. F ., Arimany - Serrat, N., Li nares - Must arós, S ., & Farre ras - Noguer , M. À. (2023b) . Nuevos m étodos es tadísticos composicionales para el anális is d e ratio s contables. Revista de Comptabilit at i Di rec ció , 35 , 135 -148. Cojocaru, C., & Iones cu, S. ( 2025). Mapping m achine learning rese arch networks in financial ris k p r ediction. B usiness Excell ence , 19(1), 1248 -1259. https://doi.org/10.2478/picbe-2025-0099 Comas - Cufí, M. , & T hió - Henestrosa, S. (201 1). CoDaPack 2.0: a stand - alone, multi - platform compositional software. In J. J. E gozcue , R. T olosana - Delgado, & M. I. Ortego (Eds.), CoDaW ork'1 1: 4th international wor kshop on compositional data analysis. Sant Feliu de Guíxols , (pp . 1 - 10). Girona , ES: Universitat de Girona . Creixan s - T ena s, J ., Coe nde rs, G ., & Ari ma ny - Serr at, N. (2019). Corporate social responsibility and f inancial pr ofile o f Spanish p rivate hospitals . Heliyon, 5(10), e02623. https://doi.org/10.1016/j.heliyon.2019.e02 623 Dao, B. T. T ., Coe nders , G ., Lai , P . H., Dam , T . T ., & T ri nh , H. T . (2024). An empirica l examination of financial performance and distres s profiles during Covid - 19: the case of fishery and food production firms in V ietnam. Journal of Financial Reporting and Accounting . E pub a head of p rint. https://doi.org/10.1 108/JFRA-09- 2023-0509 Dam, T . T . T ., & Nghiem , T . L. (2026). Applying machine learning to predict fina ncial distr ess: a case study of V ietnam ese real estate co mpanies. International Journal of Management Studies and Social Science R esear ch , 8(1), 1 12 - 120 . https://doi.org/10.56293/IJMSSSR.2026.6008 Dumitrescu, D., Bobitan, N. , Popa, A. F . , S ah lian , D . N. , & Stanila , C. A. (2025). Signaling financial distres s through A - scores and corpor ate gov ernance co mplian ce interplay: A r andom fo rest approach. Electronics , 14, 2151. https://doi.org/10.3390/electronics141 12151 Durana, P ., Kova l ova, E., Blazek, R., & B icano vska, K. (2025). Bus iness ef ficiency: Insights from V isegrad four before, during, and aft er the COVID - 19 pandemic. Economies , 13(2), 26. https://doi.org/10 . 3390/econ omies13020026 Egozcue. J. J ., & Pawlowsky - Glahn V . (2019). Co m positional data: the s ample spac e and its s tructure. TEST , 28(3), 599 –638. https://doi .org /10.1007/s1 1749-019- 00670-6 Egozcue, J. J ., Paw lowsky - Glahn, V . , Mateu - F igueras, G., & Barceló - V idal, C. (2003). Isometric log ratio tran sformations for compositional d ata analysis . Mathematical Geology , 35(3), 279 –300. https://doi.org/ 10. 1023/ A:1023818214614 Keiv ani el al. ( 2026 ) , arX iv . 15 Engle, M. A., & Chaput, J. A. (2021). Groundwater origin determination in historic chemical datas et s through supervised compositional data analysis: B r ines of the Per m ian Basi n, US A. In P . Fi lz mos er, K . Hr o n, J. A. Ma rtín - Fernánd ez, & J. Palar ea - Albaladejo (Eds.), Advances in c ompositional data analysis: fes tschrift in honour of V era Pawlowsky -Glahn , (pp. 265 - 283). Cham , CH : Springer . https://doi.org/10.1007/978-3-030-71 175- 7_14 Escaram ís , G., & Arbussà, A. (2025). Considerations on the use o f financial ratios in the study of family businesses . arX iv , 2501.16793. https://doi.org/10.48550/arXiv .2501. 16793 Farr eras - Noguer , M. À., Carreras - Piju an, M., L inares - Must ar ós, S. , & Ar im any - Serrat, N. (in press). Subsidies and financial pe rforman ce in the rural tourism se ctor . Europe an Accounting and Management Review . Gabrielli, G. , Melioli, A ., & Bertini, F . (2026) . Corporate financ ial distress pr ediction: a machine learning approach in the era of big data. Journal of Accounting & Or ganizational Change , 22 ( 7), 31 - 6 5. https://doi.org/10.1 108/JAOC-05-2025- 0166 Gajdosikova, D., V alaskova, K., & Durana, P . (20 26) . Cross - national benchmarking of bankruptcy prediction models across V4 economies. International Journal of Econ omic Scien ces , 15 (1), 158 -176. https://doi.org /10.31 181/ijes1512026223 Gnip, P ., Kanasz, R . , Z oričak, M., & Drotar , P . ( 2025). An e xper ime ntal s ur vey of imbalanced learning algorithms for bankruptcy prediction. Artific ial Inte lligence Review , 58(4), 104. h t tps://doi.org /10. 1007/s10462 -025- 111 0 7 -y Greenacre, M. (2019) . V ariable s election in compositional data analysis using pairwise logratios. Mathemati cal Geosciences , 51(5), 649 -682. https://doi.org/10.1007/s1 1004-018- 9754-x Greenacr e, M. , Grunsky , E., Bacon - Shone, J., Erb, I., & Quinn , T . (2023). Aitchison’ s compositional data ana lysis 40 yea r s on: A reappraisal. Statistical Sc ience , 38(3) , 386-410. https ://doi.org/10.1214/22-STS880 Grover , J., & L avin, A. (2001). Fi nancial ratios, d iscriminant analysis and the pr ediction of corporate bankrup tcy: a service indus t ry e xtension of Altman’ s Z - score model of bankruptcy prediction. W orking Paper . Southern Finance Assoc i ation. Gwalani, M., T eltumbde, G ., & Gw alani, H . (2025). T r ends in pr edict ing co r por ate distres s: a bibliometric and thematic a nalysis . NMIMS M anagement Review . , 33(4), 243-252 . https://doi.org/10 .1 177/0971 1023251360528 Hanun, R. U. , & Ferd iani, C. (2025). L iter ature analysis on finan cial distress and bankruptcy prediction. Fa irne ss , 1(1), 16 -30. https://doi.org/10.70764/gdpu - fr . 2025.1(1)-02 Keiv ani el al. ( 2026 ) , arX iv . 16 Hernan dez - Romer o, M. , & Coenders, G. (in p ress ). Financi al resil ience of agri cultur al and food production companies in S pain: A compositional clus ter analysis of the impact of the Ukraine - Russia wa r (2021 -202 3) . Eur opean Accounting and Mana gement R eview . Hia, S., Kuswanto, H., & Prastyo, D. D. (2023). Robustnes s of support vector r egression and random forest models : a simulation study . In Y . B. W ah, M. W . Berry , A. Mohamed, & D. Al - Jumeily (Eds.), Data sc i ence and em er ging technologies. DASET 2022 . L ectur e notes on data engineer i ng and communications tec hnol ogies, vol 165 , (pp. 465 - 479). Singapore , S G : Springer Natur e. https://doi.org/10.1007/978-981-99-0741-0_33 Hu ss ain , A. B ., Jing chun , S., H uss ain, B ., Sh aiqu e , M., R ehma n, A. U ., & B hat ti, M. K . (2026). Cross - market bankruptcy prediction: an i nterpretable ensemble learning framework using SHAP analysis. Computational Economics . Epub ahead of pri nt . https://doi.org/10.1007/s10614-025-1 1266-8 Insani, F ., P on toh, G. T ., & Said, D . (2024). Corporate bankruptcy prediction model: A sys tematic literature revie w . International Jour nal of Busines s, Economics and Management , 7(3), 143 -159. https://doi.org/10 .21744/ijbem.v7n3.2222 Iot ti, M ., Fe rri, G ., & Bonazzi, F . (2024). Financial ratios, credit risk and business strategy: Application to the PDO Parma ham sect or i n single production and non - single production firms. Journal of A gricultur e and Food Resear ch , 16, 101 122. https://doi.org/10.1016/j.jafr .2024.101 122 Jo fre - Campuza no, P ., & Coe nder s, G . (2022). Co mpositional classification of financial statement prof iles. The w eighted case. Journal of Risk and Financial Management , 15(12), 546. https://doi.or g/10. 3390/jrfm15120546 Kanojia, S., & Arora, A. ( 2025). Machine le arning for credit risk management through cr oss - economy evidence in default prediction. SN B usiness & E conomics , 5 (12 ), 221. ht tps://doi .org/10 .1007/s43546-025-00995- 5 Khotimah, K., & W anti W i dodo , U. P . (2026). A c om parative study of ANN and logistic regression for financial distress prediction in Indones ian manuf acturing firms. Jurnal Ilmu A kuntansi dan B isnis Syar i ah , 8(1), 53 - 7 2. https://doi.org/10.15575/aksy .v8i 1.52046 Kristanti, F . T ., Salim , D. F ., & Ihsan, A . F . (2025). Financia l distress pre diction in emerging markets by using a machine learning approach for f inancial sustainability . Discove r Sustainability , 6(1), 1296. https://doi .org/10.1007/s43621-025-02199-1 Lin, Y . C., Padlians yah, R., Lu, Y . H . , & Liu, W . R. ( 2025 ). Bankruptcy pr ediction: Integration of convolutional ne ural networks and explainable arti ficial intelligence techniques. International Journal of A ccounting In formation Systems , 56 , 100744. https://doi.org/10.1016/j.accinf.2025.100744 Keiv ani el al. ( 2026 ) , arX iv . 17 Lin ares - Mustarós, S., Coenders, G., & V ives - Mestres, M. (2 018). Financi al perfor mance and dis tress pr o f iles. Fr om classifica tion according to financial ratios to compos itional cla ss ification. Advances in Accounting , 40, 1 - 10. https://doi.org/10.1016/j.adiac.2017.10.003 Lin ares - Mustar ós, S. , Farr eras - Noguer , M. À ., Arimany - Serrat, N., & Coenders, G. (2022). New f inancial ratios based on the compositi onal data methodology . Axioms , 1 1(12), 694. https://doi. o rg/10.3390/ axioms1 1 120694 Liu, F . (2025 a ). Intelligent i denti f ication a nd earl y warning model fo r corporate financial risk based on de ep learning. Resear chSquare . https://doi.org/10 .21203/rs. 3 . rs- 7201319/v1 Liu, J., Li, Y ., & Y u, J. (2025). Predicting financial d istress w ith ESG‐ driven deep learning and risk‐bas ed stratification . International Journal of Finance & E conomics . Epub ahead of print. h t tps://doi.org/10 .1002/ijfe.70077 Liu, S. (2025 b ) . LT R - Net: A deep l earni ng - based approach for financial data prediction and risk evaluation in enterprises. P los one , 20(8), e0328013. https://doi.org/10.1371/journal.pone.0328013 Magrini, A. (2025). B ankruptcy risk p rediction: a new approach based on compositional analys is of financial sta tements. Bi g Dat a R esearch , 41 , 1000537. https://doi.org/10.1016/j.bdr .2025.100537 Martin - Melero, I., Gomez - Mar tinez, R., Medr ano - Garcia, M. L., & Her nández - Pe r lines , F . (2025). Comparison o f corporate ins olvency pr ediction in Spain e mploying fin ancial r atios in analyti cal model s and machine learn ing. Academi a Revista Latinoamericana de Administración , 38( 1), 134 -155. https://doi.org/10.1 108/ARLA- 04-2024-0060 Matsuma ru, M., & Katagiri, H. (2025). A two - stage machine lear ning appro ach to bankruptcy prediction: integrating f ull - feature modeling and optimize d feature selection . Journal of Risk and Financial Manageme nt , 18(12), 662. https://doi.org/10.3390/jrfm18120662 Mola s - Col omer , X., Fe rrer - Comalat, J. C., Carr eras - Pijuan, M., & L ina r es - Mustar ós, S. (2026). Beyond traditional ratios: predicting financial d istress through compos itional data and fuzzy cluste ring in ag r icult ural retail. Journal of Mu lti ple - V alued Logic and Soft C omputing . Mola s - Colom er , X., Li nares - Mustar ós, S. , F arr eras - Noguer , M. À., & Ferr er - Comala t, J. C. (2024). A new methodologi cal pr oposal for cla ssifying firms a ccording to the similarity of their financ ial structure s bas ed on co mbining compositiona l data with fuzzy cl usteri ng . Journal of Multiple - V alued Logic and Soft Com puting , 43, 73 -100. Molnar , C. (2020) . Interpr etable machine learning. A guide for making black box models explainable . Lulu.com. Keiv ani el al. ( 2026 ) , arX iv . 18 Molnar , C. (2025). Interpr eting machine lear ning models with SHAP: a guide w i th Python examples and theory on Shapley v al ues. Christoph Molnar . Mule t - F ort eza, C., F errer - Ros ell, B. , Martorell - Cunill, O., Linares - Musta ró s, S . (in press). The role of expansion strategies and oper ational attributes on hotel performance: a compositional approach. T ourism Management Perspec tives. https://doi.org/10.48550/arXiv .241 1. 04640 Nat hwa ni , C. L., W ilki nso n, J. J. , Fry , G., Ar mst rong , R. N., Smi th , D. J., & Ihle nf el d, C. (2022). Machine lea rning fo r geochemical explor ation: class if ying metallogenic fertility i n arc magmas and ins ight s into porph yry copper deposit formation. Mineralium Deposita , 57(7), 1 143 -1 166. htt p s://doi. org/10.1007/s 00126- 021- 01086-9 Nugroho, D. S., & De wayanto, T . (2025). Applica tion of statistics and a rtificial intelligenc e for corpo rate financial dis tress prediction mode ls: a sys tematic liter atur e review . Journal of Modelling in Management , 20(6), 1999 -2023 . https://doi.org/10.1 108/JM2-12-2024- 0412 Palar ea - Albaladejo , J., & Ma rtín - Fer nández, J. A . (2008). A modifi ed EM alr - algorith m for r eplacing rounded zeros in compositional da t a sets. Comput ers & Geosci ences , 34(8), 902 -917. https://doi.org/10 . 1016/j.cageo.2007.09.015 Pawlowsky - Glahn, V ., E gozcue, J . J., & T olosana - Delgado, R. (2015). Modeling and analysis of com positional data . Ch iches ter , UK: W iley . R Core T eam (2025). R: A language and envir onment for statis tical computing . V ienna, A T : R Foundation f o r Statistic al Computin g . https ://www .R -pr oject.or g/ . Soukal, I., Mačí, J., T rnková, G., Svobodova, L. , Hedvič áková, M ., Hamplova, E., Maresova, P ., & Lefley , F . (2024). A stat e - of - th e - art appraisal o f bankruptcy prediction models focussing on the field’ s cor e authors: 2010 -2022. Centr al Europe an M anagement Journal , 32( 1), 3 - 3 0. https ://doi.org/10.1 108/ C E MJ-08- 2022-0095 Springate, G. L . (1978). P r edicting t he possibility o f failu re in a C anadian fir m . Burnaby , BC: Simon Fras er Unive rsity . Staňková, M., & Hampel, D. (2023). Optimal th reshold of data envelopment analysis in bankruptcy prediction. SO R T . Statistics and Ope rations Resear ch T ransactions , 47(1), 129 -150. https://doi.org /10.57645/20.8080.02.3 T anaka, K. , Higashide, T ., Kinkyo, T . , & Hamori , S. (2025a). Financial mec hanisms of corpo rate ban krupt cy: ar e they different or similar across cri ses? R i sks , 13 (8), 158. https://doi.org/10.3390/risks13080158 Keiv ani el al. ( 2026 ) , arX iv . 19 T anaka, K., Higashide, T . , Kinkyo , T ., & Hamori , S. (2025 b ). A multi - stage fi nancial distress e ar ly warning s ystem: analyzing corporate insolve ncy with random fo r est. Journal of Risk and Financial Manageme nt, 18 (4), 195. https://doi.org/10.3390/jrfm18040195 T aoushianis, Z. (2025). Bankruptcy prediction with fractional polynomial trans f ormation of financial ratio s. Eur opean Journal of Operatio nal Resear ch , 327(1), 690 -702. https://doi.org/10.1016/j.ejor . 2025.07.036 Te m p l , M., & Gonzalez - Rodriguez, J . (2024). Advancing forensic r esea rch: An examination of compositional data analys i s with an applica tion on petrol fraud detecti on. Science & Ju sti ce , 64(1) , 9 -18. https://doi.org/10.1016/j.scijus.2023.1 1.003 Tho ta , N ., De sa i, G. , Puli , S., Subra hman yam , A. C . V ., & V ishw esw ars as try , V . N . (2026). Predicting bankruptcy in wholesa l e, retail, and motor vehicle repair: An AI - ML per spective. F1000Resear ch , 14, 1251 . https://doi.org/10.12688/f1000research.170279.2 T olosana- Delgado, R., T alebi, H. , Khodadadz adeh, M. , & V an den Boogaart, K . G. (2019). On machine learni ng algor ithms and compositional data. I n J. J . Egozcue, J. Gr aff el m an , & M. I. O rte go (E ds. ), Pr oceedings of the 8th inter national work shop on compositional data analysis , (pp. 172 - 175). Bar celona , ES : Catal an Pol ytechni c University - Barcel onaTE CH. T ran, K . P . (2025). Apprentissage automatique pour l’analyse de données de composition. In F . Bertrand, A. Gégout - Petit, C. Thomas - Agnan (Eds.), Données de compos ition , (pp. 85 - 108). Paris, FR: É ditions T ECHNIP . V egan zones, D. , & S é verin , E. (2021 ). Corporate fa ilur e prediction models in the twenty - fir st centur y: a r eview . E ur opean B usine ss Review , 33(2) , 204 -226. https://doi.org/10.1 108/E BR-12-2018-0209 V eganzones, D., Séverin, E., & Jabeur , S. B. (2026). Forecas ti ng corporate bankruptcy in imbalanced datas et s using a new hybrid machine learning approach. Research in International Business and Fi nance , 81, 103200. https://doi.org/10.1016/j.ribaf.2025.103200 V enab les, W . N ., & R ipley , B . D. (2002). Moder n a pplied statistic s with S, Fourth edition . New Y ork, N Y : Springer . https://doi.org/10.1007 / 9 78-0-387-21706- 2 W ang, Z., Che, W . , J iang, C. , & Zhao, H . (2026). Leveraging multiview data through discrete and regularized deep le arning for dynamic financial risk prediction. Information Systems Res ear ch . https://doi.org/10. 1287/i sre.2024.1417 Keiv ani el al. ( 2026 ) , arX iv . 20 W ill er do Prado, J., de Cas t ro Alcântar a, V ., de Melo Carvalho , F . , Carvalho V iei ra, K., Cruz Machado, L. K . , & Flávio T onelli, D . (2016 ). Multivariate analys is of credit risk and bankr uptcy r esearch data : a bibliometric study involving dif ferent knowledge fields (1968 -2014). Sciento metrics , 106(3), 1007 -1029. https://doi.org/10.1007/s1 1 192- 015-1829-6 Xu, Y . (2026). Big data and mac hine learning methods in financia l risk prediction . Scientific Journal of Economics and Manageme nt Resear ch , 8( 1), 53 -58. https://doi.org/10.54691/9eb6x521 Y uan, S. ( 2025) . Algo rithmic transformation of fina ncial risk identifica tion methods : from traditional mode ls to da ta - intelligent framew orks. Journal of Applied E conomics and Policy Studies , 18(7), 62 - 66. https://doi.org/10 . 54254/2977-5701/2025.25760 Zhao, W . (2025). Mac hine learning for sustainable financial sys tems: ass e ssing corporate resili ence and def ault r isk. Advances in Management and Intelligent T echnologies , 1(6), 1 - 11 . https://doi.org/10 .62177/ amit.v1i6.849 Zhu, X., S un , H., Chang, Y ., & Li, J. ( 2026). Predicting financial distres s using textual risk disclosures i n annual reports: How and what risks ar e disclos ed? The British Accou nting R eview , 101836. https://doi.or g/10. 1016/j.bar .2026.101836 Zmijewski, M. E. ( 1983). Ess ays on corporate bankruptcy . Docto ral thesis . State University of New Y ork at Buf f alo.

Adapting Altman's bankruptcy prediction model to the compositional data methodology

Original Paper

Comments & Academic Discussion

Leave a Comment

Original Paper

Related Papers

Comments & Academic Discussion

Leave a Comment