Dataset statistics
Number of variables | 11 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.9 MiB |
Average record size in memory | 196.9 B |
Variable types
NUM | 5 |
---|---|
CAT | 3 |
BOOL | 3 |
Reproduction
Analysis started | 2020-05-20 04:34:45.540283 |
---|---|
Analysis finished | 2020-05-20 04:34:58.955192 |
Duration | 13.41 seconds |
Version | pandas-profiling v2.8.0 |
Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
Download configuration | config.yaml |
CreditScore
Real number (ℝ≥0)
Distinct count | 460 |
---|---|
Unique (%) | 4.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 650.5288 |
---|---|
Minimum | 350 |
Maximum | 850 |
Zeros | 0 |
Zeros (%) | 0.0% |
Memory size | 78.2 KiB |
Quantile statistics
Minimum | 350 |
---|---|
5-th percentile | 489 |
Q1 | 584 |
median | 652 |
Q3 | 718 |
95-th percentile | 812 |
Maximum | 850 |
Range | 500 |
Interquartile range (IQR) | 134 |
Descriptive statistics
Standard deviation | 96.65329874 |
---|---|
Coefficient of variation (CV) | 0.14857651 |
Kurtosis | -0.4257256848 |
Mean | 650.5288 |
Median Absolute Deviation (MAD) | 67 |
Skewness | -0.0716066082 |
Sum | 6505288 |
Variance | 9341.860157 |
Value | Count | Frequency (%) | |
850 | 233 | 2.3% | |
678 | 63 | 0.6% | |
655 | 54 | 0.5% | |
705 | 53 | 0.5% | |
667 | 53 | 0.5% | |
684 | 52 | 0.5% | |
670 | 50 | 0.5% | |
651 | 50 | 0.5% | |
683 | 48 | 0.5% | |
660 | 48 | 0.5% | |
652 | 48 | 0.5% | |
648 | 48 | 0.5% | |
682 | 47 | 0.5% | |
640 | 47 | 0.5% | |
663 | 47 | 0.5% | |
637 | 46 | 0.5% | |
679 | 45 | 0.4% | |
714 | 45 | 0.4% | |
710 | 45 | 0.4% | |
645 | 45 | 0.4% | |
686 | 45 | 0.4% | |
687 | 45 | 0.4% | |
633 | 45 | 0.4% | |
646 | 44 | 0.4% | |
619 | 44 | 0.4% | |
Other values (435) | 8610 | 86.1% |
Value | Count | Frequency (%) | |
350 | 5 | 0.1% | |
351 | 1 | < 0.1% | |
358 | 1 | < 0.1% | |
359 | 1 | < 0.1% | |
363 | 1 | < 0.1% | |
365 | 1 | < 0.1% | |
367 | 1 | < 0.1% | |
373 | 1 | < 0.1% | |
376 | 2 | < 0.1% | |
382 | 1 | < 0.1% |
Value | Count | Frequency (%) | |
850 | 233 | 2.3% | |
849 | 8 | 0.1% | |
848 | 5 | 0.1% | |
847 | 6 | 0.1% | |
846 | 5 | 0.1% | |
845 | 6 | 0.1% | |
844 | 7 | 0.1% | |
843 | 2 | < 0.1% | |
842 | 7 | 0.1% | |
841 | 12 | 0.1% |
Geography
Categorical
Distinct count | 3 |
---|---|
Unique (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
France | |
---|---|
Germany | |
Spain |
Value | Count | Frequency (%) | |
France | 5014 | 50.1% | |
Germany | 2509 | 25.1% | |
Spain | 2477 | 24.8% |
Length
Max length | 7 |
---|---|
Median length | 6 |
Mean length | 6.0032 |
Min length | 5 |
Most occurring characters
Value | Count | Frequency (%) | |
a | 10000 | 16.7% | |
n | 10000 | 16.7% | |
r | 7523 | 12.5% | |
e | 7523 | 12.5% | |
F | 5014 | 8.4% | |
c | 5014 | 8.4% | |
G | 2509 | 4.2% | |
m | 2509 | 4.2% | |
y | 2509 | 4.2% | |
S | 2477 | 4.1% | |
p | 2477 | 4.1% | |
i | 2477 | 4.1% |
Most occurring categories
Value | Count | Frequency (%) | |
Lowercase Letter | 50032 | 83.3% | |
Uppercase Letter | 10000 | 16.7% |
Most frequent Uppercase Letter characters
Value | Count | Frequency (%) | |
F | 5014 | 50.1% | |
G | 2509 | 25.1% | |
S | 2477 | 24.8% |
Most frequent Lowercase Letter characters
Value | Count | Frequency (%) | |
a | 10000 | 20.0% | |
n | 10000 | 20.0% | |
r | 7523 | 15.0% | |
e | 7523 | 15.0% | |
c | 5014 | 10.0% | |
m | 2509 | 5.0% | |
y | 2509 | 5.0% | |
p | 2477 | 5.0% | |
i | 2477 | 5.0% |
Most occurring scripts
Value | Count | Frequency (%) | |
Latin | 60032 | 100.0% |
Most frequent Latin characters
Value | Count | Frequency (%) | |
a | 10000 | 16.7% | |
n | 10000 | 16.7% | |
r | 7523 | 12.5% | |
e | 7523 | 12.5% | |
F | 5014 | 8.4% | |
c | 5014 | 8.4% | |
G | 2509 | 4.2% | |
m | 2509 | 4.2% | |
y | 2509 | 4.2% | |
S | 2477 | 4.1% | |
p | 2477 | 4.1% | |
i | 2477 | 4.1% |
Most occurring blocks
Value | Count | Frequency (%) | |
ASCII | 60032 | 100.0% |
Most frequent ASCII characters
Value | Count | Frequency (%) | |
a | 10000 | 16.7% | |
n | 10000 | 16.7% | |
r | 7523 | 12.5% | |
e | 7523 | 12.5% | |
F | 5014 | 8.4% | |
c | 5014 | 8.4% | |
G | 2509 | 4.2% | |
m | 2509 | 4.2% | |
y | 2509 | 4.2% | |
S | 2477 | 4.1% | |
p | 2477 | 4.1% | |
i | 2477 | 4.1% |
Gender
Categorical
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
Male | |
---|---|
Female |
Value | Count | Frequency (%) | |
Male | 5457 | 54.6% | |
Female | 4543 | 45.4% |
Length
Max length | 6 |
---|---|
Median length | 4 |
Mean length | 4.9086 |
Min length | 4 |
Most occurring characters
Value | Count | Frequency (%) | |
e | 14543 | 29.6% | |
a | 10000 | 20.4% | |
l | 10000 | 20.4% | |
M | 5457 | 11.1% | |
F | 4543 | 9.3% | |
m | 4543 | 9.3% |
Most occurring categories
Value | Count | Frequency (%) | |
Lowercase Letter | 39086 | 79.6% | |
Uppercase Letter | 10000 | 20.4% |
Most frequent Uppercase Letter characters
Value | Count | Frequency (%) | |
M | 5457 | 54.6% | |
F | 4543 | 45.4% |
Most frequent Lowercase Letter characters
Value | Count | Frequency (%) | |
e | 14543 | 37.2% | |
a | 10000 | 25.6% | |
l | 10000 | 25.6% | |
m | 4543 | 11.6% |
Most occurring scripts
Value | Count | Frequency (%) | |
Latin | 49086 | 100.0% |
Most frequent Latin characters
Value | Count | Frequency (%) | |
e | 14543 | 29.6% | |
a | 10000 | 20.4% | |
l | 10000 | 20.4% | |
M | 5457 | 11.1% | |
F | 4543 | 9.3% | |
m | 4543 | 9.3% |
Most occurring blocks
Value | Count | Frequency (%) | |
ASCII | 49086 | 100.0% |
Most frequent ASCII characters
Value | Count | Frequency (%) | |
e | 14543 | 29.6% | |
a | 10000 | 20.4% | |
l | 10000 | 20.4% | |
M | 5457 | 11.1% | |
F | 4543 | 9.3% | |
m | 4543 | 9.3% |
Age
Real number (ℝ≥0)
Distinct count | 70 |
---|---|
Unique (%) | 0.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 38.9218 |
---|---|
Minimum | 18 |
Maximum | 92 |
Zeros | 0 |
Zeros (%) | 0.0% |
Memory size | 78.2 KiB |
Quantile statistics
Minimum | 18 |
---|---|
5-th percentile | 25 |
Q1 | 32 |
median | 37 |
Q3 | 44 |
95-th percentile | 60 |
Maximum | 92 |
Range | 74 |
Interquartile range (IQR) | 12 |
Descriptive statistics
Standard deviation | 10.48780645 |
---|---|
Coefficient of variation (CV) | 0.2694584128 |
Kurtosis | 1.395347062 |
Mean | 38.9218 |
Median Absolute Deviation (MAD) | 6 |
Skewness | 1.011320263 |
Sum | 389218 |
Variance | 109.9940842 |
Value | Count | Frequency (%) | |
37 | 478 | 4.8% | |
38 | 477 | 4.8% | |
35 | 474 | 4.7% | |
36 | 456 | 4.6% | |
34 | 447 | 4.5% | |
33 | 442 | 4.4% | |
40 | 432 | 4.3% | |
39 | 423 | 4.2% | |
32 | 418 | 4.2% | |
31 | 404 | 4.0% | |
41 | 366 | 3.7% | |
29 | 348 | 3.5% | |
30 | 327 | 3.3% | |
42 | 321 | 3.2% | |
43 | 297 | 3.0% | |
28 | 273 | 2.7% | |
44 | 257 | 2.6% | |
45 | 229 | 2.3% | |
46 | 226 | 2.3% | |
27 | 209 | 2.1% | |
26 | 200 | 2.0% | |
47 | 175 | 1.8% | |
48 | 168 | 1.7% | |
25 | 154 | 1.5% | |
49 | 147 | 1.5% | |
Other values (45) | 1852 | 18.5% |
Value | Count | Frequency (%) | |
18 | 22 | 0.2% | |
19 | 27 | 0.3% | |
20 | 40 | 0.4% | |
21 | 53 | 0.5% | |
22 | 84 | 0.8% | |
23 | 99 | 1.0% | |
24 | 132 | 1.3% | |
25 | 154 | 1.5% | |
26 | 200 | 2.0% | |
27 | 209 | 2.1% |
Value | Count | Frequency (%) | |
92 | 2 | < 0.1% | |
88 | 1 | < 0.1% | |
85 | 1 | < 0.1% | |
84 | 2 | < 0.1% | |
83 | 1 | < 0.1% | |
82 | 1 | < 0.1% | |
81 | 4 | < 0.1% | |
80 | 3 | < 0.1% | |
79 | 4 | < 0.1% | |
78 | 5 | 0.1% |
Distinct count | 11 |
---|---|
Unique (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.0128 |
---|---|
Minimum | 0 |
Maximum | 10 |
Zeros | 413 |
Zeros (%) | 4.1% |
Memory size | 78.2 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 5 |
Q3 | 7 |
95-th percentile | 9 |
Maximum | 10 |
Range | 10 |
Interquartile range (IQR) | 4 |
Descriptive statistics
Standard deviation | 2.892174377 |
---|---|
Coefficient of variation (CV) | 0.5769578633 |
Kurtosis | -1.165225227 |
Mean | 5.0128 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 0.01099145798 |
Sum | 50128 |
Variance | 8.364672627 |
Value | Count | Frequency (%) | |
2 | 1048 | 10.5% | |
1 | 1035 | 10.3% | |
7 | 1028 | 10.3% | |
8 | 1025 | 10.2% | |
5 | 1012 | 10.1% | |
3 | 1009 | 10.1% | |
4 | 989 | 9.9% | |
9 | 984 | 9.8% | |
6 | 967 | 9.7% | |
10 | 490 | 4.9% | |
0 | 413 | 4.1% |
Value | Count | Frequency (%) | |
0 | 413 | 4.1% | |
1 | 1035 | 10.3% | |
2 | 1048 | 10.5% | |
3 | 1009 | 10.1% | |
4 | 989 | 9.9% | |
5 | 1012 | 10.1% | |
6 | 967 | 9.7% | |
7 | 1028 | 10.3% | |
8 | 1025 | 10.2% | |
9 | 984 | 9.8% |
Value | Count | Frequency (%) | |
10 | 490 | 4.9% | |
9 | 984 | 9.8% | |
8 | 1025 | 10.2% | |
7 | 1028 | 10.3% | |
6 | 967 | 9.7% | |
5 | 1012 | 10.1% | |
4 | 989 | 9.9% | |
3 | 1009 | 10.1% | |
2 | 1048 | 10.5% | |
1 | 1035 | 10.3% |
Distinct count | 6382 |
---|---|
Unique (%) | 63.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 76485.889288 |
---|---|
Minimum | 0.0 |
Maximum | 250898.09 |
Zeros | 3617 |
Zeros (%) | 36.2% |
Memory size | 78.2 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 97198.54 |
Q3 | 127644.24 |
95-th percentile | 162711.669 |
Maximum | 250898.09 |
Range | 250898.09 |
Interquartile range (IQR) | 127644.24 |
Descriptive statistics
Standard deviation | 62397.4052 |
---|---|
Coefficient of variation (CV) | 0.8158028335 |
Kurtosis | -1.489411768 |
Mean | 76485.88929 |
Median Absolute Deviation (MAD) | 46766.79 |
Skewness | -0.1411087109 |
Sum | 764858892.9 |
Variance | 3893436176 |
Value | Count | Frequency (%) | |
0 | 3617 | 36.2% | |
105473.74 | 2 | < 0.1% | |
130170.82 | 2 | < 0.1% | |
113063.83 | 1 | < 0.1% | |
80242.37 | 1 | < 0.1% | |
134320.23 | 1 | < 0.1% | |
90218.9 | 1 | < 0.1% | |
155196.17 | 1 | < 0.1% | |
95386.82 | 1 | < 0.1% | |
125961.74 | 1 | < 0.1% | |
126606.63 | 1 | < 0.1% | |
82794.18 | 1 | < 0.1% | |
120782.7 | 1 | < 0.1% | |
167557.12 | 1 | < 0.1% | |
122338.43 | 1 | < 0.1% | |
128504.76 | 1 | < 0.1% | |
102016.38 | 1 | < 0.1% | |
190479.48 | 1 | < 0.1% | |
182065.85 | 1 | < 0.1% | |
124547.13 | 1 | < 0.1% | |
151933.63 | 1 | < 0.1% | |
118546.71 | 1 | < 0.1% | |
141806.46 | 1 | < 0.1% | |
98807.45 | 1 | < 0.1% | |
119703.1 | 1 | < 0.1% | |
Other values (6357) | 6357 | 63.6% |
Value | Count | Frequency (%) | |
0 | 3617 | 36.2% | |
3768.69 | 1 | < 0.1% | |
12459.19 | 1 | < 0.1% | |
14262.8 | 1 | < 0.1% | |
16893.59 | 1 | < 0.1% | |
23503.31 | 1 | < 0.1% | |
24043.45 | 1 | < 0.1% | |
27288.43 | 1 | < 0.1% | |
27517.15 | 1 | < 0.1% | |
27755.97 | 1 | < 0.1% |
Value | Count | Frequency (%) | |
250898.09 | 1 | < 0.1% | |
238387.56 | 1 | < 0.1% | |
222267.63 | 1 | < 0.1% | |
221532.8 | 1 | < 0.1% | |
216109.88 | 1 | < 0.1% | |
214346.96 | 1 | < 0.1% | |
213146.2 | 1 | < 0.1% | |
212778.2 | 1 | < 0.1% | |
212696.32 | 1 | < 0.1% | |
212692.97 | 1 | < 0.1% |
NumOfProducts
Categorical
Distinct count | 4 |
---|---|
Unique (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
1 | |
---|---|
2 | |
3 | 266 |
4 | 60 |
Value | Count | Frequency (%) | |
1 | 5084 | 50.8% | |
2 | 4590 | 45.9% | |
3 | 266 | 2.7% | |
4 | 60 | 0.6% |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Most occurring characters
Value | Count | Frequency (%) | |
1 | 5084 | 50.8% | |
2 | 4590 | 45.9% | |
3 | 266 | 2.7% | |
4 | 60 | 0.6% |
Most occurring categories
Value | Count | Frequency (%) | |
Decimal Number | 10000 | 100.0% |
Most frequent Decimal Number characters
Value | Count | Frequency (%) | |
1 | 5084 | 50.8% | |
2 | 4590 | 45.9% | |
3 | 266 | 2.7% | |
4 | 60 | 0.6% |
Most occurring scripts
Value | Count | Frequency (%) | |
Common | 10000 | 100.0% |
Most frequent Common characters
Value | Count | Frequency (%) | |
1 | 5084 | 50.8% | |
2 | 4590 | 45.9% | |
3 | 266 | 2.7% | |
4 | 60 | 0.6% |
Most occurring blocks
Value | Count | Frequency (%) | |
ASCII | 10000 | 100.0% |
Most frequent ASCII characters
Value | Count | Frequency (%) | |
1 | 5084 | 50.8% | |
2 | 4590 | 45.9% | |
3 | 266 | 2.7% | |
4 | 60 | 0.6% |
HasCrCard
Boolean
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
1 | |
---|---|
0 |
Value | Count | Frequency (%) | |
1 | 7055 | 70.5% | |
0 | 2945 | 29.4% |
IsActiveMember
Boolean
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
1 | |
---|---|
0 |
Value | Count | Frequency (%) | |
1 | 5151 | 51.5% | |
0 | 4849 | 48.5% |
EstimatedSalary
Real number (ℝ≥0)
Distinct count | 9999 |
---|---|
Unique (%) | > 99.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 100090.239881 |
---|---|
Minimum | 11.58 |
Maximum | 199992.48 |
Zeros | 0 |
Zeros (%) | 0.0% |
Memory size | 78.2 KiB |
Quantile statistics
Minimum | 11.58 |
---|---|
5-th percentile | 9851.8185 |
Q1 | 51002.11 |
median | 100193.915 |
Q3 | 149388.2475 |
95-th percentile | 190155.3755 |
Maximum | 199992.48 |
Range | 199980.9 |
Interquartile range (IQR) | 98386.1375 |
Descriptive statistics
Standard deviation | 57510.49282 |
---|---|
Coefficient of variation (CV) | 0.5745864221 |
Kurtosis | -1.181518447 |
Mean | 100090.2399 |
Median Absolute Deviation (MAD) | 49198.15 |
Skewness | 0.002085357662 |
Sum | 1000902399 |
Variance | 3307456784 |
Value | Count | Frequency (%) | |
24924.92 | 2 | < 0.1% | |
109029.72 | 1 | < 0.1% | |
182025.95 | 1 | < 0.1% | |
82820.85 | 1 | < 0.1% | |
30314.04 | 1 | < 0.1% | |
143265.65 | 1 | < 0.1% | |
148305.82 | 1 | < 0.1% | |
21254.06 | 1 | < 0.1% | |
56297.85 | 1 | < 0.1% | |
113481.02 | 1 | < 0.1% | |
185992.36 | 1 | < 0.1% | |
69370.05 | 1 | < 0.1% | |
76679.6 | 1 | < 0.1% | |
77469.38 | 1 | < 0.1% | |
179291.85 | 1 | < 0.1% | |
133172.48 | 1 | < 0.1% | |
59374.82 | 1 | < 0.1% | |
194700.81 | 1 | < 0.1% | |
168023.6 | 1 | < 0.1% | |
180456.8 | 1 | < 0.1% | |
68367.18 | 1 | < 0.1% | |
52581.96 | 1 | < 0.1% | |
22762.23 | 1 | < 0.1% | |
75888.65 | 1 | < 0.1% | |
21215.67 | 1 | < 0.1% | |
Other values (9974) | 9974 | 99.7% |
Value | Count | Frequency (%) | |
11.58 | 1 | < 0.1% | |
90.07 | 1 | < 0.1% | |
91.75 | 1 | < 0.1% | |
96.27 | 1 | < 0.1% | |
106.67 | 1 | < 0.1% | |
123.07 | 1 | < 0.1% | |
142.81 | 1 | < 0.1% | |
143.34 | 1 | < 0.1% | |
178.19 | 1 | < 0.1% | |
216.27 | 1 | < 0.1% |
Value | Count | Frequency (%) | |
199992.48 | 1 | < 0.1% | |
199970.74 | 1 | < 0.1% | |
199953.33 | 1 | < 0.1% | |
199929.17 | 1 | < 0.1% | |
199909.32 | 1 | < 0.1% | |
199862.75 | 1 | < 0.1% | |
199857.47 | 1 | < 0.1% | |
199841.32 | 1 | < 0.1% | |
199808.1 | 1 | < 0.1% | |
199805.63 | 1 | < 0.1% |
Exited
Boolean
Distinct count | 2 |
---|---|
Unique (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
0 | |
---|---|
1 |
Value | Count | Frequency (%) | |
0 | 7963 | 79.6% | |
1 | 2037 | 20.4% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
CreditScore | Geography | Gender | Age | Tenure | Balance | NumOfProducts | HasCrCard | IsActiveMember | EstimatedSalary | Exited | |
---|---|---|---|---|---|---|---|---|---|---|---|
0 | 619 | France | Female | 42 | 2 | 0.00 | 1 | 1 | 1 | 101348.88 | 1 |
1 | 608 | Spain | Female | 41 | 1 | 83807.86 | 1 | 0 | 1 | 112542.58 | 0 |
2 | 502 | France | Female | 42 | 8 | 159660.80 | 3 | 1 | 0 | 113931.57 | 1 |
3 | 699 | France | Female | 39 | 1 | 0.00 | 2 | 0 | 0 | 93826.63 | 0 |
4 | 850 | Spain | Female | 43 | 2 | 125510.82 | 1 | 1 | 1 | 79084.10 | 0 |
5 | 645 | Spain | Male | 44 | 8 | 113755.78 | 2 | 1 | 0 | 149756.71 | 1 |
6 | 822 | France | Male | 50 | 7 | 0.00 | 2 | 1 | 1 | 10062.80 | 0 |
7 | 376 | Germany | Female | 29 | 4 | 115046.74 | 4 | 1 | 0 | 119346.88 | 1 |
8 | 501 | France | Male | 44 | 4 | 142051.07 | 2 | 0 | 1 | 74940.50 | 0 |
9 | 684 | France | Male | 27 | 2 | 134603.88 | 1 | 1 | 1 | 71725.73 | 0 |
Last rows
CreditScore | Geography | Gender | Age | Tenure | Balance | NumOfProducts | HasCrCard | IsActiveMember | EstimatedSalary | Exited | |
---|---|---|---|---|---|---|---|---|---|---|---|
9990 | 714 | Germany | Male | 33 | 3 | 35016.60 | 1 | 1 | 0 | 53667.08 | 0 |
9991 | 597 | France | Female | 53 | 4 | 88381.21 | 1 | 1 | 0 | 69384.71 | 1 |
9992 | 726 | Spain | Male | 36 | 2 | 0.00 | 1 | 1 | 0 | 195192.40 | 0 |
9993 | 644 | France | Male | 28 | 7 | 155060.41 | 1 | 1 | 0 | 29179.52 | 0 |
9994 | 800 | France | Female | 29 | 2 | 0.00 | 2 | 0 | 0 | 167773.55 | 0 |
9995 | 771 | France | Male | 39 | 5 | 0.00 | 2 | 1 | 0 | 96270.64 | 0 |
9996 | 516 | France | Male | 35 | 10 | 57369.61 | 1 | 1 | 1 | 101699.77 | 0 |
9997 | 709 | France | Female | 36 | 7 | 0.00 | 1 | 0 | 1 | 42085.58 | 1 |
9998 | 772 | Germany | Male | 42 | 3 | 75075.31 | 2 | 1 | 0 | 92888.52 | 1 |
9999 | 792 | France | Female | 28 | 4 | 130142.79 | 1 | 1 | 0 | 38190.78 | 0 |