Dataset statistics
| Number of variables | 11 |
|---|---|
| Number of observations | 10000 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.9 MiB |
| Average record size in memory | 196.9 B |
Variable types
| NUM | 5 |
|---|---|
| CAT | 3 |
| BOOL | 3 |
Reproduction
| Analysis started | 2020-05-20 04:34:45.540283 |
|---|---|
| Analysis finished | 2020-05-20 04:34:58.955192 |
| Duration | 13.41 seconds |
| Version | pandas-profiling v2.8.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
CreditScore
Real number (ℝ≥0)
| Distinct count | 460 |
|---|---|
| Unique (%) | 4.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 650.5288 |
|---|---|
| Minimum | 350 |
| Maximum | 850 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 350 |
|---|---|
| 5-th percentile | 489 |
| Q1 | 584 |
| median | 652 |
| Q3 | 718 |
| 95-th percentile | 812 |
| Maximum | 850 |
| Range | 500 |
| Interquartile range (IQR) | 134 |
Descriptive statistics
| Standard deviation | 96.65329874 |
|---|---|
| Coefficient of variation (CV) | 0.14857651 |
| Kurtosis | -0.4257256848 |
| Mean | 650.5288 |
| Median Absolute Deviation (MAD) | 67 |
| Skewness | -0.0716066082 |
| Sum | 6505288 |
| Variance | 9341.860157 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 850 | 233 | 2.3% | |
| 678 | 63 | 0.6% | |
| 655 | 54 | 0.5% | |
| 705 | 53 | 0.5% | |
| 667 | 53 | 0.5% | |
| 684 | 52 | 0.5% | |
| 670 | 50 | 0.5% | |
| 651 | 50 | 0.5% | |
| 683 | 48 | 0.5% | |
| 660 | 48 | 0.5% | |
| 652 | 48 | 0.5% | |
| 648 | 48 | 0.5% | |
| 682 | 47 | 0.5% | |
| 640 | 47 | 0.5% | |
| 663 | 47 | 0.5% | |
| 637 | 46 | 0.5% | |
| 679 | 45 | 0.4% | |
| 714 | 45 | 0.4% | |
| 710 | 45 | 0.4% | |
| 645 | 45 | 0.4% | |
| 686 | 45 | 0.4% | |
| 687 | 45 | 0.4% | |
| 633 | 45 | 0.4% | |
| 646 | 44 | 0.4% | |
| 619 | 44 | 0.4% | |
| Other values (435) | 8610 | 86.1% |
| Value | Count | Frequency (%) | |
| 350 | 5 | 0.1% | |
| 351 | 1 | < 0.1% | |
| 358 | 1 | < 0.1% | |
| 359 | 1 | < 0.1% | |
| 363 | 1 | < 0.1% | |
| 365 | 1 | < 0.1% | |
| 367 | 1 | < 0.1% | |
| 373 | 1 | < 0.1% | |
| 376 | 2 | < 0.1% | |
| 382 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 850 | 233 | 2.3% | |
| 849 | 8 | 0.1% | |
| 848 | 5 | 0.1% | |
| 847 | 6 | 0.1% | |
| 846 | 5 | 0.1% | |
| 845 | 6 | 0.1% | |
| 844 | 7 | 0.1% | |
| 843 | 2 | < 0.1% | |
| 842 | 7 | 0.1% | |
| 841 | 12 | 0.1% |
Geography
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.2 KiB |
| France | |
|---|---|
| Germany | |
| Spain |
| Value | Count | Frequency (%) | |
| France | 5014 | 50.1% | |
| Germany | 2509 | 25.1% | |
| Spain | 2477 | 24.8% |
Length
| Max length | 7 |
|---|---|
| Median length | 6 |
| Mean length | 6.0032 |
| Min length | 5 |
Most occurring characters
| Value | Count | Frequency (%) | |
| a | 10000 | 16.7% | |
| n | 10000 | 16.7% | |
| r | 7523 | 12.5% | |
| e | 7523 | 12.5% | |
| F | 5014 | 8.4% | |
| c | 5014 | 8.4% | |
| G | 2509 | 4.2% | |
| m | 2509 | 4.2% | |
| y | 2509 | 4.2% | |
| S | 2477 | 4.1% | |
| p | 2477 | 4.1% | |
| i | 2477 | 4.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 50032 | 83.3% | |
| Uppercase Letter | 10000 | 16.7% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| F | 5014 | 50.1% | |
| G | 2509 | 25.1% | |
| S | 2477 | 24.8% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| a | 10000 | 20.0% | |
| n | 10000 | 20.0% | |
| r | 7523 | 15.0% | |
| e | 7523 | 15.0% | |
| c | 5014 | 10.0% | |
| m | 2509 | 5.0% | |
| y | 2509 | 5.0% | |
| p | 2477 | 5.0% | |
| i | 2477 | 5.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 60032 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| a | 10000 | 16.7% | |
| n | 10000 | 16.7% | |
| r | 7523 | 12.5% | |
| e | 7523 | 12.5% | |
| F | 5014 | 8.4% | |
| c | 5014 | 8.4% | |
| G | 2509 | 4.2% | |
| m | 2509 | 4.2% | |
| y | 2509 | 4.2% | |
| S | 2477 | 4.1% | |
| p | 2477 | 4.1% | |
| i | 2477 | 4.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 60032 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| a | 10000 | 16.7% | |
| n | 10000 | 16.7% | |
| r | 7523 | 12.5% | |
| e | 7523 | 12.5% | |
| F | 5014 | 8.4% | |
| c | 5014 | 8.4% | |
| G | 2509 | 4.2% | |
| m | 2509 | 4.2% | |
| y | 2509 | 4.2% | |
| S | 2477 | 4.1% | |
| p | 2477 | 4.1% | |
| i | 2477 | 4.1% |
Gender
Categorical
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.2 KiB |
| Male | |
|---|---|
| Female |
| Value | Count | Frequency (%) | |
| Male | 5457 | 54.6% | |
| Female | 4543 | 45.4% |
Length
| Max length | 6 |
|---|---|
| Median length | 4 |
| Mean length | 4.9086 |
| Min length | 4 |
Most occurring characters
| Value | Count | Frequency (%) | |
| e | 14543 | 29.6% | |
| a | 10000 | 20.4% | |
| l | 10000 | 20.4% | |
| M | 5457 | 11.1% | |
| F | 4543 | 9.3% | |
| m | 4543 | 9.3% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 39086 | 79.6% | |
| Uppercase Letter | 10000 | 20.4% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| M | 5457 | 54.6% | |
| F | 4543 | 45.4% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 14543 | 37.2% | |
| a | 10000 | 25.6% | |
| l | 10000 | 25.6% | |
| m | 4543 | 11.6% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 49086 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| e | 14543 | 29.6% | |
| a | 10000 | 20.4% | |
| l | 10000 | 20.4% | |
| M | 5457 | 11.1% | |
| F | 4543 | 9.3% | |
| m | 4543 | 9.3% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 49086 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| e | 14543 | 29.6% | |
| a | 10000 | 20.4% | |
| l | 10000 | 20.4% | |
| M | 5457 | 11.1% | |
| F | 4543 | 9.3% | |
| m | 4543 | 9.3% |
Age
Real number (ℝ≥0)
| Distinct count | 70 |
|---|---|
| Unique (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38.9218 |
|---|---|
| Minimum | 18 |
| Maximum | 92 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 18 |
|---|---|
| 5-th percentile | 25 |
| Q1 | 32 |
| median | 37 |
| Q3 | 44 |
| 95-th percentile | 60 |
| Maximum | 92 |
| Range | 74 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 10.48780645 |
|---|---|
| Coefficient of variation (CV) | 0.2694584128 |
| Kurtosis | 1.395347062 |
| Mean | 38.9218 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 1.011320263 |
| Sum | 389218 |
| Variance | 109.9940842 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 37 | 478 | 4.8% | |
| 38 | 477 | 4.8% | |
| 35 | 474 | 4.7% | |
| 36 | 456 | 4.6% | |
| 34 | 447 | 4.5% | |
| 33 | 442 | 4.4% | |
| 40 | 432 | 4.3% | |
| 39 | 423 | 4.2% | |
| 32 | 418 | 4.2% | |
| 31 | 404 | 4.0% | |
| 41 | 366 | 3.7% | |
| 29 | 348 | 3.5% | |
| 30 | 327 | 3.3% | |
| 42 | 321 | 3.2% | |
| 43 | 297 | 3.0% | |
| 28 | 273 | 2.7% | |
| 44 | 257 | 2.6% | |
| 45 | 229 | 2.3% | |
| 46 | 226 | 2.3% | |
| 27 | 209 | 2.1% | |
| 26 | 200 | 2.0% | |
| 47 | 175 | 1.8% | |
| 48 | 168 | 1.7% | |
| 25 | 154 | 1.5% | |
| 49 | 147 | 1.5% | |
| Other values (45) | 1852 | 18.5% |
| Value | Count | Frequency (%) | |
| 18 | 22 | 0.2% | |
| 19 | 27 | 0.3% | |
| 20 | 40 | 0.4% | |
| 21 | 53 | 0.5% | |
| 22 | 84 | 0.8% | |
| 23 | 99 | 1.0% | |
| 24 | 132 | 1.3% | |
| 25 | 154 | 1.5% | |
| 26 | 200 | 2.0% | |
| 27 | 209 | 2.1% |
| Value | Count | Frequency (%) | |
| 92 | 2 | < 0.1% | |
| 88 | 1 | < 0.1% | |
| 85 | 1 | < 0.1% | |
| 84 | 2 | < 0.1% | |
| 83 | 1 | < 0.1% | |
| 82 | 1 | < 0.1% | |
| 81 | 4 | < 0.1% | |
| 80 | 3 | < 0.1% | |
| 79 | 4 | < 0.1% | |
| 78 | 5 | 0.1% |
| Distinct count | 11 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.0128 |
|---|---|
| Minimum | 0 |
| Maximum | 10 |
| Zeros | 413 |
| Zeros (%) | 4.1% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 5 |
| Q3 | 7 |
| 95-th percentile | 9 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.892174377 |
|---|---|
| Coefficient of variation (CV) | 0.5769578633 |
| Kurtosis | -1.165225227 |
| Mean | 5.0128 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.01099145798 |
| Sum | 50128 |
| Variance | 8.364672627 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 2 | 1048 | 10.5% | |
| 1 | 1035 | 10.3% | |
| 7 | 1028 | 10.3% | |
| 8 | 1025 | 10.2% | |
| 5 | 1012 | 10.1% | |
| 3 | 1009 | 10.1% | |
| 4 | 989 | 9.9% | |
| 9 | 984 | 9.8% | |
| 6 | 967 | 9.7% | |
| 10 | 490 | 4.9% | |
| 0 | 413 | 4.1% |
| Value | Count | Frequency (%) | |
| 0 | 413 | 4.1% | |
| 1 | 1035 | 10.3% | |
| 2 | 1048 | 10.5% | |
| 3 | 1009 | 10.1% | |
| 4 | 989 | 9.9% | |
| 5 | 1012 | 10.1% | |
| 6 | 967 | 9.7% | |
| 7 | 1028 | 10.3% | |
| 8 | 1025 | 10.2% | |
| 9 | 984 | 9.8% |
| Value | Count | Frequency (%) | |
| 10 | 490 | 4.9% | |
| 9 | 984 | 9.8% | |
| 8 | 1025 | 10.2% | |
| 7 | 1028 | 10.3% | |
| 6 | 967 | 9.7% | |
| 5 | 1012 | 10.1% | |
| 4 | 989 | 9.9% | |
| 3 | 1009 | 10.1% | |
| 2 | 1048 | 10.5% | |
| 1 | 1035 | 10.3% |
| Distinct count | 6382 |
|---|---|
| Unique (%) | 63.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 76485.889288 |
|---|---|
| Minimum | 0.0 |
| Maximum | 250898.09 |
| Zeros | 3617 |
| Zeros (%) | 36.2% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 97198.54 |
| Q3 | 127644.24 |
| 95-th percentile | 162711.669 |
| Maximum | 250898.09 |
| Range | 250898.09 |
| Interquartile range (IQR) | 127644.24 |
Descriptive statistics
| Standard deviation | 62397.4052 |
|---|---|
| Coefficient of variation (CV) | 0.8158028335 |
| Kurtosis | -1.489411768 |
| Mean | 76485.88929 |
| Median Absolute Deviation (MAD) | 46766.79 |
| Skewness | -0.1411087109 |
| Sum | 764858892.9 |
| Variance | 3893436176 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 3617 | 36.2% | |
| 105473.74 | 2 | < 0.1% | |
| 130170.82 | 2 | < 0.1% | |
| 113063.83 | 1 | < 0.1% | |
| 80242.37 | 1 | < 0.1% | |
| 134320.23 | 1 | < 0.1% | |
| 90218.9 | 1 | < 0.1% | |
| 155196.17 | 1 | < 0.1% | |
| 95386.82 | 1 | < 0.1% | |
| 125961.74 | 1 | < 0.1% | |
| 126606.63 | 1 | < 0.1% | |
| 82794.18 | 1 | < 0.1% | |
| 120782.7 | 1 | < 0.1% | |
| 167557.12 | 1 | < 0.1% | |
| 122338.43 | 1 | < 0.1% | |
| 128504.76 | 1 | < 0.1% | |
| 102016.38 | 1 | < 0.1% | |
| 190479.48 | 1 | < 0.1% | |
| 182065.85 | 1 | < 0.1% | |
| 124547.13 | 1 | < 0.1% | |
| 151933.63 | 1 | < 0.1% | |
| 118546.71 | 1 | < 0.1% | |
| 141806.46 | 1 | < 0.1% | |
| 98807.45 | 1 | < 0.1% | |
| 119703.1 | 1 | < 0.1% | |
| Other values (6357) | 6357 | 63.6% |
| Value | Count | Frequency (%) | |
| 0 | 3617 | 36.2% | |
| 3768.69 | 1 | < 0.1% | |
| 12459.19 | 1 | < 0.1% | |
| 14262.8 | 1 | < 0.1% | |
| 16893.59 | 1 | < 0.1% | |
| 23503.31 | 1 | < 0.1% | |
| 24043.45 | 1 | < 0.1% | |
| 27288.43 | 1 | < 0.1% | |
| 27517.15 | 1 | < 0.1% | |
| 27755.97 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 250898.09 | 1 | < 0.1% | |
| 238387.56 | 1 | < 0.1% | |
| 222267.63 | 1 | < 0.1% | |
| 221532.8 | 1 | < 0.1% | |
| 216109.88 | 1 | < 0.1% | |
| 214346.96 | 1 | < 0.1% | |
| 213146.2 | 1 | < 0.1% | |
| 212778.2 | 1 | < 0.1% | |
| 212696.32 | 1 | < 0.1% | |
| 212692.97 | 1 | < 0.1% |
NumOfProducts
Categorical
| Distinct count | 4 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.2 KiB |
| 1 | |
|---|---|
| 2 | |
| 3 | 266 |
| 4 | 60 |
| Value | Count | Frequency (%) | |
| 1 | 5084 | 50.8% | |
| 2 | 4590 | 45.9% | |
| 3 | 266 | 2.7% | |
| 4 | 60 | 0.6% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 1 | 5084 | 50.8% | |
| 2 | 4590 | 45.9% | |
| 3 | 266 | 2.7% | |
| 4 | 60 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 10000 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 1 | 5084 | 50.8% | |
| 2 | 4590 | 45.9% | |
| 3 | 266 | 2.7% | |
| 4 | 60 | 0.6% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 10000 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 1 | 5084 | 50.8% | |
| 2 | 4590 | 45.9% | |
| 3 | 266 | 2.7% | |
| 4 | 60 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 10000 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 1 | 5084 | 50.8% | |
| 2 | 4590 | 45.9% | |
| 3 | 266 | 2.7% | |
| 4 | 60 | 0.6% |
HasCrCard
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.2 KiB |
| 1 | |
|---|---|
| 0 |
| Value | Count | Frequency (%) | |
| 1 | 7055 | 70.5% | |
| 0 | 2945 | 29.4% |
IsActiveMember
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.2 KiB |
| 1 | |
|---|---|
| 0 |
| Value | Count | Frequency (%) | |
| 1 | 5151 | 51.5% | |
| 0 | 4849 | 48.5% |
EstimatedSalary
Real number (ℝ≥0)
| Distinct count | 9999 |
|---|---|
| Unique (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 100090.239881 |
|---|---|
| Minimum | 11.58 |
| Maximum | 199992.48 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 11.58 |
|---|---|
| 5-th percentile | 9851.8185 |
| Q1 | 51002.11 |
| median | 100193.915 |
| Q3 | 149388.2475 |
| 95-th percentile | 190155.3755 |
| Maximum | 199992.48 |
| Range | 199980.9 |
| Interquartile range (IQR) | 98386.1375 |
Descriptive statistics
| Standard deviation | 57510.49282 |
|---|---|
| Coefficient of variation (CV) | 0.5745864221 |
| Kurtosis | -1.181518447 |
| Mean | 100090.2399 |
| Median Absolute Deviation (MAD) | 49198.15 |
| Skewness | 0.002085357662 |
| Sum | 1000902399 |
| Variance | 3307456784 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 24924.92 | 2 | < 0.1% | |
| 109029.72 | 1 | < 0.1% | |
| 182025.95 | 1 | < 0.1% | |
| 82820.85 | 1 | < 0.1% | |
| 30314.04 | 1 | < 0.1% | |
| 143265.65 | 1 | < 0.1% | |
| 148305.82 | 1 | < 0.1% | |
| 21254.06 | 1 | < 0.1% | |
| 56297.85 | 1 | < 0.1% | |
| 113481.02 | 1 | < 0.1% | |
| 185992.36 | 1 | < 0.1% | |
| 69370.05 | 1 | < 0.1% | |
| 76679.6 | 1 | < 0.1% | |
| 77469.38 | 1 | < 0.1% | |
| 179291.85 | 1 | < 0.1% | |
| 133172.48 | 1 | < 0.1% | |
| 59374.82 | 1 | < 0.1% | |
| 194700.81 | 1 | < 0.1% | |
| 168023.6 | 1 | < 0.1% | |
| 180456.8 | 1 | < 0.1% | |
| 68367.18 | 1 | < 0.1% | |
| 52581.96 | 1 | < 0.1% | |
| 22762.23 | 1 | < 0.1% | |
| 75888.65 | 1 | < 0.1% | |
| 21215.67 | 1 | < 0.1% | |
| Other values (9974) | 9974 | 99.7% |
| Value | Count | Frequency (%) | |
| 11.58 | 1 | < 0.1% | |
| 90.07 | 1 | < 0.1% | |
| 91.75 | 1 | < 0.1% | |
| 96.27 | 1 | < 0.1% | |
| 106.67 | 1 | < 0.1% | |
| 123.07 | 1 | < 0.1% | |
| 142.81 | 1 | < 0.1% | |
| 143.34 | 1 | < 0.1% | |
| 178.19 | 1 | < 0.1% | |
| 216.27 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 199992.48 | 1 | < 0.1% | |
| 199970.74 | 1 | < 0.1% | |
| 199953.33 | 1 | < 0.1% | |
| 199929.17 | 1 | < 0.1% | |
| 199909.32 | 1 | < 0.1% | |
| 199862.75 | 1 | < 0.1% | |
| 199857.47 | 1 | < 0.1% | |
| 199841.32 | 1 | < 0.1% | |
| 199808.1 | 1 | < 0.1% | |
| 199805.63 | 1 | < 0.1% |
Exited
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.2 KiB |
| 0 | |
|---|---|
| 1 |
| Value | Count | Frequency (%) | |
| 0 | 7963 | 79.6% | |
| 1 | 2037 | 20.4% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| CreditScore | Geography | Gender | Age | Tenure | Balance | NumOfProducts | HasCrCard | IsActiveMember | EstimatedSalary | Exited | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 619 | France | Female | 42 | 2 | 0.00 | 1 | 1 | 1 | 101348.88 | 1 |
| 1 | 608 | Spain | Female | 41 | 1 | 83807.86 | 1 | 0 | 1 | 112542.58 | 0 |
| 2 | 502 | France | Female | 42 | 8 | 159660.80 | 3 | 1 | 0 | 113931.57 | 1 |
| 3 | 699 | France | Female | 39 | 1 | 0.00 | 2 | 0 | 0 | 93826.63 | 0 |
| 4 | 850 | Spain | Female | 43 | 2 | 125510.82 | 1 | 1 | 1 | 79084.10 | 0 |
| 5 | 645 | Spain | Male | 44 | 8 | 113755.78 | 2 | 1 | 0 | 149756.71 | 1 |
| 6 | 822 | France | Male | 50 | 7 | 0.00 | 2 | 1 | 1 | 10062.80 | 0 |
| 7 | 376 | Germany | Female | 29 | 4 | 115046.74 | 4 | 1 | 0 | 119346.88 | 1 |
| 8 | 501 | France | Male | 44 | 4 | 142051.07 | 2 | 0 | 1 | 74940.50 | 0 |
| 9 | 684 | France | Male | 27 | 2 | 134603.88 | 1 | 1 | 1 | 71725.73 | 0 |
Last rows
| CreditScore | Geography | Gender | Age | Tenure | Balance | NumOfProducts | HasCrCard | IsActiveMember | EstimatedSalary | Exited | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 9990 | 714 | Germany | Male | 33 | 3 | 35016.60 | 1 | 1 | 0 | 53667.08 | 0 |
| 9991 | 597 | France | Female | 53 | 4 | 88381.21 | 1 | 1 | 0 | 69384.71 | 1 |
| 9992 | 726 | Spain | Male | 36 | 2 | 0.00 | 1 | 1 | 0 | 195192.40 | 0 |
| 9993 | 644 | France | Male | 28 | 7 | 155060.41 | 1 | 1 | 0 | 29179.52 | 0 |
| 9994 | 800 | France | Female | 29 | 2 | 0.00 | 2 | 0 | 0 | 167773.55 | 0 |
| 9995 | 771 | France | Male | 39 | 5 | 0.00 | 2 | 1 | 0 | 96270.64 | 0 |
| 9996 | 516 | France | Male | 35 | 10 | 57369.61 | 1 | 1 | 1 | 101699.77 | 0 |
| 9997 | 709 | France | Female | 36 | 7 | 0.00 | 1 | 0 | 1 | 42085.58 | 1 |
| 9998 | 772 | Germany | Male | 42 | 3 | 75075.31 | 2 | 1 | 0 | 92888.52 | 1 |
| 9999 | 792 | France | Female | 28 | 4 | 130142.79 | 1 | 1 | 0 | 38190.78 | 0 |