Dataset statistics
Number of variables | 11 |
---|---|
Number of observations | 99340 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Total size in memory | 3.7 MiB |
Average record size in memory | 39.0 B |
Variable types
Numeric | 4 |
---|---|
Categorical | 7 |
admission_type_id is highly imbalanced (50.7%) | Imbalance |
race is highly imbalanced (55.8%) | Imbalance |
Race is highly imbalanced (55.8%) | Imbalance |
num_procedures has 45679 (46.0%) zeros | Zeros |
Readmitted within 30 Days_Value has 96594 (97.2%) zeros | Zeros |
Reproduction
Analysis started | 2024-10-04 14:44:42.676118 |
---|---|
Analysis finished | 2024-10-04 14:44:42.868093 |
Duration | 0.19 seconds |
Software version | ydata-profiling vv4.10.0 |
Download configuration | config.json |
LGBM_score
Real number (ℝ)
Distinct | 99272 |
---|---|
Distinct (%) | 99.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.11 |
Minimum | 0.017 |
---|---|
Maximum | 0.81 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 776.2 KiB |
Quantile statistics
Minimum | 0.017 |
---|---|
5-th percentile | 0.041 |
Q1 | 0.066 |
median | 0.094 |
Q3 | 0.14 |
95-th percentile | 0.25 |
Maximum | 0.81 |
Range | 0.79 |
Interquartile range (IQR) | 0.076 |
Descriptive statistics
Standard deviation | 0.071 |
---|---|
Coefficient of variation (CV) | 0.62 |
Kurtosis | 7.2 |
Mean | 0.11 |
Median Absolute Deviation (MAD) | 0.034 |
Skewness | 2.1 |
Sum | 1.1 × 104 |
Variance | 0.0051 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.02536078832 | 15 | < 0.1% |
0.02440186969 | 10 | < 0.1% |
0.02513752123 | 7 | < 0.1% |
0.02430971475 | 5 | < 0.1% |
0.02612457459 | 3 | < 0.1% |
Other values (99267) | 99300 |
Value | Count | Frequency (%) |
0.01670154448 | 1 | |
0.01764539855 | 1 | |
0.01788551743 | 1 | |
0.01802084233 | 1 | |
0.01823796389 | 1 |
Value | Count | Frequency (%) |
0.8053033583 | 1 | |
0.7965720262 | 1 | |
0.7727199983 | 1 | |
0.7487666901 | 1 | |
0.7349983041 | 1 |
admission_type_id
Categorical
IMBALANCE
 
Distinct | 5 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 97.3 KiB |
Emergency | |
---|---|
Elective | |
Unknown | |
Trauma Center | 18 |
New Born | 10 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Unknown |
---|---|
2nd row | Emergency |
3rd row | Emergency |
4th row | Emergency |
5th row | Emergency |
Common Values
Value | Count | Frequency (%) |
Emergency | 70501 | |
Elective | 18667 | 18.8% |
Unknown | 10144 | 10.2% |
Trauma Center | 18 | < 0.1% |
New Born | 10 | < 0.1% |
Common Values (Plot)
age
Categorical
Distinct | 5 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 97.3 KiB |
70+ | |
---|---|
[50-70) | |
[20-50) | |
[10-20) | 690 |
[0-10) | 160 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | [0-10) |
---|---|
2nd row | [10-20) |
3rd row | [20-50) |
4th row | [20-50) |
5th row | [20-50) |
Common Values
Value | Count | Frequency (%) |
70+ | 44352 | |
[50-70) | 39118 | |
[20-50) | 15020 | 15.1% |
[10-20) | 690 | 0.7% |
[0-10) | 160 | 0.2% |
Common Values (Plot)
gender
Categorical
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 97.3 KiB |
Female | |
---|---|
Male |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Female |
---|---|
2nd row | Female |
3rd row | Female |
4th row | Male |
5th row | Male |
Common Values
Value | Count | Frequency (%) |
Female | 53454 | |
Male | 45886 |
Common Values (Plot)
num_medications
Real number (ℝ)
Distinct | 75 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 16 |
Minimum | 1 |
---|---|
Maximum | 81 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 776.2 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 6 |
Q1 | 10 |
median | 15 |
Q3 | 20 |
95-th percentile | 31 |
Maximum | 81 |
Range | 80 |
Interquartile range (IQR) | 10 |
Descriptive statistics
Standard deviation | 8.1 |
---|---|
Coefficient of variation (CV) | 0.51 |
Kurtosis | 3.5 |
Mean | 16 |
Median Absolute Deviation (MAD) | 5 |
Skewness | 1.3 |
Sum | 1.6 × 106 |
Variance | 66 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
13 | 5976 | 6.0% |
12 | 5888 | 5.9% |
11 | 5696 | 5.7% |
15 | 5694 | 5.7% |
14 | 5592 | 5.6% |
Other values (70) | 70494 |
Value | Count | Frequency (%) |
1 | 260 | 0.3% |
2 | 457 | 0.5% |
3 | 874 | |
4 | 1383 | |
5 | 1964 |
Value | Count | Frequency (%) |
81 | 1 | < 0.1% |
79 | 1 | < 0.1% |
75 | 2 | |
74 | 1 | < 0.1% |
72 | 3 |
num_procedures
Real number (ℝ)
ZEROS
 
Distinct | 7 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.3 |
Minimum | 0 |
---|---|
Maximum | 6 |
Zeros | 45679 |
Zeros (%) | 46.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 776.2 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 1 |
Q3 | 2 |
95-th percentile | 5 |
Maximum | 6 |
Range | 6 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 1.7 |
---|---|
Coefficient of variation (CV) | 1.3 |
Kurtosis | 0.88 |
Mean | 1.3 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 1.3 |
Sum | 1.3 × 105 |
Variance | 2.9 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 45679 | |
1 | 20249 | |
2 | 12372 | 12.5% |
3 | 9203 | 9.3% |
6 | 4801 | 4.8% |
Other values (2) | 7036 | 7.1% |
Value | Count | Frequency (%) |
0 | 45679 | |
1 | 20249 | |
2 | 12372 | 12.5% |
3 | 9203 | 9.3% |
4 | 4049 | 4.1% |
Value | Count | Frequency (%) |
6 | 4801 | 4.8% |
5 | 2987 | 3.0% |
4 | 4049 | 4.1% |
3 | 9203 | |
2 | 12372 |
race
Categorical
IMBALANCE
 
Distinct | 6 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 97.4 KiB |
Caucasian | |
---|---|
AfricanAmerican | |
Unknown | 2232 |
Hispanic | 2017 |
Other | 1471 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Caucasian |
---|---|
2nd row | Caucasian |
3rd row | AfricanAmerican |
4th row | Caucasian |
5th row | Caucasian |
Common Values
Value | Count | Frequency (%) |
Caucasian | 74220 | |
AfricanAmerican | 18772 | 18.9% |
Unknown | 2232 | 2.2% |
Hispanic | 2017 | 2.0% |
Other | 1471 | 1.5% |
Common Values (Plot)
Readmitted within 30 Days_Value
Real number (ℝ)
ZEROS
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.028 |
Minimum | 0 |
---|---|
Maximum | 1 |
Zeros | 96594 |
Zeros (%) | 97.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 776.2 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 0 |
Maximum | 1 |
Range | 1 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 0.16 |
---|---|
Coefficient of variation (CV) | 5.9 |
Kurtosis | 31 |
Mean | 0.028 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 5.8 |
Sum | 2.7 × 103 |
Variance | 0.027 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 96594 | |
1 | 2746 | 2.8% |
Value | Count | Frequency (%) |
0 | 96594 | |
1 | 2746 | 2.8% |
Value | Count | Frequency (%) |
1 | 2746 | 2.8% |
0 | 96594 |
Age
Categorical
Distinct | 5 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 97.3 KiB |
70+ | |
---|---|
[50-70) | |
[20-50) | |
[10-20) | 690 |
[0-10) | 160 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | [0-10) |
---|---|
2nd row | [10-20) |
3rd row | [20-50) |
4th row | [20-50) |
5th row | [20-50) |
Common Values
Value | Count | Frequency (%) |
70+ | 44352 | |
[50-70) | 39118 | |
[20-50) | 15020 | 15.1% |
[10-20) | 690 | 0.7% |
[0-10) | 160 | 0.2% |
Common Values (Plot)
Race
Categorical
IMBALANCE
 
Distinct | 6 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 97.4 KiB |
Caucasian | |
---|---|
AfricanAmerican | |
Unknown | 2232 |
Hispanic | 2017 |
Other | 1471 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Caucasian |
---|---|
2nd row | Caucasian |
3rd row | AfricanAmerican |
4th row | Caucasian |
5th row | Caucasian |
Common Values
Value | Count | Frequency (%) |
Caucasian | 74220 | |
AfricanAmerican | 18772 | 18.9% |
Unknown | 2232 | 2.2% |
Hispanic | 2017 | 2.0% |
Other | 1471 | 1.5% |
Common Values (Plot)
Gender
Categorical
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 97.3 KiB |
Female | |
---|---|
Male |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Female |
---|---|
2nd row | Female |
3rd row | Female |
4th row | Male |
5th row | Male |
Common Values
Value | Count | Frequency (%) |
Female | 53454 | |
Male | 45886 |