Overview

Dataset statistics

Number of variables: 11
Number of observations: 99340
Missing cells: 0
Missing cells (%): 0.0%
Total size in memory: 3.7 MiB
Average record size in memory: 39.0 B

Variable types

Numeric: 4
Categorical: 7

Alerts

admission_type_id is highly imbalanced (50.7%) [Imbalance]
race is highly imbalanced (55.8%) [Imbalance]
Race is highly imbalanced (55.8%) [Imbalance]
num_procedures has 45679 (46.0%) zeros [Zeros]
Readmitted within 30 Days_Value has 96594 (97.2%) zeros [Zeros]
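
The imbalance percentages above are consistent with an entropy-based score, 1 - H/log2(k), where H is the Shannon entropy of the category shares and k is the number of distinct categories. The sketch below is an assumption about how the score is derived, not a copy of the profiler's code. It is checked only against the admission_type_id counts listed later in this report, and the function name `imbalance_score` is hypothetical.

```python
import math

def imbalance_score(counts):
    """Return 1 - H/log2(k): 0 for a uniform column, close to 1 when one class dominates."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # assumption: a single-category column is treated as balanced
    shares = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in shares)
    return 1.0 - entropy / math.log2(k)

# Counts copied from the admission_type_id and gender sections of this report.
admission_type_id = [70501, 18667, 10144, 18, 10]
gender = [53454, 45886]

print(f"admission_type_id: {imbalance_score(admission_type_id):.1%}")
print(f"gender: {imbalance_score(gender):.1%}")
```

For admission_type_id this gives roughly 50.7%, in line with the alert, while gender scores well under 1% and is not flagged.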

Reproduction

Analysis started: 2024-10-04 14:44:42.676118
Analysis finished: 2024-10-04 14:44:42.868093
Duration: 0.19 seconds
Software version: ydata-profiling v4.10.0
Download configuration: config.json
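
The reported duration can be checked from the two timestamps. This is a small arithmetic sketch using only the values printed above; the variable names are illustrative.

```python
from datetime import datetime

# Timestamps copied from the Reproduction section.
started = datetime.fromisoformat("2024-10-04 14:44:42.676118")
finished = datetime.fromisoformat("2024-10-04 14:44:42.868093")

duration_seconds = (finished - started).total_seconds()
print(f"Duration: {duration_seconds:.2f} seconds")
```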

Variables

LGBM_score
Real number (ℝ)

Distinct: 99272
Distinct (%): 99.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.11
Minimum: 0.017
Maximum: 0.81
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 776.2 KiB

Quantile statistics

Minimum: 0.017
5-th percentile: 0.041
Q1: 0.066
Median: 0.094
Q3: 0.14
95-th percentile: 0.25
Maximum: 0.81
Range: 0.79
Interquartile range (IQR): 0.076

Descriptive statistics

Standard deviation: 0.071
Coefficient of variation (CV): 0.62
Kurtosis: 7.2
Mean: 0.11
Median Absolute Deviation (MAD): 0.034
Skewness: 2.1
Sum: 1.1 × 10^4
Variance: 0.0051
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Common values

Value                   Count    Frequency (%)
0.02536078832           15       < 0.1%
0.02440186969           10       < 0.1%
0.02513752123           7        < 0.1%
0.02430971475           5        < 0.1%
0.02612457459           3        < 0.1%
Other values (99267)    99300    > 99.9%

Minimum 5 values

Value                   Count    Frequency (%)
0.01670154448           1        < 0.1%
0.01764539855           1        < 0.1%
0.01788551743           1        < 0.1%
0.01802084233           1        < 0.1%
0.01823796389           1        < 0.1%

Maximum 5 values

Value                   Count    Frequency (%)
0.8053033583            1        < 0.1%
0.7965720262            1        < 0.1%
0.7727199983            1        < 0.1%
0.7487666901            1        < 0.1%
0.7349983041            1        < 0.1%

admission_type_id
Categorical

IMBALANCE 

Distinct: 5
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 97.3 KiB
Category counts: Emergency 70501, Elective 18667, Unknown 10144, Trauma Center 18, New Born 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Unknown
2nd row: Emergency
3rd row: Emergency
4th row: Emergency
5th row: Emergency

Common Values

Value            Count    Frequency (%)
Emergency        70501    71.0%
Elective         18667    18.8%
Unknown          10144    10.2%
Trauma Center    18       < 0.1%
New Born         10       < 0.1%


age
Categorical

Distinct: 5
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 97.3 KiB
Category counts: 70+ 44352, [50-70) 39118, [20-50) 15020, [10-20) 690, [0-10) 160

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: [0-10)
2nd row: [10-20)
3rd row: [20-50)
4th row: [20-50)
5th row: [20-50)

Common Values

Value      Count    Frequency (%)
70+        44352    44.6%
[50-70)    39118    39.4%
[20-50)    15020    15.1%
[10-20)    690      0.7%
[0-10)     160      0.2%


gender
Categorical

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 97.3 KiB
Category counts: Female 53454, Male 45886

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Female
2nd row: Female
3rd row: Female
4th row: Male
5th row: Male

Common Values

Value     Count    Frequency (%)
Female    53454    53.8%
Male      45886    46.2%


num_medications
Real number (ℝ)

Distinct: 75
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 16
Minimum: 1
Maximum: 81
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 776.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 6
Q1: 10
Median: 15
Q3: 20
95-th percentile: 31
Maximum: 81
Range: 80
Interquartile range (IQR): 10

Descriptive statistics

Standard deviation: 8.1
Coefficient of variation (CV): 0.51
Kurtosis: 3.5
Mean: 16
Median Absolute Deviation (MAD): 5
Skewness: 1.3
Sum: 1.6 × 10^6
Variance: 66
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Common values

Value                Count    Frequency (%)
13                   5976     6.0%
12                   5888     5.9%
11                   5696     5.7%
15                   5694     5.7%
14                   5592     5.6%
Other values (70)    70494    71.0%

Minimum 5 values

Value                Count    Frequency (%)
1                    260      0.3%
2                    457      0.5%
3                    874      0.9%
4                    1383     1.4%
5                    1964     2.0%

Maximum 5 values

Value                Count    Frequency (%)
81                   1        < 0.1%
79                   1        < 0.1%
75                   2        < 0.1%
74                   1        < 0.1%
72                   3        < 0.1%

num_procedures
Real number (ℝ)

ZEROS 

Distinct: 7
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.3
Minimum: 0
Maximum: 6
Zeros: 45679
Zeros (%): 46.0%
Negative: 0
Negative (%): 0.0%
Memory size: 776.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 1
Q3: 2
95-th percentile: 5
Maximum: 6
Range: 6
Interquartile range (IQR): 2

Descriptive statistics

Standard deviation: 1.7
Coefficient of variation (CV): 1.3
Kurtosis: 0.88
Mean: 1.3
Median Absolute Deviation (MAD): 1
Skewness: 1.3
Sum: 1.3 × 10^5
Variance: 2.9
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=7)

Common values

Value               Count    Frequency (%)
0                   45679    46.0%
1                   20249    20.4%
2                   12372    12.5%
3                   9203     9.3%
6                   4801     4.8%
Other values (2)    7036     7.1%

Minimum 5 values

Value               Count    Frequency (%)
0                   45679    46.0%
1                   20249    20.4%
2                   12372    12.5%
3                   9203     9.3%
4                   4049     4.1%

Maximum 5 values

Value               Count    Frequency (%)
6                   4801     4.8%
5                   2987     3.0%
4                   4049     4.1%
3                   9203     9.3%
2                   12372    12.5%
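
Because num_procedures takes only seven distinct values, its summary statistics can be rebuilt from the frequency tables alone. The sketch below does that with the counts reported above. The `quantile` helper is a simplified, non-interpolating version written for illustration and is not the profiler's own routine, and the variance uses the population form, which differs only negligibly from a sample estimate at this sample size.

```python
import math

# Value -> count, copied from the num_procedures frequency tables in this report.
freq = {0: 45679, 1: 20249, 2: 12372, 3: 9203, 4: 4049, 5: 2987, 6: 4801}

n = sum(freq.values())
total = sum(value * count for value, count in freq.items())
mean = total / n
# Population variance (divides by n, not n - 1).
variance = sum(count * (value - mean) ** 2 for value, count in freq.items()) / n
std = math.sqrt(variance)

def quantile(freq, q):
    """Smallest value whose cumulative count reaches q * n (no interpolation)."""
    target = q * sum(freq.values())
    cumulative = 0
    for value in sorted(freq):
        cumulative += freq[value]
        if cumulative >= target:
            return value
    return max(freq)

# Q1, median, Q3 and the 95-th percentile.
quantiles = [quantile(freq, q) for q in (0.25, 0.5, 0.75, 0.95)]

print(f"n={n} sum={total} mean={mean:.1f} variance={variance:.1f} std={std:.1f}")
print(f"Q1, median, Q3, 95-th percentile: {quantiles}")
```

The results agree with the rounded figures in the Quantile statistics and Descriptive statistics blocks: 99340 observations, a mean of about 1.3, and quantiles of 0, 1, 2 and 5.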

race
Categorical

IMBALANCE 

Distinct: 6
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 97.4 KiB
Category counts: Caucasian 74220, AfricanAmerican 18772, Unknown 2232, Hispanic 2017, Other 1471

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Caucasian
2nd row: Caucasian
3rd row: AfricanAmerican
4th row: Caucasian
5th row: Caucasian

Common Values

Value              Count    Frequency (%)
Caucasian          74220    74.7%
AfricanAmerican    18772    18.9%
Unknown            2232     2.2%
Hispanic           2017     2.0%
Other              1471     1.5%


Readmitted within 30 Days_Value
Real number (ℝ)

ZEROS 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.028
Minimum: 0
Maximum: 1
Zeros: 96594
Zeros (%): 97.2%
Negative: 0
Negative (%): 0.0%
Memory size: 776.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 0
95-th percentile: 0
Maximum: 1
Range: 1
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 0.16
Coefficient of variation (CV): 5.9
Kurtosis: 31
Mean: 0.028
Median Absolute Deviation (MAD): 0
Skewness: 5.8
Sum: 2.7 × 10^3
Variance: 0.027
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=2)

Common values

Value    Count    Frequency (%)
0        96594    97.2%
1        2746     2.8%

Minimum 5 values

Value    Count    Frequency (%)
0        96594    97.2%
1        2746     2.8%

Maximum 5 values

Value    Count    Frequency (%)
1        2746     2.8%
0        96594    97.2%
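
Since this column holds only 0 and 1, its descriptive statistics follow from the share of ones alone. The sketch below applies the standard Bernoulli formulas to the counts in the tables above. The closed-form moments are population quantities, so they may differ slightly from bias-corrected sample estimators, although at this sample size the rounded values agree.

```python
import math

# Counts copied from the frequency table of "Readmitted within 30 Days_Value".
zeros, ones = 96594, 2746
n = zeros + ones

p = ones / n                                      # share of ones, which is also the mean
variance = p * (1 - p)                            # Bernoulli variance
std = math.sqrt(variance)
cv = std / p                                      # coefficient of variation
skewness = (1 - 2 * p) / std
excess_kurtosis = (1 - 6 * variance) / variance   # kurtosis minus 3

print(f"mean={p:.3f} variance={variance:.3f} std={std:.2f}")
print(f"cv={cv:.1f} skewness={skewness:.1f} kurtosis={excess_kurtosis:.0f}")
```

The strong positive skew and heavy kurtosis are therefore a direct consequence of only about 2.8% of rows being readmissions, not a sign of outliers in the column.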

Age
Categorical

Distinct: 5
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 97.3 KiB
Category counts: 70+ 44352, [50-70) 39118, [20-50) 15020, [10-20) 690, [0-10) 160

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: [0-10)
2nd row: [10-20)
3rd row: [20-50)
4th row: [20-50)
5th row: [20-50)

Common Values

Value      Count    Frequency (%)
70+        44352    44.6%
[50-70)    39118    39.4%
[20-50)    15020    15.1%
[10-20)    690      0.7%
[0-10)     160      0.2%


Race
Categorical

IMBALANCE 

Distinct: 6
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 97.4 KiB
Category counts: Caucasian 74220, AfricanAmerican 18772, Unknown 2232, Hispanic 2017, Other 1471

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Caucasian
2nd row: Caucasian
3rd row: AfricanAmerican
4th row: Caucasian
5th row: Caucasian

Common Values

Value              Count    Frequency (%)
Caucasian          74220    74.7%
AfricanAmerican    18772    18.9%
Unknown            2232     2.2%
Hispanic           2017     2.0%
Other              1471     1.5%


Gender
Categorical

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 97.3 KiB
Category counts: Female 53454, Male 45886

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Female
2nd row: Female
3rd row: Female
4th row: Male
5th row: Male

Common Values

Value     Count    Frequency (%)
Female    53454    53.8%
Male      45886    46.2%
