Overview

Dataset statistics

Statistic | Readmitted within 30 Days_Value=0 | Readmitted within 30 Days_Value=1
Number of variables | 11 | 11
Number of observations | 96594 | 2746
Missing cells | 0 | 0
Missing cells (%) | 0.0% | 0.0%
Total size in memory | 4.3 MiB | 127.3 KiB
Average record size in memory | 47.0 B | 47.5 B

Variable types

Type | Readmitted within 30 Days_Value=0 | Readmitted within 30 Days_Value=1
Numeric | 4 | 4
Categorical | 7 | 7

Alerts

Readmitted within 30 Days_Value=0 | Readmitted within 30 Days_Value=1 | Alert type
Readmitted within 30 Days_Value has constant value "0" | Readmitted within 30 Days_Value has constant value "1" | Constant
admission_type_id is highly imbalanced (50.7%) | Alert not present in this dataset | Imbalance
race is highly imbalanced (55.7%) | race is highly imbalanced (59.6%) | Imbalance
Race is highly imbalanced (55.7%) | Race is highly imbalanced (59.6%) | Imbalance
num_procedures has 44431 (46.0%) zeros | num_procedures has 1248 (45.4%) zeros | Zeros
Readmitted within 30 Days_Value has 96594 (100.0%) zeros | Alert not present in this dataset | Zeros
Alert not present in this dataset | LGBM_score has unique values | Unique
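
The imbalance percentages in these alerts can be checked against the category counts reported later for admission_type_id. The sketch below assumes the score is one minus the normalized Shannon entropy of the class shares, and that an alert fires above a threshold of 0.5; both are assumptions about the profiler, not facts stated in this report, and `imbalance_score` is a hypothetical helper name. Under those assumptions the Value=0 group scores roughly 50.7%, matching the alert, while the Value=1 group scores roughly 31.7%, which would explain why no alert appears for it.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log(k), with H the Shannon entropy of the class shares.

    0.0 means perfectly balanced classes; values near 1.0 mean one class dominates.
    This formula is an assumption about how the report derives its percentage.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2 or total == 0:
        return 0.0
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log(k)


# admission_type_id counts copied from this report.
# Value=0: Emergency, Elective, Unknown, Trauma Center, New Born
counts_value_0 = [68485, 18189, 9892, 18, 10]
# Value=1: Emergency, Elective, Unknown
counts_value_1 = [2016, 478, 252]

ASSUMED_THRESHOLD = 0.5  # assumed alert threshold, not stated in the report

score_0 = imbalance_score(counts_value_0)
score_1 = imbalance_score(counts_value_1)
print(f"Value=0: {score_0:.1%}, alert={score_0 > ASSUMED_THRESHOLD}")
print(f"Value=1: {score_1:.1%}, alert={score_1 > ASSUMED_THRESHOLD}")
```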

Reproduction

Statistic | Readmitted within 30 Days_Value=0 | Readmitted within 30 Days_Value=1
Analysis started | 2024-10-04 14:44:46.528159 | 2024-10-04 14:44:46.712877
Analysis finished | 2024-10-04 14:44:46.709242 | 2024-10-04 14:44:46.768025
Duration | 0.18 seconds | 0.06 seconds
Software version | ydata-profiling v4.10.0 | ydata-profiling v4.10.0
Download configuration | config.json | config.json
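
A report of this shape could plausibly be regenerated by profiling each target group separately and comparing the two profiles. The sketch below is an assumption about the workflow, not the configuration actually used (that lives in config.json, which is not reproduced here). It assumes a pandas DataFrame `df` holding the 11 variables, and the helper names `group_title` and `build_comparison_report` and the output path are hypothetical. `ProfileReport`, its `compare` method, and `to_file` are part of ydata-profiling.

```python
def group_title(target, value):
    """Build the per-group title used as the column header in the comparison."""
    return f"{target}={value}"


def build_comparison_report(df, target="Readmitted within 30 Days_Value",
                            out_path="comparison_report.html"):
    """Profile each target group and write a side-by-side comparison report.

    `df` is assumed to be a pandas DataFrame with exactly two target values.
    """
    # Imported lazily so this module loads even without ydata-profiling installed.
    from ydata_profiling import ProfileReport

    # groupby iterates over the target values in sorted order (0, then 1).
    reports = [
        ProfileReport(group, title=group_title(target, value))
        for value, group in df.groupby(target)
    ]
    if len(reports) != 2:
        raise ValueError("expected exactly two target groups")

    comparison = reports[0].compare(reports[1])
    comparison.to_file(out_path)
    return comparison
```

Because each group is profiled on its own, the target column is constant within a group, which is consistent with the Constant alerts listed above.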

Variables

LGBM_score
Real number (ℝ)

Statistic | Readmitted within 30 Days_Value=0 | Readmitted within 30 Days_Value=1
Distinct | 96527 | 2746
Distinct (%) | 99.9% | 100.0%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Infinite | 0 | 0
Infinite (%) | 0.0% | 0.0%
Mean | 0.11 | 0.18
Minimum | 0.017 | 0.033
Maximum | 0.77 | 0.81
Zeros | 0 | 0
Zeros (%) | 0.0% | 0.0%
Negative | 0 | 0
Negative (%) | 0.0% | 0.0%
Memory size | 1.5 MiB | 42.9 KiB

Quantile statistics

Statistic | Readmitted within 30 Days_Value=0 | Readmitted within 30 Days_Value=1
Minimum | 0.017 | 0.033
5-th percentile | 0.041 | 0.065
Q1 | 0.065 | 0.11
median | 0.093 | 0.16
Q3 | 0.14 | 0.24
95-th percentile | 0.24 | 0.39
Maximum | 0.77 | 0.81
Range | 0.76 | 0.77
Interquartile range (IQR) | 0.074 | 0.13

Descriptive statistics

Statistic | Readmitted within 30 Days_Value=0 | Readmitted within 30 Days_Value=1
Standard deviation | 0.069 | 0.11
Coefficient of variation (CV) | 0.62 | 0.57
Kurtosis | 7.1 | 3
Mean | 0.11 | 0.18
Median Absolute Deviation (MAD) | 0.033 | 0.061
Skewness | 2.1 | 1.5
Sum | 1.1 × 10^4 | 5.1 × 10^2
Variance | 0.0048 | 0.011
Monotonicity | Not monotonic | Not monotonic

[Figure: Histogram with fixed size bins (bins=50)]

Common values

Readmitted within 30 Days_Value=0
Value | Count | Frequency (%)
0.02536078832 | 15 | < 0.1%
0.02440186969 | 10 | < 0.1%
0.02513752123 | 7 | < 0.1%
0.02430971475 | 5 | < 0.1%
0.02540571953 | 3 | < 0.1%
Other values (96522) | 96554 | > 99.9%

Readmitted within 30 Days_Value=1
Value | Count | Frequency (%)
0.3041835405 | 1 | < 0.1%
0.126975873 | 1 | < 0.1%
0.269070649 | 1 | < 0.1%
0.0825586868 | 1 | < 0.1%
0.4296344879 | 1 | < 0.1%
Other values (2741) | 2741 | 99.8%

Minimum 5 values

Readmitted within 30 Days_Value=0
Value | Count | Frequency (%)
0.01670154448 | 1 | < 0.1%
0.01764539855 | 1 | < 0.1%
0.01788551743 | 1 | < 0.1%
0.01802084233 | 1 | < 0.1%
0.01823796389 | 1 | < 0.1%

Readmitted within 30 Days_Value=1
Value | Count | Frequency (%)
0.0329087373 | 1 | < 0.1%
0.03510931307 | 1 | < 0.1%
0.03712336873 | 1 | < 0.1%
0.03958424191 | 1 | < 0.1%
0.04127887936 | 1 | < 0.1%

admission_type_id
Categorical

Statistic | Readmitted within 30 Days_Value=0 | Readmitted within 30 Days_Value=1
Distinct | 5 | 3
Distinct (%) | < 0.1% | 0.1%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Memory size | 849.2 KiB | 24.3 KiB

Unique

Statistic | Readmitted within 30 Days_Value=0 | Readmitted within 30 Days_Value=1
Unique | 0 | 0
Unique (%) | 0.0% | 0.0%

Sample

Row | Readmitted within 30 Days_Value=0 | Readmitted within 30 Days_Value=1
1st row | Unknown | Elective
2nd row | Emergency | Emergency
3rd row | Emergency | Unknown
4th row | Emergency | Unknown
5th row | Emergency | Unknown

Common Values

Readmitted within 30 Days_Value=0
Value | Count | Frequency (%)
Emergency | 68485 | 70.9%
Elective | 18189 | 18.8%
Unknown | 9892 | 10.2%
Trauma Center | 18 | < 0.1%
New Born | 10 | < 0.1%

Readmitted within 30 Days_Value=1
Value | Count | Frequency (%)
Emergency | 2016 | 73.4%
Elective | 478 | 17.4%
Unknown | 252 | 9.2%

Common Values (Plot)

[Figures: common-value bar charts for Readmitted within 30 Days_Value=0 and Readmitted within 30 Days_Value=1]

age
Categorical

Statistic | Readmitted within 30 Days_Value=0 | Readmitted within 30 Days_Value=1
Distinct | 5 | 5
Distinct (%) | < 0.1% | 0.2%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Memory size | 849.2 KiB | 24.3 KiB

Unique

Statistic | Readmitted within 30 Days_Value=0 | Readmitted within 30 Days_Value=1
Unique | 0 | 1
Unique (%) | 0.0% | < 0.1%

Sample

Row | Readmitted within 30 Days_Value=0 | Readmitted within 30 Days_Value=1
1st row | [0-10) | 70+
2nd row | [10-20) | [50-70)
3rd row | [20-50) | 70+
4th row | [20-50) | [50-70)
5th row | [20-50) | 70+

Common Values

Readmitted within 30 Days_Value=0
Value | Count | Frequency (%)
70+ | 43017 | 44.5%
[50-70) | 38107 | 39.5%
[20-50) | 14626 | 15.1%
[10-20) | 685 | 0.7%
[0-10) | 159 | 0.2%

Readmitted within 30 Days_Value=1
Value | Count | Frequency (%)
70+ | 1335 | 48.6%
[50-70) | 1011 | 36.8%
[20-50) | 394 | 14.3%
[10-20) | 5 | 0.2%
[0-10) | 1 | < 0.1%

Common Values (Plot)

[Figures: common-value bar charts for Readmitted within 30 Days_Value=0 and Readmitted within 30 Days_Value=1]

gender
Categorical

Statistic | Readmitted within 30 Days_Value=0 | Readmitted within 30 Days_Value=1
Distinct | 2 | 2
Distinct (%) | < 0.1% | 0.1%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Memory size | 849.1 KiB | 24.3 KiB

Unique

Statistic | Readmitted within 30 Days_Value=0 | Readmitted within 30 Days_Value=1
Unique | 0 | 0
Unique (%) | 0.0% | 0.0%

Sample

Row | Readmitted within 30 Days_Value=0 | Readmitted within 30 Days_Value=1
1st row | Female | Female
2nd row | Female | Male
3rd row | Female | Female
4th row | Male | Male
5th row | Male | Female

Common Values

Readmitted within 30 Days_Value=0
Value | Count | Frequency (%)
Female | 51990 | 53.8%
Male | 44604 | 46.2%

Readmitted within 30 Days_Value=1
Value | Count | Frequency (%)
Female | 1464 | 53.3%
Male | 1282 | 46.7%

Common Values (Plot)

[Figures: common-value bar charts for Readmitted within 30 Days_Value=0 and Readmitted within 30 Days_Value=1]

num_medications
Real number (ℝ)

Statistic | Readmitted within 30 Days_Value=0 | Readmitted within 30 Days_Value=1
Distinct | 75 | 58
Distinct (%) | 0.1% | 2.1%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Infinite | 0 | 0
Infinite (%) | 0.0% | 0.0%
Mean | 16 | 17
Minimum | 1 | 1
Maximum | 81 | 72
Zeros | 0 | 0
Zeros (%) | 0.0% | 0.0%
Negative | 0 | 0
Negative (%) | 0.0% | 0.0%
Memory size | 1.5 MiB | 42.9 KiB

Quantile statistics

Statistic | Readmitted within 30 Days_Value=0 | Readmitted within 30 Days_Value=1
Minimum | 1 | 1
5-th percentile | 5 | 6
Q1 | 10 | 12
median | 15 | 16
Q3 | 20 | 22
95-th percentile | 31 | 31
Maximum | 81 | 72
Range | 80 | 71
Interquartile range (IQR) | 10 | 10
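
The Range and Interquartile range rows follow directly from the other quantiles: range is maximum minus minimum, and IQR is Q3 minus Q1. A minimal sketch using the num_medications figures from this table (the dictionary layout and the `spread` helper are illustrative choices, not part of the report):

```python
# Quantiles for num_medications, copied from the table above.
quantiles = {
    "Readmitted within 30 Days_Value=0": {"min": 1, "q1": 10, "q3": 20, "max": 81},
    "Readmitted within 30 Days_Value=1": {"min": 1, "q1": 12, "q3": 22, "max": 72},
}


def spread(q):
    """Derive the range and interquartile range from a quantile summary."""
    return {"range": q["max"] - q["min"], "iqr": q["q3"] - q["q1"]}


derived = {group: spread(q) for group, q in quantiles.items()}
for group, stats in derived.items():
    print(group, stats)
```

Both groups share an IQR of 10 even though the readmitted group's quartiles sit two medications higher, so the middle half of each distribution is equally wide but shifted.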

Descriptive statistics

Statistic | Readmitted within 30 Days_Value=0 | Readmitted within 30 Days_Value=1
Standard deviation | 8.1 | 8.2
Coefficient of variation (CV) | 0.51 | 0.48
Kurtosis | 3.5 | 4
Mean | 16 | 17
Median Absolute Deviation (MAD) | 5 | 5
Skewness | 1.3 | 1.3
Sum | 1.5 × 10^6 | 4.7 × 10^4
Variance | 65 | 67
Monotonicity | Not monotonic | Not monotonic

[Figure: Histogram with fixed size bins (bins=50)]
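
Common values

Readmitted within 30 Days_Value=0
Value | Count | Frequency (%)
13 | 5812 | 6.0%
12 | 5737 | 5.9%
11 | 5557 | 5.8%
15 | 5525 | 5.7%
14 | 5458 | 5.7%
Other values (70) | 68505 | 70.9%

Readmitted within 30 Days_Value=1
Value | Count | Frequency (%)
15 | 169 | 6.2%
13 | 164 | 6.0%
12 | 151 | 5.5%
16 | 147 | 5.4%
11 | 139 | 5.1%
Other values (53) | 1976 | 72.0%

Minimum 5 values

Readmitted within 30 Days_Value=0
Value | Count | Frequency (%)
1 | 254 | 0.3%
2 | 447 | 0.5%
3 | 862 | 0.9%
4 | 1354 | 1.4%
5 | 1935 | 2.0%

Readmitted within 30 Days_Value=1
Value | Count | Frequency (%)
1 | 6 | 0.2%
2 | 10 | 0.4%
3 | 12 | 0.4%
4 | 29 | 1.1%
5 | 29 | 1.1%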
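
Readmitted within 30 Days_Value=1 (same five values; frequencies are consistent with division by the 96594 rows of the other group)
Value | Count | Frequency (%)
1 | 6 | < 0.1%
2 | 10 | < 0.1%
3 | 12 | < 0.1%
4 | 29 | < 0.1%
5 | 29 | < 0.1%

Readmitted within 30 Days_Value=0 (same five values; frequencies are consistent with division by the 2746 rows of the other group)
Value | Count | Frequency (%)
1 | 254 | 9.2%
2 | 447 | 16.3%
3 | 862 | 31.4%
4 | 1354 | 49.3%
5 | 1935 | 70.5%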

num_procedures
Real number (ℝ)

Statistic | Readmitted within 30 Days_Value=0 | Readmitted within 30 Days_Value=1
Distinct | 7 | 7
Distinct (%) | < 0.1% | 0.3%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Infinite | 0 | 0
Infinite (%) | 0.0% | 0.0%
Mean | 1.3 | 1.3
Minimum | 0 | 0
Maximum | 6 | 6
Zeros | 44431 | 1248
Zeros (%) | 46.0% | 45.4%
Negative | 0 | 0
Negative (%) | 0.0% | 0.0%
Memory size | 1.5 MiB | 42.9 KiB

Quantile statistics

Statistic | Readmitted within 30 Days_Value=0 | Readmitted within 30 Days_Value=1
Minimum | 0 | 0
5-th percentile | 0 | 0
Q1 | 0 | 0
median | 1 | 1
Q3 | 2 | 2
95-th percentile | 5 | 5
Maximum | 6 | 6
Range | 6 | 6
Interquartile range (IQR) | 2 | 2

Descriptive statistics

Statistic | Readmitted within 30 Days_Value=0 | Readmitted within 30 Days_Value=1
Standard deviation | 1.7 | 1.6
Coefficient of variation (CV) | 1.3 | 1.3
Kurtosis | 0.87 | 1.1
Mean | 1.3 | 1.3
Median Absolute Deviation (MAD) | 1 | 1
Skewness | 1.3 | 1.4
Sum | 1.3 × 10^5 | 3.5 × 10^3
Variance | 2.9 | 2.7
Monotonicity | Not monotonic | Not monotonic

[Figure: Histogram with fixed size bins (bins=7)]
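
Common values

Readmitted within 30 Days_Value=0
Value | Count | Frequency (%)
0 | 44431 | 46.0%
1 | 19644 | 20.3%
2 | 12035 | 12.5%
3 | 8947 | 9.3%
6 | 4687 | 4.9%
Other values (2) | 6850 | 7.1%

Readmitted within 30 Days_Value=1
Value | Count | Frequency (%)
0 | 1248 | 45.4%
1 | 605 | 22.0%
2 | 337 | 12.3%
3 | 256 | 9.3%
6 | 114 | 4.2%
Other values (2) | 186 | 6.8%

Minimum 5 values

Readmitted within 30 Days_Value=0
Value | Count | Frequency (%)
0 | 44431 | 46.0%
1 | 19644 | 20.3%
2 | 12035 | 12.5%
3 | 8947 | 9.3%
4 | 3936 | 4.1%

Readmitted within 30 Days_Value=1
Value | Count | Frequency (%)
0 | 1248 | 45.4%
1 | 605 | 22.0%
2 | 337 | 12.3%
3 | 256 | 9.3%
4 | 113 | 4.1%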
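
Readmitted within 30 Days_Value=1 (same five values; frequencies are consistent with division by the 96594 rows of the other group)
Value | Count | Frequency (%)
0 | 1248 | 1.3%
1 | 605 | 0.6%
2 | 337 | 0.3%
3 | 256 | 0.3%
4 | 113 | 0.1%

Readmitted within 30 Days_Value=0 (same five values; frequencies are consistent with division by the 2746 rows of the other group)
Value | Count | Frequency (%)
0 | 44431 | 1618.0%
1 | 19644 | 715.4%
2 | 12035 | 438.3%
3 | 8947 | 325.8%
4 | 3936 | 143.3%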
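
The last two tables list frequencies that cannot be within-group shares (for example 1618.0%). One reading that fits the numbers is that each count was divided by the other group's number of observations. The sketch below checks that reading for the zero counts of num_procedures; the row totals come from the Overview, and the `pct` helper is a hypothetical name. It is a consistency check on the reported figures, not a statement about how the profiler is implemented.

```python
# Number of observations per group, from the Overview section.
N_OBS = {0: 96594, 1: 2746}
# Rows with num_procedures == 0 in each group.
ZEROS = {0: 44431, 1: 1248}


def pct(count, n_rows):
    """Percentage of n_rows, rounded to one decimal as in the report."""
    return round(100 * count / n_rows, 1)


# Share of zeros within each group (the Zeros (%) row).
within = {g: pct(ZEROS[g], N_OBS[g]) for g in (0, 1)}
# Same counts divided by the other group's row count.
cross = {g: pct(ZEROS[g], N_OBS[1 - g]) for g in (0, 1)}

print("within-group:", within)
print("cross-group: ", cross)
```

The within-group values reproduce the 46.0% and 45.4% in the Zeros (%) row, and the cross-group values reproduce the 1618.0% and 1.3% shown in the last two tables, so only the within-group tables should be read as actual frequencies.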

race
Categorical

Statistic | Readmitted within 30 Days_Value=0 | Readmitted within 30 Days_Value=1
Distinct | 6 | 6
Distinct (%) | < 0.1% | 0.2%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Memory size | 849.2 KiB | 24.3 KiB

Unique

Statistic | Readmitted within 30 Days_Value=0 | Readmitted within 30 Days_Value=1
Unique | 0 | 0
Unique (%) | 0.0% | 0.0%

Sample

Row | Readmitted within 30 Days_Value=0 | Readmitted within 30 Days_Value=1
1st row | Caucasian | Caucasian
2nd row | Caucasian | Caucasian
3rd row | AfricanAmerican | Hispanic
4th row | Caucasian | Caucasian
5th row | Caucasian | Caucasian

Common Values

Readmitted within 30 Days_Value=0
Value | Count | Frequency (%)
Caucasian | 72111 | 74.7%
AfricanAmerican | 18269 | 18.9%
Unknown | 2188 | 2.3%
Hispanic | 1969 | 2.0%
Other | 1441 | 1.5%

Readmitted within 30 Days_Value=1
Value | Count | Frequency (%)
Caucasian | 2109 | 76.8%
AfricanAmerican | 503 | 18.3%
Hispanic | 48 | 1.7%
Unknown | 44 | 1.6%
Other | 30 | 1.1%

Common Values (Plot)

[Figures: common-value bar charts for Readmitted within 30 Days_Value=0 and Readmitted within 30 Days_Value=1]

Readmitted within 30 Days_Value
Real number (ℝ)

Statistic | Readmitted within 30 Days_Value=0 | Readmitted within 30 Days_Value=1
Distinct | 1 | 1
Distinct (%) | < 0.1% | < 0.1%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Infinite | 0 | 0
Infinite (%) | 0.0% | 0.0%
Mean | 0 | 1
Minimum | 0 | 1
Maximum | 0 | 1
Zeros | 96594 | 0
Zeros (%) | 100.0% | 0.0%
Negative | 0 | 0
Negative (%) | 0.0% | 0.0%
Memory size | 1.5 MiB | 42.9 KiB

Quantile statistics

Statistic | Readmitted within 30 Days_Value=0 | Readmitted within 30 Days_Value=1
Minimum | 0 | 1
5-th percentile | 0 | 1
Q1 | 0 | 1
median | 0 | 1
Q3 | 0 | 1
95-th percentile | 0 | 1
Maximum | 0 | 1
Range | 0 | 0
Interquartile range (IQR) | 0 | 0

Descriptive statistics

Statistic | Readmitted within 30 Days_Value=0 | Readmitted within 30 Days_Value=1
Standard deviation | 0 | 0
Coefficient of variation (CV) | nan | 0
Kurtosis | 0 | 0
Mean | 0 | 1
Median Absolute Deviation (MAD) | 0 | 0
Skewness | 0 | 0
Sum | 0 | 2.7 × 10^3
Variance | 0 | 0
Monotonicity | Increasing | Increasing

[Figure: Histogram with fixed size bins (bins=1)]
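
Common values

Readmitted within 30 Days_Value=0
Value | Count | Frequency (%)
0 | 96594 | 100.0%

Readmitted within 30 Days_Value=1
Value | Count | Frequency (%)
1 | 2746 | 100.0%

Minimum 5 values

Readmitted within 30 Days_Value=0
Value | Count | Frequency (%)
0 | 96594 | 100.0%

Readmitted within 30 Days_Value=1
Value | Count | Frequency (%)
1 | 2746 | 100.0%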
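
Readmitted within 30 Days_Value=1 (same value; frequency is consistent with division by the 96594 rows of the other group)
Value | Count | Frequency (%)
1 | 2746 | 2.8%

Readmitted within 30 Days_Value=0 (same value; frequency is consistent with division by the 2746 rows of the other group)
Value | Count | Frequency (%)
0 | 96594 | 3517.6%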

Age
Categorical

Statistic | Readmitted within 30 Days_Value=0 | Readmitted within 30 Days_Value=1
Distinct | 5 | 5
Distinct (%) | < 0.1% | 0.2%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Memory size | 849.2 KiB | 24.3 KiB

Unique

Statistic | Readmitted within 30 Days_Value=0 | Readmitted within 30 Days_Value=1
Unique | 0 | 1
Unique (%) | 0.0% | < 0.1%

Sample

Row | Readmitted within 30 Days_Value=0 | Readmitted within 30 Days_Value=1
1st row | [0-10) | 70+
2nd row | [10-20) | [50-70)
3rd row | [20-50) | 70+
4th row | [20-50) | [50-70)
5th row | [20-50) | 70+

Common Values

Readmitted within 30 Days_Value=0
Value | Count | Frequency (%)
70+ | 43017 | 44.5%
[50-70) | 38107 | 39.5%
[20-50) | 14626 | 15.1%
[10-20) | 685 | 0.7%
[0-10) | 159 | 0.2%

Readmitted within 30 Days_Value=1
Value | Count | Frequency (%)
70+ | 1335 | 48.6%
[50-70) | 1011 | 36.8%
[20-50) | 394 | 14.3%
[10-20) | 5 | 0.2%
[0-10) | 1 | < 0.1%

Common Values (Plot)

[Figures: common-value bar charts for Readmitted within 30 Days_Value=0 and Readmitted within 30 Days_Value=1]

Race
Categorical

Statistic | Readmitted within 30 Days_Value=0 | Readmitted within 30 Days_Value=1
Distinct | 6 | 6
Distinct (%) | < 0.1% | 0.2%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Memory size | 849.2 KiB | 24.3 KiB

Unique

Statistic | Readmitted within 30 Days_Value=0 | Readmitted within 30 Days_Value=1
Unique | 0 | 0
Unique (%) | 0.0% | 0.0%

Sample

Row | Readmitted within 30 Days_Value=0 | Readmitted within 30 Days_Value=1
1st row | Caucasian | Caucasian
2nd row | Caucasian | Caucasian
3rd row | AfricanAmerican | Hispanic
4th row | Caucasian | Caucasian
5th row | Caucasian | Caucasian

Common Values

Readmitted within 30 Days_Value=0
Value | Count | Frequency (%)
Caucasian | 72111 | 74.7%
AfricanAmerican | 18269 | 18.9%
Unknown | 2188 | 2.3%
Hispanic | 1969 | 2.0%
Other | 1441 | 1.5%

Readmitted within 30 Days_Value=1
Value | Count | Frequency (%)
Caucasian | 2109 | 76.8%
AfricanAmerican | 503 | 18.3%
Hispanic | 48 | 1.7%
Unknown | 44 | 1.6%
Other | 30 | 1.1%

Common Values (Plot)

[Figures: common-value bar charts for Readmitted within 30 Days_Value=0 and Readmitted within 30 Days_Value=1]

Gender
Categorical

Statistic | Readmitted within 30 Days_Value=0 | Readmitted within 30 Days_Value=1
Distinct | 2 | 2
Distinct (%) | < 0.1% | 0.1%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Memory size | 849.1 KiB | 24.3 KiB

Unique

Statistic | Readmitted within 30 Days_Value=0 | Readmitted within 30 Days_Value=1
Unique | 0 | 0
Unique (%) | 0.0% | 0.0%

Sample

Row | Readmitted within 30 Days_Value=0 | Readmitted within 30 Days_Value=1
1st row | Female | Female
2nd row | Female | Male
3rd row | Female | Female
4th row | Male | Male
5th row | Male | Female

Common Values

Readmitted within 30 Days_Value=0
Value | Count | Frequency (%)
Female | 51990 | 53.8%
Male | 44604 | 46.2%

Readmitted within 30 Days_Value=1
Value | Count | Frequency (%)
Female | 1464 | 53.3%
Male | 1282 | 46.7%

Common Values (Plot)

[Figures: common-value bar charts for Readmitted within 30 Days_Value=0 and Readmitted within 30 Days_Value=1]