Overview

Brought to you by YData

Dataset statistics

Number of variables26
Number of observations99340
Missing cells0
Missing cells (%)0.0%
Total size in memory11.1 MiB
Average record size in memory117.0 B

Variable types

Categorical13
Numeric13

Alerts

A1Cresult is highly imbalanced (54.4%) Imbalance
admission_type_id is highly imbalanced (50.7%) Imbalance
metformin is highly imbalanced (59.1%) Imbalance
race is highly imbalanced (55.8%) Imbalance
Race is highly imbalanced (55.8%) Imbalance
A1C is highly imbalanced (54.4%) Imbalance
Taking Metformin is highly imbalanced (59.1%) Imbalance
num_procedures has 45679 (46.0%) zeros Zeros
Readmitted within 30 Days_Value has 86780 (87.4%) zeros Zeros
Readmitted (Any)_Value has 52524 (52.9%) zeros Zeros
Long Hosptial Stay (>7 days)_Value has 84629 (85.2%) zeros Zeros
Taking Metformin_Value has 78858 (79.4%) zeros Zeros

Reproduction

Analysis started2024-11-13 21:55:03.654378
Analysis finished2024-11-13 21:55:04.126893
Duration0.47 seconds
Software versionydata-profiling vv4.12.0
Download configurationconfig.json

Variables

A1Cresult
Categorical

Imbalance 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size97.3 KiB
None
82506 
>8
 
8137
Norm
 
4922
>7
 
3775

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNone
2nd rowNorm
3rd rowNone
4th rowNorm
5th row>7

Common Values

ValueCountFrequency (%)
None 82506
83.1%
>8 8137
 
8.2%
Norm 4922
 
5.0%
>7 3775
 
3.8%

Common Values (Plot)

2024-11-13T21:55:04.209005image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

ExpectedHospitalStay
Real number (ℝ)

Distinct99339
Distinct (%)> 99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.4
Minimum0.57
Maximum13
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size776.2 KiB
2024-11-13T21:55:04.389021image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Quantile statistics

Minimum0.57
5-th percentile1.9
Q12.9
median4
Q35.5
95-th percentile8.3
Maximum13
Range13
Interquartile range (IQR)2.5

Descriptive statistics

Standard deviation1.9
Coefficient of variation (CV)0.44
Kurtosis0.55
Mean4.4
Median Absolute Deviation (MAD)1.2
Skewness0.92
Sum4.4 × 105
Variance3.8
MonotonicityNot monotonic
2024-11-13T21:55:04.611945image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1.686407099 2
 
< 0.1%
7.420619688 1
 
< 0.1%
4.182485513 1
 
< 0.1%
2.108085838 1
 
< 0.1%
5.271913103 1
 
< 0.1%
Other values (99334) 99334
> 99.9%
ValueCountFrequency (%)
0.5665617094 1
< 0.1%
0.638833411 1
< 0.1%
0.6524052199 1
< 0.1%
0.7074318727 1
< 0.1%
0.7082886994 1
< 0.1%
ValueCountFrequency (%)
13.16913631 1
< 0.1%
13.16394497 1
< 0.1%
12.64407677 1
< 0.1%
12.5914795 1
< 0.1%
12.45426285 1
< 0.1%

Risk30DayReadmission
Real number (ℝ)

Distinct99280
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.11
Minimum0.016
Maximum0.82
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size776.2 KiB
2024-11-13T21:55:04.823958image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Quantile statistics

Minimum0.016
5-th percentile0.041
Q10.066
median0.095
Q30.14
95-th percentile0.25
Maximum0.82
Range0.81
Interquartile range (IQR)0.075

Descriptive statistics

Standard deviation0.071
Coefficient of variation (CV)0.62
Kurtosis7.4
Mean0.11
Median Absolute Deviation (MAD)0.034
Skewness2.1
Sum1.1 × 104
Variance0.0051
MonotonicityNot monotonic
2024-11-13T21:55:05.180549image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.02352751151 6
 
< 0.1%
0.02252679675 4
 
< 0.1%
0.02196621592 3
 
< 0.1%
0.07953060927 3
 
< 0.1%
0.1096864509 3
 
< 0.1%
Other values (99275) 99321
> 99.9%
ValueCountFrequency (%)
0.01552467463 1
< 0.1%
0.01571290072 1
< 0.1%
0.01635416876 1
< 0.1%
0.0172317391 1
< 0.1%
0.01735979561 1
< 0.1%
ValueCountFrequency (%)
0.8235683045 1
< 0.1%
0.808131291 1
< 0.1%
0.7847500335 1
< 0.1%
0.7663267905 1
< 0.1%
0.7587227324 1
< 0.1%
Distinct78
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11
Minimum1
Maximum82
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size776.2 KiB
2024-11-13T21:55:05.401647image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4
Q16
median9
Q314
95-th percentile24
Maximum82
Range81
Interquartile range (IQR)8

Descriptive statistics

Standard deviation7.1
Coefficient of variation (CV)0.65
Kurtosis7.3
Mean11
Median Absolute Deviation (MAD)3
Skewness2.1
Sum1.1 × 106
Variance51
MonotonicityNot monotonic
2024-11-13T21:55:05.630459image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
6 9593
 
9.7%
7 9154
 
9.2%
8 8444
 
8.5%
5 8440
 
8.5%
9 7301
 
7.3%
Other values (73) 56408
56.8%
ValueCountFrequency (%)
1 47
 
< 0.1%
2 1047
 
1.1%
3 3257
 
3.3%
4 6252
6.3%
5 8440
8.5%
ValueCountFrequency (%)
82 1
 
< 0.1%
80 1
 
< 0.1%
78 1
 
< 0.1%
76 1
 
< 0.1%
75 3
< 0.1%

RiskAnyReadmission
Real number (ℝ)

Distinct99336
Distinct (%)> 99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.47
Minimum0.043
Maximum0.97
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size776.2 KiB
2024-11-13T21:55:05.854399image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Quantile statistics

Minimum0.043
5-th percentile0.18
Q10.32
median0.47
Q30.62
95-th percentile0.78
Maximum0.97
Range0.93
Interquartile range (IQR)0.29

Descriptive statistics

Standard deviation0.19
Coefficient of variation (CV)0.4
Kurtosis-0.81
Mean0.47
Median Absolute Deviation (MAD)0.15
Skewness0.12
Sum4.7 × 104
Variance0.035
MonotonicityNot monotonic
2024-11-13T21:55:06.070732image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.6885446923 2
 
< 0.1%
0.5472644277 2
 
< 0.1%
0.1479199291 2
 
< 0.1%
0.6361343963 2
 
< 0.1%
0.2701160744 1
 
< 0.1%
Other values (99331) 99331
> 99.9%
ValueCountFrequency (%)
0.04291416072 1
< 0.1%
0.04718846563 1
< 0.1%
0.05053059496 1
< 0.1%
0.05342341633 1
< 0.1%
0.05499731173 1
< 0.1%
ValueCountFrequency (%)
0.9742692111 1
< 0.1%
0.9730703526 1
< 0.1%
0.9726708088 1
< 0.1%
0.9725797274 1
< 0.1%
0.9698118383 1
< 0.1%

RiskLongStay
Real number (ℝ)

Distinct99339
Distinct (%)> 99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.15
Minimum0.0025
Maximum0.95
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size776.2 KiB
2024-11-13T21:55:06.283305image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Quantile statistics

Minimum0.0025
5-th percentile0.0094
Q10.025
median0.066
Q30.19
95-th percentile0.6
Maximum0.95
Range0.95
Interquartile range (IQR)0.16

Descriptive statistics

Standard deviation0.19
Coefficient of variation (CV)1.3
Kurtosis3.1
Mean0.15
Median Absolute Deviation (MAD)0.05
Skewness1.9
Sum1.5 × 104
Variance0.035
MonotonicityNot monotonic
2024-11-13T21:55:06.493099image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.03402606828 2
 
< 0.1%
0.4274272409 1
 
< 0.1%
0.01100459573 1
 
< 0.1%
0.009910289418 1
 
< 0.1%
0.1922841583 1
 
< 0.1%
Other values (99334) 99334
> 99.9%
ValueCountFrequency (%)
0.002549197336 1
< 0.1%
0.002564021901 1
< 0.1%
0.002569616157 1
< 0.1%
0.002762570192 1
< 0.1%
0.00285376883 1
< 0.1%
ValueCountFrequency (%)
0.9546556141 1
< 0.1%
0.9528038173 1
< 0.1%
0.9526077773 1
< 0.1%
0.9512802711 1
< 0.1%
0.9498346795 1
< 0.1%

admission_type_id
Categorical

Imbalance 

Distinct5
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size97.3 KiB
Emergency
70501 
Elective
18667 
Unknown
10144 
Trauma Center
 
18
New Born
 
10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowElective
2nd rowEmergency
3rd rowUnknown
4th rowUnknown
5th rowUnknown

Common Values

ValueCountFrequency (%)
Emergency 70501
71.0%
Elective 18667
 
18.8%
Unknown 10144
 
10.2%
Trauma Center 18
 
< 0.1%
New Born 10
 
< 0.1%

Common Values (Plot)

2024-11-13T21:55:06.663033image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

age
Categorical

Distinct5
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size97.3 KiB
70+
44352 
[50-70)
39118 
[20-50)
15020 
[10-20)
 
690
[0-10)
 
160

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row70+
2nd row70+
3rd row[20-50)
4th row70+
5th row70+

Common Values

ValueCountFrequency (%)
70+ 44352
44.6%
[50-70) 39118
39.4%
[20-50) 15020
 
15.1%
[10-20) 690
 
0.7%
[0-10) 160
 
0.2%

Common Values (Plot)

2024-11-13T21:55:06.815358image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

gender
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size97.3 KiB
Female
53454 
Male
45886 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowFemale
2nd rowMale
3rd rowMale
4th rowFemale
5th rowFemale

Common Values

ValueCountFrequency (%)
Female 53454
53.8%
Male 45886
46.2%

Common Values (Plot)

2024-11-13T21:55:06.950933image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

insulin
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size97.3 KiB
No
46376 
Steady
30069 
Down
11908 
Up
10987 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowDown
2nd rowUp
3rd rowNo
4th rowNo
5th rowUp

Common Values

ValueCountFrequency (%)
No 46376
46.7%
Steady 30069
30.3%
Down 11908
 
12.0%
Up 10987
 
11.1%

Common Values (Plot)

2024-11-13T21:55:07.077397image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

metformin
Categorical

Imbalance 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size97.3 KiB
No
79497 
Steady
18206 
Up
 
1063
Down
 
574

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowSteady
2nd rowNo
3rd rowSteady
4th rowSteady
5th rowNo

Common Values

ValueCountFrequency (%)
No 79497
80.0%
Steady 18206
 
18.3%
Up 1063
 
1.1%
Down 574
 
0.6%

Common Values (Plot)

2024-11-13T21:55:07.218017image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

num_medications
Real number (ℝ)

Distinct75
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean16
Minimum1
Maximum81
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size776.2 KiB
2024-11-13T21:55:07.392267image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile6
Q110
median15
Q320
95-th percentile31
Maximum81
Range80
Interquartile range (IQR)10

Descriptive statistics

Standard deviation8.1
Coefficient of variation (CV)0.51
Kurtosis3.5
Mean16
Median Absolute Deviation (MAD)5
Skewness1.3
Sum1.6 × 106
Variance66
MonotonicityNot monotonic
2024-11-13T21:55:07.615667image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
13 5976
 
6.0%
12 5888
 
5.9%
11 5696
 
5.7%
15 5694
 
5.7%
14 5592
 
5.6%
Other values (70) 70494
71.0%
ValueCountFrequency (%)
1 260
 
0.3%
2 457
 
0.5%
3 874
0.9%
4 1383
1.4%
5 1964
2.0%
ValueCountFrequency (%)
81 1
 
< 0.1%
79 1
 
< 0.1%
75 2
< 0.1%
74 1
 
< 0.1%
72 3
< 0.1%

num_procedures
Real number (ℝ)

Zeros 

Distinct7
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.3
Minimum0
Maximum6
Zeros45679
Zeros (%)46.0%
Negative0
Negative (%)0.0%
Memory size776.2 KiB
2024-11-13T21:55:07.791862image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median1
Q32
95-th percentile5
Maximum6
Range6
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.7
Coefficient of variation (CV)1.3
Kurtosis0.88
Mean1.3
Median Absolute Deviation (MAD)1
Skewness1.3
Sum1.3 × 105
Variance2.9
MonotonicityNot monotonic
2024-11-13T21:55:07.936436image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
0 45679
46.0%
1 20249
20.4%
2 12372
 
12.5%
3 9203
 
9.3%
6 4801
 
4.8%
Other values (2) 7036
 
7.1%
ValueCountFrequency (%)
0 45679
46.0%
1 20249
20.4%
2 12372
 
12.5%
3 9203
 
9.3%
4 4049
 
4.1%
ValueCountFrequency (%)
6 4801
 
4.8%
5 2987
 
3.0%
4 4049
 
4.1%
3 9203
9.3%
2 12372
12.5%

race
Categorical

Imbalance 

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size97.4 KiB
Caucasian
74220 
AfricanAmerican
18772 
Unknown
 
2232
Hispanic
 
2017
Other
 
1471

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowCaucasian
2nd rowCaucasian
3rd rowCaucasian
4th rowCaucasian
5th rowCaucasian

Common Values

ValueCountFrequency (%)
Caucasian 74220
74.7%
AfricanAmerican 18772
 
18.9%
Unknown 2232
 
2.2%
Hispanic 2017
 
2.0%
Other 1471
 
1.5%

Common Values (Plot)

2024-11-13T21:55:08.091687image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Readmitted within 30 Days_Value
Real number (ℝ)

Zeros 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.13
Minimum0
Maximum1
Zeros86780
Zeros (%)87.4%
Negative0
Negative (%)0.0%
Memory size776.2 KiB
2024-11-13T21:55:08.240677image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1
Maximum1
Range1
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.33
Coefficient of variation (CV)2.6
Kurtosis3.1
Mean0.13
Median Absolute Deviation (MAD)0
Skewness2.2
Sum1.3 × 104
Variance0.11
MonotonicityNot monotonic
2024-11-13T21:55:08.384976image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
Histogram with fixed size bins (bins=2)
ValueCountFrequency (%)
0 86780
87.4%
1 12560
 
12.6%
ValueCountFrequency (%)
0 86780
87.4%
1 12560
 
12.6%
ValueCountFrequency (%)
1 12560
 
12.6%
0 86780
87.4%

Readmitted (Any)_Value
Real number (ℝ)

Zeros 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.47
Minimum0
Maximum1
Zeros52524
Zeros (%)52.9%
Negative0
Negative (%)0.0%
Memory size776.2 KiB
2024-11-13T21:55:08.524678image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31
95-th percentile1
Maximum1
Range1
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.5
Coefficient of variation (CV)1.1
Kurtosis-2
Mean0.47
Median Absolute Deviation (MAD)0
Skewness0.12
Sum4.7 × 104
Variance0.25
MonotonicityNot monotonic
2024-11-13T21:55:08.670383image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
Histogram with fixed size bins (bins=2)
ValueCountFrequency (%)
0 52524
52.9%
1 46816
47.1%
ValueCountFrequency (%)
0 52524
52.9%
1 46816
47.1%
ValueCountFrequency (%)
1 46816
47.1%
0 52524
52.9%

Long Hosptial Stay (>7 days)_Value
Real number (ℝ)

Zeros 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.15
Minimum0
Maximum1
Zeros84629
Zeros (%)85.2%
Negative0
Negative (%)0.0%
Memory size776.2 KiB
2024-11-13T21:55:08.814177image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1
Maximum1
Range1
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.36
Coefficient of variation (CV)2.4
Kurtosis1.9
Mean0.15
Median Absolute Deviation (MAD)0
Skewness2
Sum1.5 × 104
Variance0.13
MonotonicityDecreasing
2024-11-13T21:55:08.957721image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
Histogram with fixed size bins (bins=2)
ValueCountFrequency (%)
0 84629
85.2%
1 14711
 
14.8%
ValueCountFrequency (%)
0 84629
85.2%
1 14711
 
14.8%
ValueCountFrequency (%)
1 14711
 
14.8%
0 84629
85.2%
Distinct14
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.4
Minimum1
Maximum14
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size776.2 KiB
2024-11-13T21:55:09.100844image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median4
Q36
95-th percentile11
Maximum14
Range13
Interquartile range (IQR)4

Descriptive statistics

Standard deviation3
Coefficient of variation (CV)0.68
Kurtosis0.89
Mean4.4
Median Absolute Deviation (MAD)2
Skewness1.1
Sum4.4 × 105
Variance8.8
MonotonicityDecreasing
2024-11-13T21:55:09.267953image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
Histogram with fixed size bins (bins=14)
ValueCountFrequency (%)
3 17432
17.5%
2 16891
17.0%
1 13822
13.9%
4 13684
13.8%
5 9749
 
9.8%
Other values (9) 27762
27.9%
ValueCountFrequency (%)
1 13822
13.9%
2 16891
17.0%
3 17432
17.5%
4 13684
13.8%
5 9749
9.8%
ValueCountFrequency (%)
14 995
1.0%
13 1152
1.2%
12 1383
1.4%
11 1770
1.8%
10 2262
2.3%
Distinct14
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.4
Minimum1
Maximum14
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size776.2 KiB
2024-11-13T21:55:09.426973image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median4
Q36
95-th percentile11
Maximum14
Range13
Interquartile range (IQR)4

Descriptive statistics

Standard deviation3
Coefficient of variation (CV)0.68
Kurtosis0.89
Mean4.4
Median Absolute Deviation (MAD)2
Skewness1.1
Sum4.4 × 105
Variance8.8
MonotonicityDecreasing
2024-11-13T21:55:09.595826image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
Histogram with fixed size bins (bins=14)
ValueCountFrequency (%)
3 17432
17.5%
2 16891
17.0%
1 13822
13.9%
4 13684
13.8%
5 9749
 
9.8%
Other values (9) 27762
27.9%
ValueCountFrequency (%)
1 13822
13.9%
2 16891
17.0%
3 17432
17.5%
4 13684
13.8%
5 9749
9.8%
ValueCountFrequency (%)
14 995
1.0%
13 1152
1.2%
12 1383
1.4%
11 1770
1.8%
10 2262
2.3%

Taking Metformin_Value
Real number (ℝ)

Zeros 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.21
Minimum0
Maximum2
Zeros78858
Zeros (%)79.4%
Negative0
Negative (%)0.0%
Memory size776.2 KiB
2024-11-13T21:55:09.748173image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1
Maximum2
Range2
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.41
Coefficient of variation (CV)2
Kurtosis0.26
Mean0.21
Median Absolute Deviation (MAD)0
Skewness1.5
Sum2.1 × 104
Variance0.17
MonotonicityNot monotonic
2024-11-13T21:55:09.902670image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/
Histogram with fixed size bins (bins=3)
ValueCountFrequency (%)
0 78858
79.4%
1 20423
 
20.6%
2 59
 
0.1%
ValueCountFrequency (%)
0 78858
79.4%
1 20423
 
20.6%
2 59
 
0.1%
ValueCountFrequency (%)
2 59
 
0.1%
1 20423
 
20.6%
0 78858
79.4%

Age
Categorical

Distinct5
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size97.3 KiB
70+
44352 
[50-70)
39118 
[20-50)
15020 
[10-20)
 
690
[0-10)
 
160

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row70+
2nd row70+
3rd row[20-50)
4th row70+
5th row70+

Common Values

ValueCountFrequency (%)
70+ 44352
44.6%
[50-70) 39118
39.4%
[20-50) 15020
 
15.1%
[10-20) 690
 
0.7%
[0-10) 160
 
0.2%

Common Values (Plot)

2024-11-13T21:55:10.062698image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Race
Categorical

Imbalance 

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size97.4 KiB
Caucasian
74220 
AfricanAmerican
18772 
Unknown
 
2232
Hispanic
 
2017
Other
 
1471

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowCaucasian
2nd rowCaucasian
3rd rowCaucasian
4th rowCaucasian
5th rowCaucasian

Common Values

ValueCountFrequency (%)
Caucasian 74220
74.7%
AfricanAmerican 18772
 
18.9%
Unknown 2232
 
2.2%
Hispanic 2017
 
2.0%
Other 1471
 
1.5%

Common Values (Plot)

2024-11-13T21:55:10.215134image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Gender
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size97.3 KiB
Female
53454 
Male
45886 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowFemale
2nd rowMale
3rd rowMale
4th rowFemale
5th rowFemale

Common Values

ValueCountFrequency (%)
Female 53454
53.8%
Male 45886
46.2%

Common Values (Plot)

2024-11-13T21:55:10.515622image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

A1C
Categorical

Imbalance 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size97.3 KiB
None
82506 
>8
 
8137
Norm
 
4922
>7
 
3775

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNone
2nd rowNorm
3rd rowNone
4th rowNorm
5th row>7

Common Values

ValueCountFrequency (%)
None 82506
83.1%
>8 8137
 
8.2%
Norm 4922
 
5.0%
>7 3775
 
3.8%

Common Values (Plot)

2024-11-13T21:55:10.637887image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Taking Insulin
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size97.3 KiB
No
46376 
Steady
30069 
Down
11908 
Up
10987 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowDown
2nd rowUp
3rd rowNo
4th rowNo
5th rowUp

Common Values

ValueCountFrequency (%)
No 46376
46.7%
Steady 30069
30.3%
Down 11908
 
12.0%
Up 10987
 
11.1%

Common Values (Plot)

2024-11-13T21:55:10.774670image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Taking Metformin
Categorical

Imbalance 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size97.3 KiB
No
79497 
Steady
18206 
Up
 
1063
Down
 
574

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowSteady
2nd rowNo
3rd rowSteady
4th rowSteady
5th rowNo

Common Values

ValueCountFrequency (%)
No 79497
80.0%
Steady 18206
 
18.3%
Up 1063
 
1.1%
Down 574
 
0.6%

Common Values (Plot)

2024-11-13T21:55:10.915087image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/