loading
This data loader report details the issues addressed during the most recent REDCap access.
processing date: 2025-06-19 06:09
reading data/data.csv, df.shape=(257, 59)
duplicated record_ids
5 4ce196bbdff36bf4da47b365a8433a16
10 0ba8ee0a0749ef417117a7bbbfea36ae
14 260b69dec89fd672a0ef4b415cab1e05
22 dd9bafda99effda2f73e9aa54ee5441d
25 79fb5958917a2748a9df5c1128e013df
32 65ef887f715f8e2f5f1743bdf0e22674
41 50132229c6f4a8088cc03113436e9d62
49 90f691d8a80a7a0771f4fc50620cec83
58 cc9d73a375f2bb847dfae3f900a3b356
65 2374b05459a6787a8860a7e9674d7a84
71 2d65c3d011f0e383240fbf710f7b76c2
74 276ffcbcd7c9774d786571571b127e4f
106 75c42d0c8f8d19a87dd7e55a8adb6aaf
119 c903ea2f39cf07d3b0c4b96f2516bf0d
130 a73216979646725244e7faa3db08ea1d
134 1c5c4c703c25611a04a8557579b49ce7
137 51219526fc66f7d3aa94fa75ff83fa17
139 829b032c4f5e0a790da4453267842464
142 9be0659ff7e603150cd841c67d6e418c
144 a0998c640bb39c4d66d74771863b0ab0
203 6a3b0bb0617eeb3637ea43b6a27d4a70
Name: record_id, dtype: object
257 rows read with 236 unique record_id values
236 rows after removing duplicates
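
The duplicate check above can be reproduced with a minimal pandas sketch. It is an illustration on toy data, not the loader's actual code: the column values are invented, and `keep="first"` (keeping the earliest copy of each record_id) is an assumption, since the report does not say which copy survives.

```python
import pandas as pd

# Toy stand-in for data/data.csv (the real file has 257 rows and 59 columns).
df = pd.DataFrame({
    "record_id": ["a1", "b2", "a1", "c3", "b2"],
    "age": [61, 45, 61, 70, 45],
})

# Rows whose record_id already appeared earlier in the file.
# keep="first" is an assumption; the report does not state which copy is kept.
dupes = df.loc[df["record_id"].duplicated(keep="first"), "record_id"]
print("duplicated record_ids")
print(dupes)

n_rows = len(df)
n_unique = df["record_id"].nunique()
print(f"{n_rows} rows read with {n_unique} unique values")

# Drop the later copies so each record_id appears exactly once.
df = df.drop_duplicates(subset="record_id", keep="first").reset_index(drop=True)
print(f"{len(df)} after removing duplicates")
```

In the report, the 21 listed rows account for the difference between 257 rows read and 236 rows kept.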
reading data/color.csv
Creating "ned" from "ned_time"
WARNING in ned -- confirming "5+ yrs" for: 91.0
WARNING in ned -- confirming "5+ yrs" for: 82.0
WARNING in ned -- confirming "5+ yrs" for: 68.0
WARNING in ned -- confirming "5+ yrs" for: 115.0
WARNING in ned -- confirming "5+ yrs" for: 121.0
WARNING in ned -- confirming "5+ yrs" for: 92.0
WARNING in ned -- confirming "5+ yrs" for: 94.0
WARNING in ned -- confirming "5+ yrs" for: 67.0
WARNING in ned -- confirming "5+ yrs" for: 81.0
WARNING in ned -- confirming "5+ yrs" for: 77.0
WARNING in ned -- confirming "5+ yrs" for: 76.0
WARNING in ned -- confirming "5+ yrs" for: 68.0
WARNING in ned -- confirming "5+ yrs" for: 68.0
WARNING in ned -- confirming "5+ yrs" for: 317.0
WARNING in ned -- confirming "5+ yrs" for: 125.0
WARNING in ned -- confirming "5+ yrs" for: 88.0
WARNING in ned -- confirming "5+ yrs" for: 168.0
WARNING in ned -- confirming "5+ yrs" for: 147.0
WARNING in ned -- confirming "5+ yrs" for: 311.0
WARNING in ned -- confirming "5+ yrs" for: 102.0
WARNING in ned -- confirming "5+ yrs" for: 126.0
WARNING in ned -- confirming "5+ yrs" for: 110.0
WARNING in ned -- confirming "5+ yrs" for: 66.0
WARNING in ned -- confirming "5+ yrs" for: 73.0
WARNING in ned -- confirming "5+ yrs" for: 80.0
WARNING in ned -- confirming "5+ yrs" for: 168.0
WARNING in ned -- confirming "5+ yrs" for: 117.0
WARNING in ned -- confirming "5+ yrs" for: 121.0
WARNING in ned -- confirming "5+ yrs" for: 119.0
WARNING in ned -- confirming "5+ yrs" for: 82.0
WARNING in ned -- confirming "5+ yrs" for: 79.0
WARNING in ned -- confirming "5+ yrs" for: 88.0
WARNING in ned -- confirming "5+ yrs" for: 74.0
WARNING in ned -- confirming "5+ yrs" for: 64.0
WARNING in ned -- confirming "5+ yrs" for: 92.0
Null values: 74 out of 236
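
A recoding step like the one logged above might look like the following sketch. Several things here are assumptions rather than facts from the report: that `ned_time` is measured in months, that the "5+ yrs" cutoff is 60 months (every flagged value in the log is above 60), and the labels and boundaries for shorter durations, which are placeholders.

```python
import math

FIVE_YEARS_MONTHS = 60  # assumed cutoff; all flagged values in the log exceed it


def ned_category(ned_time):
    """Map a duration (assumed to be months) to a coarse label.

    Only "5+ yrs" appears in the report; the other labels are placeholders.
    """
    if ned_time is None or (isinstance(ned_time, float) and math.isnan(ned_time)):
        return None  # missing input stays missing
    if ned_time > FIVE_YEARS_MONTHS:
        print(f'WARNING in ned -- confirming "5+ yrs" for: {ned_time}')
        return "5+ yrs"
    if ned_time < 12:
        return "<1 yr"  # placeholder label
    return "1-5 yrs"  # placeholder label


values = [91.0, float("nan"), 30.0, 8.0, None]
ned = [ned_category(v) for v in values]
n_null = sum(label is None for label in ned)
print(f"Null values: {n_null} out of {len(ned)}")
```

The null count in the report (74 out of 236) matches the `ned_time` count of 162 in the summary table, which suggests that missing inputs pass through as missing.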
Creating "age_group" from "age"
WARNING in age_group -- missing data -- returning None for: nan
Null values: 1 out of 236
Creating "cs_visit" from "cancer_state_visit"
WARNING in cs_visit, returning 2 for cancer_stage_visit: Responding
WARNING in cs_visit, returning 2 for cancer_stage_visit: Responding
WARNING in cs_visit, returning 2 for cancer_stage_visit: Local or regional recurrence/relapse
WARNING in cs_visit, returning 2 for cancer_stage_visit: Progressive disease
WARNING in cs_visit, returning 2 for cancer_stage_visit: 0, no evidence
WARNING in cs_visit, returning 2 for cancer_stage_visit: Progressive disease
WARNING in cs_visit, returning 2 for cancer_stage_visit: PSA rising
WARNING in cs_visit, returning 2 for cancer_stage_visit: 0
WARNING in cs_visit, returning 2 for cancer_stage_visit: Angioimmunoblastic T-cell lymphoma
WARNING in cs_visit, returning 2 for cancer_stage_visit: recurrent pilocytic astrocytoma s/p - chemotherapy x2, now on observation
WARNING in cs_visit, returning 2 for cancer_stage_visit: Grade II oligodendroglioma
Null values: 128 out of 236
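
The warnings above suggest that free-text entries the loader does not recognize fall back to code 2. A hedged sketch of that pattern follows; the lookup table is hypothetical, because the report does not show which labels map to codes 0 and 1, and the meaning of code 2 is not stated.

```python
import math

# Hypothetical lookup: the real labels for codes 0 and 1 are not in the report.
KNOWN_CODES = {
    "placeholder label for code 0": 0,
    "placeholder label for code 1": 1,
}
FALLBACK_CODE = 2  # the value the warnings report for unrecognized text


def cs_visit_code(raw):
    """Return a numeric code for a free-text entry, or None when missing."""
    if raw is None or (isinstance(raw, float) and math.isnan(raw)):
        return None
    text = str(raw).strip()
    if text in KNOWN_CODES:
        return KNOWN_CODES[text]
    print(f"WARNING in cs_visit, returning {FALLBACK_CODE} for cancer_stage_visit: {text}")
    return FALLBACK_CODE


codes = [cs_visit_code(v) for v in ["Responding", "placeholder label for code 1", None]]
print(codes)
```

The summary table is consistent with this reading: a `cs_visit` count of 108, mean 0.435, and std 0.674 correspond to 11 values of 2, the same as the number of warnings.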
summary stats
-------- summary by question group ---------
236 entries processed on 2025-06-19 06:09
           age   gender     ecog  ned_time       cs  cs_visit
count  235.000  233.000  235.000   162.000  190.000   108.000
mean    60.860    1.356    0.770    40.352    2.900     0.435
std     17.611    0.480    0.553    47.314    0.946     0.674
min     20.000    1.000    0.000     0.000    1.000     0.000
25%     49.500    1.000    0.000     8.250    2.000     0.000
50%     65.000    1.000    1.000    30.000    3.000     0.000
75%     74.000    2.000    1.000    51.500    4.000     1.000
max     89.000    2.000    2.000   317.000    4.000     2.000
-------- summary by question ---------
236 entries processed on 2025-06-19 06:09
            b1       b2       b3
count  236.000  235.000  236.000
mean     1.314    0.817    0.809
std      1.146    0.963    0.964
min      0.000    0.000    0.000
25%      0.000    0.000    0.000
50%      1.000    1.000    1.000
75%      2.000    1.000    1.000
max      4.000    4.000    4.000
            c1       c2       c3       c4       c5       c6       c7       c8       c9
count  236.000  234.000  232.000  234.000  234.000  228.000  228.000  228.000  229.000
mean     1.695    1.406    2.121    2.765    2.671    1.711    2.070    1.066    2.760
std      0.923    1.007    1.114    1.243    1.260    1.319    1.305    1.172    1.169
min      0.000    0.000    0.000    0.000    0.000    0.000    0.000    0.000    0.000
25%      1.000    1.000    1.000    2.000    2.000    0.000    1.000    0.000    2.000
50%      2.000    1.000    2.000    3.000    3.000    2.000    2.000    1.000    3.000
75%      2.000    2.000    3.000    4.000    4.000    3.000    3.000    2.000    4.000
max      4.000    4.000    4.000    4.000    4.000    4.000    4.000    4.000    4.000
            d1       d2       d3       d4       d5       d6       d7
count  231.000  232.000  230.000  230.000  227.000  233.000  233.000
mean     2.515    2.401    2.409    1.035    2.383    2.682    2.717
std      0.678    0.744    0.685    0.962    0.644    0.493    0.539
min      0.000    0.000    0.000    0.000    0.000    1.000    0.000
25%      2.000    2.000    2.000    0.000    2.000    2.000    3.000
50%      3.000    3.000    2.000    1.000    2.000    3.000    3.000
75%      3.000    3.000    3.000    2.000    3.000    3.000    3.000
max      3.000    3.000    3.000    3.000    3.000    3.000    3.000
            e1       e2       e3       e4       e5       e6       e7
count  227.000  227.000  227.000  228.000  228.000  228.000  228.000
mean     2.282    2.132    1.599    2.118    3.018    1.614    1.482
std      1.047    1.000    0.858    1.134    1.110    1.015    0.852
min      1.000    1.000    1.000    1.000    1.000    1.000    1.000
25%      1.000    1.000    1.000    1.000    2.000    1.000    1.000
50%      2.000    2.000    1.000    2.000    3.000    1.000    1.000
75%      3.000    3.000    2.000    3.000    4.000    2.000    2.000
max      4.000    4.000    4.000    4.000    4.000    4.000    4.000
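
Per-question tables of this shape are what `DataFrame.describe()` produces. The sketch below shows one plausible way to print them with three decimals; the toy data and the `groups` dictionary are illustrative assumptions, not the loader's real configuration.

```python
import pandas as pd

# Toy responses; None marks an unanswered question.
df = pd.DataFrame({
    "b1": [0, 1, 2, 4],
    "b2": [0, 1, None, 4],
    "b3": [1, 1, 0, 2],
})

# Hypothetical grouping of question columns by prefix.
groups = {"b": ["b1", "b2", "b3"]}

print(f"{len(df)} entries processed")
for name, cols in groups.items():
    # describe() skips missing values, so "count" can differ per column.
    summary = df[cols].describe()
    print(summary.to_string(float_format=lambda v: f"{v:.3f}"))
```

Because `count` excludes missing answers, a count below 236 in the tables above indicates how many respondents skipped that question.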
tidy data: merged.shape=(6136, 17)
writing data/tidy.csv
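
The tidy shape is consistent with one row per respondent per question: the summary tables list 26 question columns (b1-b3, c1-c9, d1-d7, e1-e7), and 236 × 26 = 6136. A minimal sketch of such a wide-to-long reshape with `DataFrame.melt` follows. The column names `question` and `response` are hypothetical, and the merge with data/color.csv is not shown because the report gives no detail about it.

```python
import pandas as pd

# Toy wide table: one row per respondent, one column per question.
wide = pd.DataFrame({
    "record_id": ["a1", "b2"],
    "age": [61, 45],
    "b1": [1, 0],
    "b2": [2, 1],
    "c1": [3, 4],
})
question_cols = ["b1", "b2", "c1"]

# Reshape to long form: one row per (respondent, question) pair.
# "question" and "response" are illustrative names, not from the report.
tidy = wide.melt(
    id_vars=["record_id", "age"],
    value_vars=question_cols,
    var_name="question",
    value_name="response",
)
print(f"tidy data: tidy.shape={tidy.shape}")
```

Here 2 respondents and 3 questions give 6 rows, the same arithmetic as 236 respondents and 26 questions giving 6136.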