loading

Data loader report provides detail on issues addressed during most recent REDCap access...

processing date: 2025-06-19 06:09
reading data/data.csv, df.shape=(257, 59)
duplicated record_ids
5      4ce196bbdff36bf4da47b365a8433a16
10     0ba8ee0a0749ef417117a7bbbfea36ae
14     260b69dec89fd672a0ef4b415cab1e05
22     dd9bafda99effda2f73e9aa54ee5441d
25     79fb5958917a2748a9df5c1128e013df
32     65ef887f715f8e2f5f1743bdf0e22674
41     50132229c6f4a8088cc03113436e9d62
49     90f691d8a80a7a0771f4fc50620cec83
58     cc9d73a375f2bb847dfae3f900a3b356
65     2374b05459a6787a8860a7e9674d7a84
71     2d65c3d011f0e383240fbf710f7b76c2
74     276ffcbcd7c9774d786571571b127e4f
106    75c42d0c8f8d19a87dd7e55a8adb6aaf
119    c903ea2f39cf07d3b0c4b96f2516bf0d
130    a73216979646725244e7faa3db08ea1d
134    1c5c4c703c25611a04a8557579b49ce7
137    51219526fc66f7d3aa94fa75ff83fa17
139    829b032c4f5e0a790da4453267842464
142    9be0659ff7e603150cd841c67d6e418c
144    a0998c640bb39c4d66d74771863b0ab0
203    6a3b0bb0617eeb3637ea43b6a27d4a70
Name: record_id, dtype: object 

257 rows read with 236 unique values
236 after removing duplicates
reading data/color.csv

Creating "ned" from "ned_time"

WARNING in ned -- confirming "5+ yrs" for: 91.0
WARNING in ned -- confirming "5+ yrs" for: 82.0
WARNING in ned -- confirming "5+ yrs" for: 68.0
WARNING in ned -- confirming "5+ yrs" for: 115.0
WARNING in ned -- confirming "5+ yrs" for: 121.0
WARNING in ned -- confirming "5+ yrs" for: 92.0
WARNING in ned -- confirming "5+ yrs" for: 94.0
WARNING in ned -- confirming "5+ yrs" for: 67.0
WARNING in ned -- confirming "5+ yrs" for: 81.0
WARNING in ned -- confirming "5+ yrs" for: 77.0
WARNING in ned -- confirming "5+ yrs" for: 76.0
WARNING in ned -- confirming "5+ yrs" for: 68.0
WARNING in ned -- confirming "5+ yrs" for: 68.0
WARNING in ned -- confirming "5+ yrs" for: 317.0
WARNING in ned -- confirming "5+ yrs" for: 125.0
WARNING in ned -- confirming "5+ yrs" for: 88.0
WARNING in ned -- confirming "5+ yrs" for: 168.0
WARNING in ned -- confirming "5+ yrs" for: 147.0
WARNING in ned -- confirming "5+ yrs" for: 311.0
WARNING in ned -- confirming "5+ yrs" for: 102.0
WARNING in ned -- confirming "5+ yrs" for: 126.0
WARNING in ned -- confirming "5+ yrs" for: 110.0
WARNING in ned -- confirming "5+ yrs" for: 66.0
WARNING in ned -- confirming "5+ yrs" for: 73.0
WARNING in ned -- confirming "5+ yrs" for: 80.0
WARNING in ned -- confirming "5+ yrs" for: 168.0
WARNING in ned -- confirming "5+ yrs" for: 117.0
WARNING in ned -- confirming "5+ yrs" for: 121.0
WARNING in ned -- confirming "5+ yrs" for: 119.0
WARNING in ned -- confirming "5+ yrs" for: 82.0
WARNING in ned -- confirming "5+ yrs" for: 79.0
WARNING in ned -- confirming "5+ yrs" for: 88.0
WARNING in ned -- confirming "5+ yrs" for: 74.0
WARNING in ned -- confirming "5+ yrs" for: 64.0
WARNING in ned -- confirming "5+ yrs" for: 92.0
Null values: 74 out of 236

Creating "age_group" from "age"

WARNING in age_group -- missing data -- returning None for: nan
Null values: 1 out of 236

Creating "cs_visit" from "cancer_state_visit"

WARNING in cs_visit, returning 2 for cancer_stage_visit: Responding
WARNING in cs_visit, returning 2 for cancer_stage_visit: Responding
WARNING in cs_visit, returning 2 for cancer_stage_visit: Local or regional recurrence/relapse
WARNING in cs_visit, returning 2 for cancer_stage_visit: Progressive disease
WARNING in cs_visit, returning 2 for cancer_stage_visit: 0, no evidence
WARNING in cs_visit, returning 2 for cancer_stage_visit: Progressive disease
WARNING in cs_visit, returning 2 for cancer_stage_visit: PSA rising
WARNING in cs_visit, returning 2 for cancer_stage_visit: 0
WARNING in cs_visit, returning 2 for cancer_stage_visit: Angioimmunoblastic T-cell lymphoma
WARNING in cs_visit, returning 2 for cancer_stage_visit: recurrent pilocytic astrocytoma s/p - chemotherapy  x2, now on observation
WARNING in cs_visit, returning 2 for cancer_stage_visit: Grade II oligodendroglioma
Null values: 128 out of 236

summary stats


-------- summary by question group ---------

236 entries processed on 2025-06-19 06:09 

           age   gender     ecog  ned_time       cs  cs_visit
count  235.000  233.000  235.000   162.000  190.000   108.000
mean    60.860    1.356    0.770    40.352    2.900     0.435
std     17.611    0.480    0.553    47.314    0.946     0.674
min     20.000    1.000    0.000     0.000    1.000     0.000
25%     49.500    1.000    0.000     8.250    2.000     0.000
50%     65.000    1.000    1.000    30.000    3.000     0.000
75%     74.000    2.000    1.000    51.500    4.000     1.000
max     89.000    2.000    2.000   317.000    4.000     2.000

-------- summary by question ---------

236 entries processed on 2025-06-19 06:09

             b1       b2       b3
count  236.000  235.000  236.000
mean     1.314    0.817    0.809
std      1.146    0.963    0.964
min      0.000    0.000    0.000
25%      0.000    0.000    0.000
50%      1.000    1.000    1.000
75%      2.000    1.000    1.000
max      4.000    4.000    4.000

             c1       c2       c3       c4       c5       c6       c7       c8       c9
count  236.000  234.000  232.000  234.000  234.000  228.000  228.000  228.000  229.000
mean     1.695    1.406    2.121    2.765    2.671    1.711    2.070    1.066    2.760
std      0.923    1.007    1.114    1.243    1.260    1.319    1.305    1.172    1.169
min      0.000    0.000    0.000    0.000    0.000    0.000    0.000    0.000    0.000
25%      1.000    1.000    1.000    2.000    2.000    0.000    1.000    0.000    2.000
50%      2.000    1.000    2.000    3.000    3.000    2.000    2.000    1.000    3.000
75%      2.000    2.000    3.000    4.000    4.000    3.000    3.000    2.000    4.000
max      4.000    4.000    4.000    4.000    4.000    4.000    4.000    4.000    4.000

             d1       d2       d3       d4       d5       d6       d7
count  231.000  232.000  230.000  230.000  227.000  233.000  233.000
mean     2.515    2.401    2.409    1.035    2.383    2.682    2.717
std      0.678    0.744    0.685    0.962    0.644    0.493    0.539
min      0.000    0.000    0.000    0.000    0.000    1.000    0.000
25%      2.000    2.000    2.000    0.000    2.000    2.000    3.000
50%      3.000    3.000    2.000    1.000    2.000    3.000    3.000
75%      3.000    3.000    3.000    2.000    3.000    3.000    3.000
max      3.000    3.000    3.000    3.000    3.000    3.000    3.000

             e1       e2       e3       e4       e5       e6       e7
count  227.000  227.000  227.000  228.000  228.000  228.000  228.000
mean     2.282    2.132    1.599    2.118    3.018    1.614    1.482
std      1.047    1.000    0.858    1.134    1.110    1.015    0.852
min      1.000    1.000    1.000    1.000    1.000    1.000    1.000
25%      1.000    1.000    1.000    1.000    2.000    1.000    1.000
50%      2.000    2.000    1.000    2.000    3.000    1.000    1.000
75%      3.000    3.000    2.000    3.000    4.000    2.000    2.000
max      4.000    4.000    4.000    4.000    4.000    4.000    4.000

tidy data: merged.shape=(6136, 17)
writing data/tidy.csv