Merge Datasets
We merge the datasets using erisim_numarasi (Case ID) as the key.
Calculate Intervals and Quality Control
We calculate the time intervals between key stages and apply quality control filters.
[1] "Total Merged Rows: 47759"
# A tibble: 6 × 21
erisim_numarasi staining_complete_time first_assignment_after_staining
<chr> <dttm> <dttm>
1 10-25 2025-01-02 15:34:52 2025-01-02 16:00:23
2 1000-25 2025-01-09 10:15:30 2025-01-09 10:21:32
3 10000-25 2025-02-22 13:46:10 2025-02-22 14:06:12
4 10001-25 2025-02-24 13:23:11 2025-02-24 13:57:21
5 10008-25 2025-02-21 16:51:34 2025-02-22 11:44:40
6 1001-25 2025-01-09 10:15:30 2025-01-09 10:22:04
# ℹ 18 more variables: lab_finish_time <dttm>, used_fallback <lgl>,
# description_local <chr>, scan_complete_time <dttm>, scan_start_time <dttm>,
# slide_count_scanner <int>, scanner_name_log <chr>, copied_files <chr>,
# min_pacs_upload_time <dttm>, max_pacs_upload_time <dttm>,
# slide_count_pacs <int>, scanner_model_pacs <chr>, case_year <dbl>,
# time_lab_to_scan <dbl>, time_scan_duration <dbl>, time_scan_to_pacs <dbl>,
# time_total_turnaround <dbl>, turnaround_category <chr>
Save Merged Data
We save the final merged dataset to data/processed/merged_data_case_level.rds, as well as CSV and Excel formats.