9  Combine Data

9.1 Load Intermediate Data

We load the processed summaries from the previous steps.

9.2 Merge Datasets

We merge the datasets using erisim_numarasi (Case ID) as the key.

9.3 Calculate Intervals and Quality Control

We calculate the time intervals between key stages and apply quality control filters.

[1] "Total Merged Rows: 47759"
# A tibble: 6 × 21
  erisim_numarasi staining_complete_time first_assignment_after_staining
  <chr>           <dttm>                 <dttm>                         
1 10-25           2025-01-02 15:34:52    2025-01-02 16:00:23            
2 1000-25         2025-01-09 10:15:30    2025-01-09 10:21:32            
3 10000-25        2025-02-22 13:46:10    2025-02-22 14:06:12            
4 10001-25        2025-02-24 13:23:11    2025-02-24 13:57:21            
5 10008-25        2025-02-21 16:51:34    2025-02-22 11:44:40            
6 1001-25         2025-01-09 10:15:30    2025-01-09 10:22:04            
# ℹ 18 more variables: lab_finish_time <dttm>, used_fallback <lgl>,
#   description_local <chr>, scan_complete_time <dttm>, scan_start_time <dttm>,
#   slide_count_scanner <int>, scanner_name_log <chr>, copied_files <chr>,
#   min_pacs_upload_time <dttm>, max_pacs_upload_time <dttm>,
#   slide_count_pacs <int>, scanner_model_pacs <chr>, case_year <dbl>,
#   time_lab_to_scan <dbl>, time_scan_duration <dbl>, time_scan_to_pacs <dbl>,
#   time_total_turnaround <dbl>, turnaround_category <chr>

9.4 Save Merged Data

We save the final merged dataset to data/processed/merged_data_case_level.rds, as well as CSV and Excel formats.