This function divides a large data set into smaller chunks (groups).

pre_process(data, init_res_status_data = NULL, n_groups = 1)

Arguments

data

A data frame.

init_res_status_data

The raw initial residence status data, as a data frame.

n_groups

The number of groups required to be returned.
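
To illustrate what dividing data into n_groups chunks can look like, here is a minimal base R sketch. It assumes that all rows belonging to one personId should stay in the same chunk. The helper split_by_person is hypothetical and is not part of migrbc; the package's actual grouping logic may differ.

```r
# Hypothetical helper (not part of migrbc): split a journeys data frame
# into at most n_groups chunks, keeping each person's rows together.
split_by_person <- function(data, n_groups = 1) {
  ids <- sort(unique(data$personId))
  # Never ask for more chunks than there are people
  n_groups <- min(n_groups, length(ids))
  # Map each person to a chunk index in 1..n_groups
  chunk_of_id <- ceiling(seq_along(ids) * n_groups / length(ids))
  # Look up the chunk index for every row via its personId
  chunk_index <- chunk_of_id[match(data$personId, ids)]
  split(data, chunk_index)
}

# Small made-up input: 4 people with 2 journeys each
journeys <- data.frame(
  journeyId  = rep(1:2, times = 4),
  personId   = rep(1:4, each = 2),
  is_arrival = rep(c(1, 0), times = 4)
)

chunks <- split_by_person(journeys, n_groups = 2)
names(chunks)
sapply(chunks, nrow)
```

With this input, people 1 and 2 land in the first chunk and people 3 and 4 in the second, so no person's journey history is split across chunks.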

Value

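A list of class migrbc_preprocess containing the reformatted raw data. Each element corresponds to one group and holds a group_data data frame and an initial_status_group data frame.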
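
The returned list can be processed group by group. The sketch below uses a mock object built by hand, with made-up values and only two of the columns, so that it runs without migrbc installed. The element names group_data and initial_status_group and the class attribute follow the printed example output on this page.

```r
# Empty status table with the column names seen in the printed output
empty_status <- data.frame(
  res_status_initial = integer(0),
  personId           = integer(0),
  date_finalised     = character(0)
)

# Mock result (values are made up, for illustration only)
mock_result <- list(
  `1` = list(
    group_data = data.frame(journeyId = 1:2, personId = c(1, 1)),
    initial_status_group = empty_status
  ),
  `2` = list(
    group_data = data.frame(journeyId = 1:3, personId = c(2, 2, 2)),
    initial_status_group = empty_status
  )
)
class(mock_result) <- c("migrbc_preprocess", "list")

# Number of journeys in each group
rows_per_group <- vapply(mock_result, function(g) nrow(g$group_data), integer(1))

# Stack all groups back into a single data frame
all_journeys <- do.call(rbind, lapply(mock_result, `[[`, "group_data"))
```

The same pattern (lapply or vapply over the list elements) should apply to a real result, assuming it has the structure shown in the example output.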

Examples

## suppress log messages to the console
migrbc::initialize_logger(log_level = 1)
#> logger 'migrbc' has log_level=1 and writes to the console
number_of_people <- 10
person_data <- migrbc::setup_random_test_data(number_of_people,
                                              initial_date = '2001-01-01',
                                              numJourneys = 5,
                                              min = 0,
                                              max = 10)
crossings <- migrbc::pre_process(person_data, n_groups = 10)
crossings
#> $`1`
#> $`1`$group_data
#>   journeyId personId is_arrival date_crossing journey_sequence
#> 1         1        1          1    2001-01-01                1
#> 2         2        1          0    2001-01-02                2
#> 3         3        1          1    2001-01-07                3
#> 4         4        1          0    2001-01-16                4
#> 5         5        1          1    2001-01-25                5
#>   res_status_before res_status_after journeyId_prev
#> 1                 0                0              0
#> 2                 0                0              0
#> 3                 0                0              0
#> 4                 0                0              0
#> 5                 0                0              0
#>
#> $`1`$initial_status_group
#> [1] res_status_initial personId date_finalised
#> <0 rows> (or 0-length row.names)
#>
#>
#> $`2`
#> $`2`$group_data
#>    journeyId personId is_arrival date_crossing journey_sequence
#> 6          1        2          1    2001-01-01                1
#> 7          2        2          0    2001-01-03                2
#> 8          3        2          1    2001-01-06                3
#> 9          4        2          0    2001-01-13                4
#> 10         5        2          1    2001-01-21                5
#>    res_status_before res_status_after journeyId_prev
#> 6                  0                0              0
#> 7                  0                0              0
#> 8                  0                0              0
#> 9                  0                0              0
#> 10                 0                0              0
#>
#> $`2`$initial_status_group
#> [1] res_status_initial personId date_finalised
#> <0 rows> (or 0-length row.names)
#>
#>
#> $`3`
#> $`3`$group_data
#>    journeyId personId is_arrival date_crossing journey_sequence
#> 11         1        3          1    2001-01-01                1
#> 12         2        3          0    2001-01-03                2
#> 13         3        3          1    2001-01-05                3
#> 14         4        3          0    2001-01-08                4
#> 15         5        3          1    2001-01-18                5
#>    res_status_before res_status_after journeyId_prev
#> 11                 0                0              0
#> 12                 0                0              0
#> 13                 0                0              0
#> 14                 0                0              0
#> 15                 0                0              0
#>
#> $`3`$initial_status_group
#> [1] res_status_initial personId date_finalised
#> <0 rows> (or 0-length row.names)
#>
#>
#> $`4`
#> $`4`$group_data
#>    journeyId personId is_arrival date_crossing journey_sequence
#> 16         1        4          1    2001-01-01                1
#> 17         2        4          0    2001-01-03                2
#> 18         3        4          1    2001-01-06                3
#> 19         4        4          0    2001-01-11                4
#> 20         5        4          1    2001-01-17                5
#>    res_status_before res_status_after journeyId_prev
#> 16                 0                0              0
#> 17                 0                0              0
#> 18                 0                0              0
#> 19                 0                0              0
#> 20                 0                0              0
#>
#> $`4`$initial_status_group
#> [1] res_status_initial personId date_finalised
#> <0 rows> (or 0-length row.names)
#>
#>
#> $`5`
#> $`5`$group_data
#>    journeyId personId is_arrival date_crossing journey_sequence
#> 21         1        5          1    2001-01-01                1
#> 22         2        5          0    2001-01-01                2
#> 23         3        5          1    2001-01-03                3
#> 24         4        5          0    2001-01-08                4
#> 25         5        5          1    2001-01-16                5
#>    res_status_before res_status_after journeyId_prev
#> 21                 0                0              0
#> 22                 0                0              0
#> 23                 0                0              0
#> 24                 0                0              0
#> 25                 0                0              0
#>
#> $`5`$initial_status_group
#> [1] res_status_initial personId date_finalised
#> <0 rows> (or 0-length row.names)
#>
#>
#> $`6`
#> $`6`$group_data
#>    journeyId personId is_arrival date_crossing journey_sequence
#> 26         1        6          1    2001-01-01                1
#> 27         2        6          0    2001-01-03                2
#> 28         3        6          1    2001-01-08                3
#> 29         4        6          0    2001-01-14                4
#> 30         5        6          1    2001-01-22                5
#>    res_status_before res_status_after journeyId_prev
#> 26                 0                0              0
#> 27                 0                0              0
#> 28                 0                0              0
#> 29                 0                0              0
#> 30                 0                0              0
#>
#> $`6`$initial_status_group
#> [1] res_status_initial personId date_finalised
#> <0 rows> (or 0-length row.names)
#>
#>
#> $`7`
#> $`7`$group_data
#>    journeyId personId is_arrival date_crossing journey_sequence
#> 31         1        7          1    2001-01-01                1
#> 32         2        7          0    2001-01-01                2
#> 33         3        7          1    2001-01-03                3
#> 34         4        7          0    2001-01-07                4
#> 35         5        7          1    2001-01-16                5
#>    res_status_before res_status_after journeyId_prev
#> 31                 0                0              0
#> 32                 0                0              0
#> 33                 0                0              0
#> 34                 0                0              0
#> 35                 0                0              0
#>
#> $`7`$initial_status_group
#> [1] res_status_initial personId date_finalised
#> <0 rows> (or 0-length row.names)
#>
#>
#> $`8`
#> $`8`$group_data
#>    journeyId personId is_arrival date_crossing journey_sequence
#> 36         1        8          1    2001-01-01                1
#> 37         2        8          0    2001-01-02                2
#> 38         3        8          1    2001-01-04                3
#> 39         4        8          0    2001-01-10                4
#> 40         5        8          1    2001-01-18                5
#>    res_status_before res_status_after journeyId_prev
#> 36                 0                0              0
#> 37                 0                0              0
#> 38                 0                0              0
#> 39                 0                0              0
#> 40                 0                0              0
#>
#> $`8`$initial_status_group
#> [1] res_status_initial personId date_finalised
#> <0 rows> (or 0-length row.names)
#>
#>
#> $`9`
#> $`9`$group_data
#>    journeyId personId is_arrival date_crossing journey_sequence
#> 41         1        9          1    2001-01-01                1
#> 42         2        9          0    2001-01-08                2
#> 43         3        9          1    2001-01-16                3
#> 44         4        9          0    2001-01-25                4
#> 45         5        9          1    2001-02-03                5
#>    res_status_before res_status_after journeyId_prev
#> 41                 0                0              0
#> 42                 0                0              0
#> 43                 0                0              0
#> 44                 0                0              0
#> 45                 0                0              0
#>
#> $`9`$initial_status_group
#> [1] res_status_initial personId date_finalised
#> <0 rows> (or 0-length row.names)
#>
#>
#> $`10`
#> $`10`$group_data
#>    journeyId personId is_arrival date_crossing journey_sequence
#> 46         1       10          1    2001-01-01                1
#> 47         2       10          0    2001-01-06                2
#> 48         3       10          1    2001-01-12                3
#> 49         4       10          0    2001-01-21                4
#> 50         5       10          1    2001-01-31                5
#>    res_status_before res_status_after journeyId_prev
#> 46                 0                0              0
#> 47                 0                0              0
#> 48                 0                0              0
#> 49                 0                0              0
#> 50                 0                0              0
#>
#> $`10`$initial_status_group
#> [1] res_status_initial personId date_finalised
#> <0 rows> (or 0-length row.names)
#>
#>
#> attr(,"class")
#> [1] "migrbc_preprocess" "list"