Function for writing an HMDP model to binary files. The function defines sub-functions which can be used to define an HMDP model saved in a set of binary files.
Source:R/writer.R
binaryMDPWriter.RdBinary files are efficient for storing large models. Compared to the HMP (XML) format the binary files use less storage space and loads the model faster.
Usage
binaryMDPWriter(
prefix = "",
binNames = c("stateIdx.bin", "stateIdxLbl.bin", "actionIdx.bin", "actionIdxLbl.bin",
"actionWeight.bin", "actionWeightLbl.bin", "transProb.bin", "externalProcesses.bin",
"transWeight.bin", "transWeightLbl.bin"),
getLog = TRUE
)Details
The returned writer exposes these functions:
setWeights(labels, ...): sets the labels of the weights used in the actions.labelsis a vector of label names....is currently ignored. Call this before building the model.process(): starts a (sub)process. It may also be used to specify a traditional MDP using matrices inMDPtoolboxstyle. In that style,Pis a list of matrices, one per action, each of size$S x S$where$S$is the number of states. Each used row must sum to one, or all entries in a row must be zero if unused.Ris a matrix of size$S x A$, where$A$is the number of actions, andDis a matrix of size$S x A$with durations. IfDis omitted, all durations are assumed to be 1.endProcess(): ends a (sub)process.stage(label = NULL): starts a stage.labelis currently unused in the binary format.endStage(): ends a stage.state(label = NULL): starts a state and returns, invisibly, the state id. That id can later be referenced with scope 3.endState(): ends a state.action(scope = NULL, id = NULL, pr = NULL, prob = NULL, weights, transWeights = NULL, label = NULL, end = FALSE, ...): starts an action.weightsmust be a vector of action weights.transWeightsmust contain transition weights ordered by transition, with all transition weight labels for the first transition followed by all labels for the second transition, and so on. Transition probabilities can be entered in two ways:probcontains triples(scope, id, pr).idandprare vectors of equal length. Ifscopeis omitted, all scopes default to 1.
See the description of
actionIdx.binbelow. Ifend = TRUE, callingendAction()is not necessary....is currently ignored.endAction(): ends an action. Do not use this ifend = TRUEwas used when the action was specified.includeProcess(prefix, label = NULL, weights, prob, termStates, transWeights = NULL): includes an external process. External processes are loaded into memory only when needed, which helps with large models.prefixis the external process prefix.weightsmust be a vector of action weights, andprobmust contain triples(scope, idx, pr); see the description ofactionIdx.binbelow.termStatesmust specify the number of states at the last stage in the external process. Inside anincludeProcess ... endIncludeProcessblock, you must specify the father jump actions of the last stage in the external process. The external process is represented by its first and last stage together with its jump actions. The function returns, invisibly, the state ids of the first stage in the external process, which can later be referenced with scope 3.endIncludeProcess(): ends anincludeProcessblock.closeWriter(): closes the writer. Call this when the model description is finished.
Ten binary files are created:
stateIdx.bin: integers defining all states in the format"n0 s0 -1 n0 s0 a0 n1 s1 -1 n0 s0 a0 n1 s1 a1 n2 s2 -1 n0 s0 ...". Here-1indicates that a new state is considered.stateIdxLbl.bin: character data in the formatsIdx label sIdx label .... HeresIdxcorresponds to the index or line number instateIdxLbl.bin, starting from 0. No delimiter is used.actionIdx.bin: integers defining all actions in the formatsIdx scope idx scope idx scope idx -1 sIdx scope idx scope idx -1 sIdx scope -1 ....sIdxcorresponds to the index or line number instateIdx.bin, starting from 0. The following(scope, idx)pairs indicate possible transitions. Scope can take four values:2: a transition to a child process, at stage zero in the child process.1: a transition to the next stage in the current process.0: a transition to the next stage in the father process.3: a transition to a state specified by its statesIdx.
For example, if
scope = 1andidx = 2, the transition is to state number 3 at the next stage in the current process. Ifscope = 3andidx = 5, the transition is to the state specified at line 6 instateIdxLbl.bin. This is useful when considering shared child processes.actionIdxLbl.bin: character data in the formataIdx label aIdx label .... HereaIdxcorresponds to the index or line number inactionIdx.bin, starting from 0. No delimiter is used.actionWeight.bin: doubles containing action weights in the format"c1 c2 c3 c1 c2 c3 ...", assuming three weights for each action.actionWeightLbl.bin: character data containing the weight labels in the formatlabel1 label2 label3, assuming three weights for each action.transProb.bin: doubles containing transition probabilities defined inactionIdx.bin. The format is"p1 p2 p3 -1 p1 -1 p1 p2 -1 ...". Here-1indicates that a new action is considered.externalProcesses.bin: character data containing links to external processes in the formatstageStr prefix stageStr prefix .... HerestageStrcorresponds to the stage index, for examplen0 s0 a0 n1, of the stage corresponding to the first stage in the external process, andprefixis the external process prefix. No delimiter is used.transWeight.bin: doubles containing transition weights in the format"t11 t12 t21 t22 -1 ...", assuming two transition weights for each transition and two transitions in the first action.transWeightLbl.bin: character data containing the transition weight labels.
Examples
## Use temp dir
wd <- setwd(tempdir())
# Create a small HMDP with two levels
w<-binaryMDPWriter()
w$setWeights(c("Duration","Net reward","Items"))
w$process()
w$stage()
w$state(label="M0")
w$action(label="A0",weights=c(0,0,0),prob=c(2,0,1))
w$process()
w$stage()
w$state(label="D")
w$action(label="A0",weights=c(0,0,1),prob=c(1,0,0.5,1,1,0.5))
w$endAction()
w$endState()
w$endStage()
w$stage()
w$state(label="C0")
w$action(label="A0",weights=c(0,0,0),prob=c(1,0,1))
w$endAction()
w$action(label="A1",weights=c(1,2,1),prob=c(1,0,0.5,1,1,0.5))
w$endAction()
w$endState()
w$state(label="C1")
w$action(label="A0",weights=c(0,0,0),prob=c(1,0,1))
w$endAction()
w$action(label="A1",weights=c(1,2,1),prob=c(1,0,0.5,1,1,0.5))
w$endAction()
w$endState()
w$endStage()
w$stage()
w$state(label="C0")
w$action(label="A0",weights=c(1,4,0),prob=c(0,0,1))
w$endAction()
w$endState()
w$state(label="C1")
w$action(label="A0",weights=c(1,4,0),prob=c(0,0,1))
w$endAction()
w$endState()
w$endStage()
w$endProcess()
w$endAction()
w$action(label="A1",weights=c(0,0,0),prob=c(2,0,1))
w$process()
w$stage()
w$state(label="D")
w$action(label="A0",weights=c(0,0,1),prob=c(1,0,1))
w$endAction()
w$endState()
w$endStage()
w$stage()
w$state(label="C0")
w$action(label="A0",weights=c(0,0,0),prob=c(1,0,1))
w$endAction()
w$action(label="A1",weights=c(1,2,1),prob=c(1,0,0.5,1,1,0.5))
w$endAction()
w$endState()
w$endStage()
w$stage()
w$state(label="C0")
w$action(label="A0",weights=c(1,4,0),prob=c(0,0,1))
w$endAction()
w$endState()
w$state(label="C1")
w$action(label="A0",weights=c(1,4,0),prob=c(0,0,1))
w$endAction()
w$action(label="A1",weights=c(0,10,5),prob=c(0,0,0.5,0,1,0.5))
w$endAction()
w$endState()
w$endStage()
w$endProcess()
w$endAction()
w$endState()
w$state(label="M1")
w$action(label="A0",weights=c(0,0,0),prob=c(2,0,1))
w$process()
w$stage()
w$state(label="D")
w$action(label="A0",weights=c(0,0,1),prob=c(1,0,0.5,1,1,0.5))
w$endAction()
w$endState()
w$endStage()
w$stage()
w$state(label="C0")
w$action(label="A0",weights=c(0,0,0),prob=c(1,0,1))
w$endAction()
w$endState()
w$state(label="C1")
w$action(label="A0",weights=c(0,0,0),prob=c(1,0,1))
w$endAction()
w$endState()
w$endStage()
w$stage()
w$state(label="C0")
w$action(label="A0",weights=c(1,4,0),prob=c(0,0,1))
w$endAction()
w$endState()
w$state(label="C1")
w$action(label="A0",weights=c(1,4,0),prob=c(0,0,1))
w$endAction()
w$endState()
w$endStage()
w$endProcess()
w$endAction()
w$endState()
w$endStage()
w$endProcess()
w$closeWriter()
#>
#> Statistics:
#> states : 16
#> actions: 21
#> weights: 3
#>
#> Closing binary MDP writer.
#>
## Info about the binary files (don't have to load the model first)
getBinInfoStates()
#> # A tibble: 16 × 3
#> sId stageStr label
#> <dbl> <chr> <chr>
#> 1 0 0,0 M0
#> 2 1 0,0,0,0,0 D
#> 3 2 0,0,0,1,0 C0
#> 4 3 0,0,0,1,1 C1
#> 5 4 0,0,0,2,0 C0
#> 6 5 0,0,0,2,1 C1
#> 7 6 0,0,1,0,0 D
#> 8 7 0,0,1,1,0 C0
#> 9 8 0,0,1,2,0 C0
#> 10 9 0,0,1,2,1 C1
#> 11 10 0,1 M1
#> 12 11 0,1,0,0,0 D
#> 13 12 0,1,0,1,0 C0
#> 14 13 0,1,0,1,1 C1
#> 15 14 0,1,0,2,0 C0
#> 16 15 0,1,0,2,1 C1
getBinInfoActions()
#> # A tibble: 21 × 9
#> aId sId scope index pr Duration `Net reward` Items label
#> <dbl> <int> <chr> <chr> <chr> <dbl> <dbl> <dbl> <chr>
#> 1 0 0 2 0 1 0 0 0 A0
#> 2 1 1 1,1 0,1 0.5,0.5 0 0 1 A0
#> 3 2 2 1 0 1 0 0 0 A0
#> 4 3 2 1,1 0,1 0.5,0.5 1 2 1 A1
#> 5 4 3 1 0 1 0 0 0 A0
#> 6 5 3 1,1 0,1 0.5,0.5 1 2 1 A1
#> 7 6 4 0 0 1 1 4 0 A0
#> 8 7 5 0 0 1 1 4 0 A0
#> 9 8 0 2 0 1 0 0 0 A1
#> 10 9 6 1 0 1 0 0 1 A0
#> # ℹ 11 more rows
## reset working dir
setwd(wd)