Index Arguments in the Generic Sub-setting Methods

There are several types of arguments that can be used in the generic methods of 'squarebrackets' to specify the indices to perform operations on:

i, use: to specify interior (i.e. dimensionless) indices.
s, use: to specify subscripts of arbitrary dimensions in arrays.
row, col, use: to specify rows and columns in tabular objects (matrices and data.frames).
slice, use: to specify indices of one particular dimension (for arrays and data.frame-like objects).
Currently only used in the _icom methods.

For the fundamentals of indexing in 'squarebrackets', see squarebrackets_indx_fundamentals.
In this help page x refers to the object on which subset operations are performed.

Argument Pair `i, use`

[class: atomic vector]
[class: derived atomic vector]
[class: recursive vector]
[class: atomic array]
[class: recursive array]

i specifies the interior (or flat) indices.
use can be 1 or -1:
1 means the specified indices in i are to be used for the sub-set operation.
-1L means all indices except the indices in i are to be used for the sub-set operation.

Any of the following can be specified for argument i:

NULL or 0L, corresponds to missing argument.
a vector of length 0, in which case no indices are selected for the operation (i.e. empty selection).
a numeric vector of strictly positive whole numbers giving indices.
a logical vector, of the same length as x, giving the indices to select for the operation.
a character vector of index names.
If an object has multiple indices with the given name, ALL the corresponding indices will be selected for the operation.
a function that takes as input x, and returns a logical vector, giving the element indices to select for the operation.
For atomic objects, i is interpreted as i(x).
For recursive objects, i is interpreted as lapply(x, i).
a formula; see keywords.

Using the i arguments corresponds to doing something like the following:

 ii_x(x, i = i) # ==> x[i]   # if `x` is atomic
 ii_x(x, i = i) # ==> x[i]  # if `x` is recursive

If i is a function, it corresponds to the following:

 ii_x(x, i = i) # ==> x[i(x)] # if `x` is atomic
 ii_x(x, i = i) # ==> x[lapply(x, i)] # if `x` is recursive

Argument Pair `s, use`

[class: atomic array]
[class: recursive array]
The s, use argument pair, inspired by the abind::asub function from the 'abind' package, is the primary indexing argument for sub-set operations on dimensional objects.

The s argument specifies the subscripts (i.e. dimensional indices).
The absolute value of use argument gives the dimensions for which the s holds (i.e. use specifies the "non-missing" margins), and the sign (+ or -) of use specifies if the indices are to be selected or excluded, similar to the use argument used in combination with i (see previous section).

Specifically, the use argument can be any of the following:

a vector of length 0, which will be interpreted as "all subscripts are missing".
an integer vector;
the real scalar Inf, which will be interpreted as 1:dim(x);
the real scalar -Inf, which will be interpreted as -1:-ndim(x).

use is not allowed to have any duplicate values, nor is it allowed to include zero (0).
The default value for use is lazy evaluated 1:ndim(x).

s must be an atomic vector, a list of length 1, or a list of the same length as use.
If s is a list of length 1, it is internally recycled to become the same length as use.
If s is an atomic vector, it is internally treated as list(s), and (as with the previous case) recycled to become the same length as use.

Each element of s when s is a list, or s as a whole when s is atomic, can be any of the following:

NULL or 0L, which corresponds to a missing index argument.
a vector of length 0, in which case no indices are selected for the operation (i.e. empty selection).
A formula, with optional keywords, giving an expression to evaluate.
See keywords.
a numeric vector of strictly positive whole numbers with indices of the specified dimension to select for the operation.
a logical vector of the same length as the corresponding dimension size, giving the indices of the specified dimension to select for the operation.
a character vector giving the dimnames to select.
If a dimension has multiple indices with the given name, ALL the corresponding indices will be selected for the operation.
a formula; see keywords.

To keep the syntax short, the user can use the n function instead of list() to specify s.

EXAMPLES
Here are some examples for clarity, using an atomic array x of 3 dimensions:

ss_x(x, n(1:10, 1:5), c(1, -3)), extracts the first 10 rows, extracts all columns, and removes the first 5 layers, of array x.
ss_x(x, n(1:10, 0L, 1:5)) is the same.
ss_x(x, n(1:10), 2)
extracts the first 10 columns of array x.
ss_x(x, 1:10),
extracts the first 10 rows, columns, and layers of array x.
ss_x(x, 1:10, c(1, 3)),
extracts the first 10 rows, all columns, and the first 10 layers, of array x.

I.e.:


ss_x(x, n(1:10, 1:5), c(1, -3)) # ==> x[1:10, , -1:-5, drop = FALSE]

ss_x(x, n(1:10, 0L, 1:5))      # ==> x[1:10, , 1:5, drop = FALSE]

ss_x(x, 1:10, 2)               # ==> x[ , 1:10, , drop = FALSE]

ss_x(x, 1:10)                  # ==> x[1:10, 1:10, 1:10, drop = FALSE]

ss_x(x, 1:10, c(1, 3))         # ==> x[1:10, , 1:10, drop = FALSE]

For a brief explanation on the relationship between flat indices (i) and subscripts (s, use) in arrays, see ss2ii.

Arguments set `row, col, use`

[class: atomic matrix]
[class: recursive matrix]

Specifies rows and columns in a matrix. The argument row and col can each be any of the following:

NULL or 0L, which corresponds to a missing index argument.
a vector of length 0, in which case no indices are selected for the operation (i.e. empty selection).
a numeric vector of strictly positive whole numbers with dimension indices to select for the operation.
a logical vector of the same length as the corresponding dimension size, giving the dimension indices to select for the operation.
a character vector of index names.
If a dimension has multiple indices with the given name, ALL the corresponding indices will be selected for the operation.
a formula; see keywords.

For data.frames, col can also be a function.

use is set to 1:2 by default.
If use is c(-1, 2) or -1, the row indices will be inverted (i.e. select all rows except those specified in row).
If use is c(1, -2) or -2, the column indices will be inverted (i.e. select all columns except those specified in col).
If use is -1:-2, both the row- and column- indices will be inverted.
The order of use is irrelevant; i.e. c(-1, 2) is the same as c(2, -1).

Argument Pair `slice, use`

[class: atomic array]
[class: recursive array]
[class: data.frame-like]

Relevant only for the _icom methods.
The absolute value of use specifies the dimension on which argument slice is used.
The sign of use, just like in the previous argument sets, specifies whether to invert the indices or not.
I.e. if use = 1 , slice specifies which rows to select;
if use = -2, slice specifies which columns to not select;
etc.

The slice argument can be any of the following:

NULL or 0L, which corresponds to a missing index argument.
a numeric vector of strictly positive whole numbers with dimension indices to select for the operation.
a logical vector of the same length as the corresponding dimension size, giving the dimension indices to select for the operation.
a character vector of index names.
If a dimension has multiple indices with the given name, ALL the corresponding indices will be selected for the operation.
a formula; see keywords.

One could also give a vector of length 0 for slice;
Argument slice is only used in the _icom methods, and the results are meant to be used inside the regular [ and [<- operators.
Thus the effect of a zero-length index specification depends on the rule-set of [.class(x) and [<-.class(x).

All Missing Indices

NULL and 0L in the indexing arguments correspond to a missing argument.
For s, use, specifying use of length 0 also corresponds to all subscripts being missing.
Thus, for the _x methods, using missing indexing arguments for all indexing arguments corresponds to something like the following:

x[]

Similarly, for the _mod and _set methods, using missing or NULL indexing arguments corresponds to something like the following:


x[] <- rp # for replacement
x[] <- tf(x) # for transformation

The above is true even if sign(use) < 0.

Drop

Sub-setting with the generic methods from the 'squarebrackets' R-package using dimensional arguments (s, row, col, use) always use drop = FALSE.
To drop potentially redundant (i.e. single level) dimensions, use the drop function, like so:

 ss_x(x, s, use) |> drop() # ==> x[..., drop = TRUE]

References

Plate T, Heiberger R (2016). abind: Combine Multidimensional Arrays. R package version 1.4-5, https://CRAN.R-project.org/package=abind.

Argument Pair i, use

Argument Pair s, use

Arguments set row, col, use