-
Notifications
You must be signed in to change notification settings - Fork 0
How To
RProvider discovers the packages installed in your R installation and makes them available as packages under the RProvider root namespace. The actual package is lazily loaded the first time you access it.
Currently you need to load up a real R session, then install the package via install.packages, or the Packages/Install Packages... menu. You will then need to restart Visual Studio because the set of installed packages is cached inside the RProvider.
The most likely cause is that RProvider is using a different R installation from the one you updated. When you install R, you get the option to update the registry key HKEY_LOCAL_MACHINE\SOFTWARE\R-core to point to the version you are installing. This is what RProvider uses. If you are running in a 32-bit process, RProvider uses HKEY_LOCAL_MACHINE\SOFTWARE\R-core\R\InstallPath to determine the path. For 64-bit, it reads HKEY_LOCAL_MACHINE\SOFTWARE\R-core\R64\InstallPath. When you install a package in a given version of R, it should be available in both the 32-bit and 64-bit versions.
There are a couple of mismatches between allowed identifiers between R and F#:
It is pretty common in R to use a dot character in a name, because the character has no special meaning. We remap dots to underscore, and underscore to a double-underscore. So for example, data.frame() becomes R.data_frame().
Some package and function names are reserved words in F#. For these, you will need to quote them using double-backquotes. Typically, the IDE will do this for you. A good example is the base package, which will require an open statement where "base" is double-back-quoted.
R supports various kinds of parameters, which we try to map onto equivalent F# parameter types:
- All R formal parameters have names, and you can always pass their values either by name or positionally. If you pass by name, you can skip arguments in your actual argument list. We simply map these onto F# arguments, which you can also pass by name or positionally.
- In R, essentially all arguments are optional (even if no default value is specified in the function argument list). It's up to the receiving function to determine whether to error if the value is missing. So we make all arguments optional.
- R functions support ... (varargs/paramarray). We map this onto a .NET ParamArray, which allows an arbitrary number of arguments to be passed. However, there are a couple of kinks with this:
- R allows named arguments to appear after the ... argument, whereas .NET requires the ParamArray argument to be at the end. Some R functions use this convention because their primary arguments are passed in the ... argument and the named arguments will sometimes be used to modify the behavior of the function. From the RProvider you will to supply values for the positional arguments before you can pass to the ... argument. If you don't want to supply a value to one of these arguments, you can explicitly pass System.Reflection.Missing.
- Parameters passed to the R ... argument can also be passed using a name. Those names are accessible to the calling function. Example are list and dataframe construction (R.list, and R.data_frame). To pass arguments this way, you can use the overload of each function that takes an IDictionary<string, obj>, either directly, or using the namedParams function. For example:
R.data_frame(namedParams [ "A", [|1;2;3|]; "B", [|4;5;6|] ])
Since all arguments to functions are of type obj, it is not necessarily obvious what you can pass. Ultimately, you will need to know what the underlying function is expecting, but here is a table to help you. When reading this, remember that for most types, R supports only vector types. There are no scalar string, int, bool etc. types.
| R Type | F#/.NET Type |
|---|---|
| character | string or string[] |
| complex | System.Numerics.Complex or Complex[] |
| integer | int or int[] |
| logical | bool or bool[] |
| numeric | double or double[] |
| list | Call R.list, passing the values as separate arguments |
| dataframe | Call R.data_frame, passing column vectors in a dictionary |
R has some high-level functions (e.g. sapply) that require a function parameter. Although F# has first-class support of functional programming and provides better functionality and syntax for apply-like operations, which often makes it sub-optimal to call apply-like high-level functions in R, the need for parallel computing in R, which is not yet directly supported by F# parallelism to R functions, requires users to pass a function as parameter. Here is an example way to create and pass an R function:
let fun1 = R.eval(R.parse(text="function(i) {mean(rnorm(i))}"))
let nums = R.sapply(R.c(1,2,3),fun1)
The same usage also applies to parallel apply functions in parallel package.
Functions exposed by the RProvider return an instance of RDotNet.SymbolicExpression. This keeps all return data inside R data structures, so does not impose any data marshalling overhead. If you want to pass the value in as an argument to another R function, you can simply do so.
In order to access the result in .NET code, you have three routes:
RProvider adds a generic GetValue<> extension method to SymbolExpression. This supports conversions from certain R values to specific .NET types. Here are the currently supported conversions:
| R Type | Requested F#/.NET Type |
|---|---|
| character (when vector is length 1) | string |
| character | string[] |
| complex (when vector is length 1) | Complex |
| complex | Complex[] |
| integer (when vector is length 1) | int |
| integer | int[] |
| logical (when vector is length 1) | bool |
| logical | bool[] |
| numeric (when vector is length 1) | double |
| numeric | double[] |
Custom conversions can be supported through plugins.
We also expose an extension property called Value that performs a default conversion of a SymbolicExpresion to a .NET type. These are the current conversions:
| R Type | F#/.NET Type |
|---|---|
| character | string[] |
| complex | Complex[] |
| integer | int[] |
| logical | bool[] |
| numeric | double[] |
Again, custom conversions can be supported through plugins.
If there are no supported conversions, you can access the data through the RDotNet object model. RDotNet exposes properties, members and extension members (available only if you open the RDotNet namespace) that allow you to access the underlying data directly. So, for example:
let res = R.sum([|1;2;3;4|])
if res.Type = RDotNet.Internals.SymbolicExpressionType.IntegerVector then res.AsInteger().[0]
else failwithf "Expecting a Numeric but got a %A" res.Type
To make this easier, we have defined some active patterns, under the RProvider.Helpers namespace, which is auto-opened when you open the RProvider namespace. These combine the type tests and conversion. An equivalent example:
match R.sum([|1;2;3;4|]) with
| IntegerVector(iv) -> iv.[0]
| _ -> failwithf "Expecting a Numeric but got a %A" res.Type
If you believe the argument conversion is universally appropriate and should be available to everybody, please fork the repo and submit a pull request.
RProvider also supports custom conversions to/from your own data types using plugins.