[Rcpp-devel] Rcpp and C++ mangling - Rcpp-devel

Fri, Dec 21, 2012 3:12 AM #

hi,
a quick question regarding a Rcpp function I wrote using an external
library (gretl) : 
with the help of a makefile (adapted from the convolution Rcpp
example), the code compiles just fine and turns into a shared object
in linux (.so file)
however, when I then try to load the newly created shared library into
R using dyn.load, I get the following error message:
	dyn.load("/home/jean/Documents/code experiments/gretl/test2.so")
Error in dyn.load("/home/jean/Documents/code
experiments/gretl/test2.so") :    unable to load shared object
'/home/jean/Documents/code experiments/gretl/test2.so':  
/home/jean/Documents/code experiments/gretl/test2.so: undefined
symbol: _Z13kalman_smoothP7kalman_PP13gretl_matrix_S3_Pi
The undefined symbol is in fact (after c++ unmangling)
kalman_smooth(kalman_*, gretl_matrix_**, gretl_matrix_**, int*), a
function from the gretl library
Do you think the problem comes from the c++ mangling of gretl's  C
library or is it rather linked to internals of gretl ?
many thanks for your help!
jean
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.r-forge.r-project.org/pipermail/rcpp-devel/attachments/20121221/d05d9ca3/attachment.html>

Romain Francois

Fri, Dec 21, 2012 4:04 AM #

Are you linking against the gretl library ?


Le 21 d?c. 2012 ? 12:12, jean.p at hushmail.com a ?crit :

_______________________________________________
Rcpp-devel mailing list
Rcpp-devel at lists.r-forge.r-project.org
https://lists.r-forge.r-project.org/cgi-bin/mailman/listinfo/rcpp-devel

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.r-forge.r-project.org/pipermail/rcpp-devel/attachments/20121221/d26f109b/attachment.html>

Dirk Eddelbuettel

Fri, Dec 21, 2012 4:15 AM #

On 21 December 2012 at 12:12, jean.p at hushmail.com wrote:

| hi,
| 
| a quick question regarding a Rcpp function I wrote using an external library
| (gretl) : 
| 
| with the help of a makefile (adapted from the convolution Rcpp example), the
| code compiles just fine and turns into a shared object in linux (.so file)
| 
| however, when I then try to load the newly created shared library into R using
| dyn.load, I get the following error message:
| 
| 
| dyn.load("/home/jean/Documents/code experiments/gretl/test2.so")
| Error in dyn.load("/home/jean/Documents/code experiments/gretl/test2.so") :
|   unable to load shared object '/home/jean/Documents/code experiments/gretl/test2.so':
|   /home/jean/Documents/code experiments/gretl/test2.so: undefined symbol: _Z13kalman_smoothP7kalman_PP13gretl_matrix_S3_Pi
| 
| 
| The undefined symbol is in fact (after c++ unmangling) kalman_smooth(kalman_*,
| gretl_matrix_**, gretl_matrix_**, int*), a function from the gretl library
| 
| Do you think the problem comes from the c++ mangling of gretl's  C library or
| is it rather linked to internals of gretl ?

I suspect you are doing something wrong, that is a standard linker error. It
could be as easy as forgetting the extern "C" or something.

The name wrangling happens because it is after all C++ and not C..  Look at
the Rcpp + GSL examples for inspiration, or at other CRAN packages working
with third-party C libraries.  This works well.

| many thanks for your help!

We cannot help much more as you example is not reproducible.

Dirk

Dirk Eddelbuettel | edd at debian.org | http://dirk.eddelbuettel.com

jean.p at hushmail.com

Fri, Dec 21, 2012 4:26 AM #

yes I am linking to these gretl libraries but for some reason removed
the extern "C", which I shall now bring back!
I suspected it was something like that but just wanted to be sure
as always, thanks a lot!

On Friday, December 21, 2012 at 1:15 PM, "Dirk Eddelbuettel"  wrote:On

21 December 2012 at 12:12, jean.p at hushmail.com wrote:

| hi,
| 
| a quick question regarding a Rcpp function I wrote using an external
library
| (gretl) : 
| 
| with the help of a makefile (adapted from the convolution Rcpp
example), the
| code compiles just fine and turns into a shared object in linux (.so
file)
| 
| however, when I then try to load the newly created shared library
into R using
| dyn.load, I get the following error message:
| 
| 
| dyn.load("/home/jean/Documents/code experiments/gretl/test2.so")
| Error in dyn.load("/home/jean/Documents/code
experiments/gretl/test2.so") :
|   unable to load shared object '/home/jean/Documents/code
experiments/gretl/test2.so':
|   /home/jean/Documents/code experiments/gretl/test2.so: undefined
symbol: _Z13kalman_smoothP7kalman_PP13gretl_matrix_S3_Pi
| 
| 
| The undefined symbol is in fact (after c++ unmangling)
kalman_smooth(kalman_*,
| gretl_matrix_**, gretl_matrix_**, int*), a function from the gretl
library
| 
| Do you think the problem comes from the c++ mangling of gretl's  C
library or
| is it rather linked to internals of gretl ?

I suspect you are doing something wrong, that is a standard linker
error. It
could be as easy as forgetting the extern "C" or something.

The name wrangling happens because it is after all C++ and not C.. 
Look at
the Rcpp + GSL examples for inspiration, or at other CRAN packages
working
with third-party C libraries.  This works well.

| many thanks for your help!

We cannot help much more as you example is not reproducible.

Dirk

Dirk Eddelbuettel | edd at debian.org | http://dirk.eddelbuettel.com
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.r-forge.r-project.org/pipermail/rcpp-devel/attachments/20121221/ed092874/attachment.html>

Nick Matzke

Fri, Dec 21, 2012 4:16 PM #

Hi,

I'm working on a problem that involves calculating a very 
large, sparse transition probability matrix, P.  It takes 
significant time to run through the whole matrix, check all 
the transitions, and put in the appropriate probabilities, 
based on multiplying parameter values j, d, e, etc.

Each time I update j, or d, or e, I would rather not go 
re-process which cells of P get j, j*d, d*e, etc.  It would 
be better to just do this once, with j, d, and e each just 
referring to a memory address.  I could then update the 
values at that memory address and re-use the matrix P 
without re-building it again and again.

Currently, my script builds P via an Rcpp function, then 
passes it to R, where it gets used (with the values 
calculated) a few hundred times (via another Rcpp function); 
then j, d, and e are updated and I have to re-calculate P.

So, my question is, is there a way to make P, currently a 
vector of float vectors, into a vector of float pointers, 
which can be passed to R and then back to Rcpp?  Or is this 
just ridiculous?

Cheers,
Nick

====================================================
Nicholas J. Matzke
Ph.D. Candidate, Graduate Student Researcher

Huelsenbeck Lab
Center for Theoretical Evolutionary Genomics
4151 VLSB (Valley Life Sciences Building)
Department of Integrative Biology
University of California, Berkeley

Graduate Student Instructor, IB200B
Principles of Phylogenetics: Ecology and Evolution
http://ib.berkeley.edu/courses/ib200b/
http://phylo.wikidot.com/


Lab websites:
http://ib.berkeley.edu/people/lab_detail.php?lab=54
http://fisher.berkeley.edu/cteg/hlab.html
Dept. personal page: 
http://ib.berkeley.edu/people/students/person_detail.php?person=370
Lab personal page: 
http://fisher.berkeley.edu/cteg/members/matzke.html
Lab phone: 510-643-6299
Dept. fax: 510-643-6264

Cell phone: 510-301-0179
Email: matzke at berkeley.edu

Mailing address:
Department of Integrative Biology
1005 Valley Life Sciences Building #3140
Berkeley, CA 94720-3140

-----------------------------------------------------
"[W]hen people thought the earth was flat, they were wrong. 
When people thought the earth was spherical, they were 
wrong. But if you think that thinking the earth is spherical 
is just as wrong as thinking the earth is flat, then your 
view is wronger than both of them put together."

Isaac Asimov (1989). "The Relativity of Wrong." The 
Skeptical Inquirer, 14(1), 35-44. Fall 1989.
http://chem.tufts.edu/AnswersInScience/RelativityofWrong.htm
====================================================

Dirk Eddelbuettel

Fri, Dec 21, 2012 4:40 PM #

Nick,

On 21 December 2012 at 16:16, Nick Matzke wrote:

| I'm working on a problem that involves calculating a very 
| large, sparse transition probability matrix, P.  It takes 
| significant time to run through the whole matrix, check all 
| the transitions, and put in the appropriate probabilities, 
| based on multiplying parameter values j, d, e, etc.
| 
| Each time I update j, or d, or e, I would rather not go 
| re-process which cells of P get j, j*d, d*e, etc.  It would 
| be better to just do this once, with j, d, and e each just 
| referring to a memory address.  I could then update the 
| values at that memory address and re-use the matrix P 
| without re-building it again and again.
| 
| Currently, my script builds P via an Rcpp function, then 
| passes it to R, where it gets used (with the values 
| calculated) a few hundred times (via another Rcpp function); 
| then j, d, and e are updated and I have to re-calculate P.
| 
| So, my question is, is there a way to make P, currently a 
| vector of float vectors, into a vector of float pointers, 
| which can be passed to R and then back to Rcpp?  Or is this 
| just ridiculous?

There will be lots of different ways to skin this cat. Sparse matrices may be
one, I have no experience there. They now exist in Armadillo, though without
[at current] glue code for as<> and wrap in RcppArmadillo; this does exist in
Eigen and RcppEigen. Others may help you on sparse stuff.

You can also define you own structure to hold those vectors, and then simply
pass one single pointer around using the Rcpp::XPtr wrapper class for R's
external pointer. If data volume is an issue, external pointers are your
friends. (Some of) the database class package uses this, and as does the
bigmemory familiy of packages.  

But you should do some measuring and profiling before you go off and
rearchitect your application based on a hunch.  Great slide deck seen today
and retweeted:  

  The only good intuition: 'I should time time this' 
  -- Andrei Alexandrescu

  Via http://isocpp.org/blog/2012/12/three-optimization-tips-alexandrescu

and I think he has that exactly right.  And he has about a gazillion times as
much street cred on C++ as I do ....

Dirk

PS You need to work on your signature. Not sure it's long and detailed enough.

Dirk Eddelbuettel | edd at debian.org | http://dirk.eddelbuettel.com

Søren Højsgaard

Sat, Dec 22, 2012 9:47 AM #

Dear all,

To supplement Dirk's remarks about using RcppEigen in connection with sparse matrices: Here is an example where I use RcppEigen to moralized a directed acyclic graph represented by an adjacency matrix which is sparse (a dgCMatrix from R's Matrix package). The code can actually be made more efficient by replacing the for-loops by a different type of iterator which ensures that only non-zero entries are visited. The "triplet-trick" is the "recommended" way of building up a sparse matrix - as far as I can tell. There may be many other possible improvents which I am unaware of (as I don't know c++).

Best regards
S?ren


RcppExport SEXP C_moralizeM ( SEXP XX_){
  using Eigen::Map;
  using namespace Rcpp;
  typedef Eigen::MappedSparseMatrix<double> MSpMat;
  typedef Eigen::SparseMatrix<double> SpMat;
  SpMat   X(as<MSpMat>(XX_));
  
  typedef Eigen::Triplet<double> T;
  std::vector<T> triplets;
  triplets.reserve(X.nonZeros() * 2);
  
  int nrX(X.rows());
  int kk, ll, vv;
  for (vv=0; vv<nrX; vv++){ /* consider vertex vv */
    for (kk=0; kk<nrX; kk++){
      if (X.coeff(kk, vv) != 0){     /* yes, kk->vv */
	for (ll=kk+1; ll<nrX; ll++){
	  if (X.coeff(ll, vv) != 0){ /* yes, ll->vv */
	    if ((X.coeff(kk, ll)==0) && (X.coeff(ll, kk)==0)){ /* kk not~ ll */
	      triplets.push_back(T(kk, ll, 1));
	      triplets.push_back(T(ll, kk, 1));
	    }
	  }
	}
      }
    }
  }
  
  SpMat ans(X.rows(), X.cols());
  ans.setFromTriplets(triplets.begin(), triplets.end());
  SpMat Xt(X.transpose());
  ans = ans + Xt + X;
  
  for (kk=0; kk<nrX; kk++){
    for (ll=kk+1; ll<nrX; ll++){
      if (ans.coeff(kk,ll)!=0){
	ans.coeffRef(kk,ll)=1;
	ans.coeffRef(ll,kk)=1;
      }
    }
  }
  ans.makeCompressed();
  return(wrap(ans));
}

-----Original Message-----
From: rcpp-devel-bounces at lists.r-forge.r-project.org [mailto:rcpp-devel-bounces at lists.r-forge.r-project.org] On Behalf Of Dirk Eddelbuettel
Sent: 22. december 2012 01:41
To: matzke at berkeley.edu
Cc: rcpp-devel at lists.r-forge.r-project.org
Subject: Re: [Rcpp-devel] Is there a way to pass references to R


Nick,

On 21 December 2012 at 16:16, Nick Matzke wrote:

| I'm working on a problem that involves calculating a very large, 
| sparse transition probability matrix, P.  It takes significant time to 
| run through the whole matrix, check all the transitions, and put in 
| the appropriate probabilities, based on multiplying parameter values 
| j, d, e, etc.
| 
| Each time I update j, or d, or e, I would rather not go re-process 
| which cells of P get j, j*d, d*e, etc.  It would be better to just do 
| this once, with j, d, and e each just referring to a memory address.  
| I could then update the values at that memory address and re-use the 
| matrix P without re-building it again and again.
| 
| Currently, my script builds P via an Rcpp function, then passes it to 
| R, where it gets used (with the values
| calculated) a few hundred times (via another Rcpp function); then j, 
| d, and e are updated and I have to re-calculate P.
| 
| So, my question is, is there a way to make P, currently a vector of 
| float vectors, into a vector of float pointers, which can be passed to 
| R and then back to Rcpp?  Or is this just ridiculous?

There will be lots of different ways to skin this cat. Sparse matrices may be one, I have no experience there. They now exist in Armadillo, though without [at current] glue code for as<> and wrap in RcppArmadillo; this does exist in Eigen and RcppEigen. Others may help you on sparse stuff.

You can also define you own structure to hold those vectors, and then simply pass one single pointer around using the Rcpp::XPtr wrapper class for R's external pointer. If data volume is an issue, external pointers are your friends. (Some of) the database class package uses this, and as does the bigmemory familiy of packages.  

But you should do some measuring and profiling before you go off and rearchitect your application based on a hunch.  Great slide deck seen today and retweeted:  

  The only good intuition: 'I should time time this' 
  -- Andrei Alexandrescu

  Via http://isocpp.org/blog/2012/12/three-optimization-tips-alexandrescu

and I think he has that exactly right.  And he has about a gazillion times as much street cred on C++ as I do ....

Dirk

PS You need to work on your signature. Not sure it's long and detailed enough.

--
Dirk Eddelbuettel | edd at debian.org | http://dirk.eddelbuettel.com _______________________________________________
Rcpp-devel mailing list
Rcpp-devel at lists.r-forge.r-project.org
https://lists.r-forge.r-project.org/cgi-bin/mailman/listinfo/rcpp-devel