hadoop - Revolution R: filter an xdf using another xdf
I'm trying to filter a large xdf file (sitting on Cloudera) using %in% to compare it against another xdf file (also sitting on Cloudera).
If I try:
rxDataStep(inData = largexdf, outFile = largexdf, overwrite = TRUE, transformObjects = list(param1 = parameter), rowSelection = (columnnamefromlargexdf %in% param1$1), numRows = -1)
I get an error regarding param1$1 saying that objects of class S4 cannot be subsetted. In other words, xdf data objects cannot be subsetted. Is there a way around this? I would rather not convert the xdf to a local csv file because it is large.
xdfs can be subsetted, but you can't overwrite an xdf on HDFS. You need to write to a different file. Also, if you are saying that parameter is an xdf, then the way you are using it in rowSelection is not correct.
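A minimal sketch of one way to apply both points, offered as an illustration rather than a tested solution. It assumes the comparison column of the smaller xdf fits in memory, that the read step runs in a local compute context (so rxDataStep without outFile returns a data frame), and that both files are reachable through RxHdfsFileSystem(). The paths and the column name key are hypothetical placeholders, not names from the question.

```r
library(RevoScaleR)

hdfs <- RxHdfsFileSystem()

# Hypothetical HDFS locations; substitute your own.
largeXdf    <- RxXdfData("/path/to/large",    fileSystem = hdfs)
lookupXdf   <- RxXdfData("/path/to/lookup",   fileSystem = hdfs)
filteredXdf <- RxXdfData("/path/to/filtered", fileSystem = hdfs)  # a different file from the input

# Read only the comparison column of the smaller xdf into memory.
# With no outFile, rxDataStep returns a data frame (assumes a local
# compute context and a column small enough to hold in RAM).
lookupDf <- rxDataStep(inData = lookupXdf, varsToKeep = "key")
keys <- unique(lookupDf$key)

# Pass the plain vector (not the xdf object) through transformObjects,
# and write the filtered rows to a new xdf instead of overwriting the input.
rxDataStep(inData = largeXdf,
           outFile = filteredXdf,
           overwrite = TRUE,
           transformObjects = list(keys = keys),
           rowSelection = key %in% keys)
```

The point of the design is that rowSelection is evaluated chunk by chunk against ordinary R objects, so whatever is handed over in transformObjects has to be a regular vector or data frame rather than an S4 data source. If the comparison column is too large to read into memory, this approach does not apply and a join such as rxMerge may be worth looking at instead.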