hadoop - Error while using Revolution R to convert a csv file to an xdf using rxImport function -
i trying take csv file cloudera , convert xdf
i have tried following:
inputfile <- file.path("/user/...") #create column classes data set columnclasses <- c("character", "character", "character", "character", "character", "character", "character", "character", "character", "character", "character", "numeric", "character" ) names(columnclasses) <- paste("v", seq(1:13), sep = "") ##convert input csv file rxtextdata object textdata <- rxtextdata(file = inputfile, filesystem = hdfsfs, colclasses = columnclasses ) ##set chunk size chunk.size <- 250000 ##create output file location newxdf <- rxxdfdata("/user/...", filesystem = hdfsfs) rximport(indata = inputfile, outdata = newxdf, rowsperread = chunk.size, overwrite = true, numrows = -1)
when run following error:
error in rxuhandleclusterjobtryfailure(retobject, hpcserverjob, autocleanup, : error completing job on cluster: error in rxexecindatahadoop(callinfo, matchcall) : data source not have hdfs file system type.
note: have looked @ textdata
using functions rxgetinfo
, looks fine.
any insights why i'm getting error?
update: typo in rximport function.
actually ran:
rximport(indata = textdata, outdata = newxdf, rowsperread = chunk.size, overwrite = true, numrows = -1)
and got following error:
error in rxuhandleclusterjobtryfailure(retobject, hpcserverjob, autocleanup, : error completing job on cluster: error in rxcall("rx_importdatasource", params) :
Comments
Post a Comment