You can use the REPLACE function http://pig.apache.org/docs/r0.10.0/func.html#replace (it won't do it in one pass, though; you apply it field by field):
file1 = load 'your.csv' using PigStorage(',');
data = foreach file1 generate $0 as (f1:chararray), $1 as (f2:chararray), REPLACE($2, '\\"', '') as (f3:int), REPLACE($3, '\\"', '') as (f4:int);
You can also use a regular expression with REGEX_EXTRACT http://pig.apache.org/docs/r0.10.0/func.html#regex-extract :
file1 = load 'your.csv' using PigStorage(',');
data = foreach file1 generate $0, $1, REGEX_EXTRACT($2, '([0-9]+)', 1), REGEX_EXTRACT($3, '([0-9]+)', 1);
Of course, you can remove the " from f1 and f2 in the same way.
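As a rough sketch of that last point, the REPLACE variant might be extended to all four columns like this. The explicit (chararray) and (int) casts are an assumption added here to make the types unambiguous; they are not part of the snippets above. The file name and column positions are unchanged.
file1 = load 'your.csv' using PigStorage(',');
-- strip the double quotes from every column; cast the last two to int
data = foreach file1 generate REPLACE((chararray)$0, '\\"', '') as f1, REPLACE((chararray)$1, '\\"', '') as f2, (int)REPLACE((chararray)$2, '\\"', '') as f3, (int)REPLACE((chararray)$3, '\\"', '') as f4;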