Dear Learner,
We hope you are doing good.
I would request you to use the XMLloader.jar to read the XML file. Please download the same from the below link:
Suppose we have input data as:
<Document>
<name>Tejas</name>
</Document>
<name>Tejas</name>
</Document>
Now we can read the XML data using the below commands after going in the grunt shell:
register /home/edureka/Desktop/XMLloader.jar;
pigdata = load '/xml1.xml' USING XMLLoader('name') as (doc:chararray);
values = foreach pigdata GENERATE FLATTEN(REGEX_EXTRACT_ALL(doc,'<name>(.*)</name>')) AS (name:chararray);
dump values;
Please refer to the below screen shot of the output which we are getting at our end:
Please try this and let us know if you face any issue.
I will wait for your response.