This project has moved. For the latest updates, please go here.

<span> tag is not recognized

Jun 6, 2016 at 1:37 AM

I have html which contains the mixed language text, english and hindi. In html, Hindi text is shown as <Span lang="hi"> some hindi text </span>

When I parse the html using converter, it doesn't output anything for the tag and I am not able to identify the hindi text after parsing.

Is there any workaround or better way to achieve this.

Jun 15, 2016 at 7:55 AM

How do you read your html file?
I would like to investigate on the encoding used for reading your html snippet and also the encoding of your file itself...