Tom Lane wrote:
> "Pavel Stehule" <pavel.stehule@gmail.com> writes:
>
>> I am writing a tsearch2 wrapper and testing its functionality. I found
>> something a little strange in the default parser: it can't parse tags
>> with numbers:
>>
>
> Well, the state machine definitely thinks that tag names should contain
> only ASCII letters (with possibly a leading or trailing '/'). Given the
> HTML examples I suppose we should allow non-first digits too. Is there
> anything else that should be considered a tag? What about dash and
> underscore for instance?
>
The docs say we specifically accept HTML tags. Are we really just
accepting any string of ASCII letters as the tag name? If so, we should
adjust the docs: <foo> and <foo1234> are not HTML tags.
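
To make the distinction concrete, here is a rough Python sketch of the two rules under discussion. It is only an illustration: the regexes and the names CURRENT_TAG, RELAXED_TAG and is_tag are made up for this mail, they ignore attributes, and they are not the parser's actual state machine.

```python
import re

# Hypothetical approximations for illustration, NOT the tsearch2 parser code.
# Rule as described above: ASCII letters only, optional leading or trailing '/'.
CURRENT_TAG = re.compile(r"</?[A-Za-z]+/?>")
# Proposed relaxation: digits allowed anywhere except the first character.
RELAXED_TAG = re.compile(r"</?[A-Za-z][A-Za-z0-9]*/?>")


def is_tag(pattern, text):
    """Return True if the whole string matches the given tag pattern."""
    return pattern.fullmatch(text) is not None


if __name__ == "__main__":
    # Print, for each sample, whether each rule would treat it as a tag.
    for sample in ["<b>", "</b>", "<br/>", "<h1>", "<foo>", "<foo1234>"]:
        print(sample, is_tag(CURRENT_TAG, sample), is_tag(RELAXED_TAG, sample))
```

Under either rule <foo> is treated as a tag, since neither one checks the name against the elements HTML actually defines.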

cheers
andrew