[jira] [Created] (CSV-51) Support unicode line separators

classic Classic list List threaded Threaded
3 messages Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Created] (CSV-51) Support unicode line separators

Gary D. Gregory (Jira)
Support unicode line separators
-------------------------------

                 Key: CSV-51
                 URL: https://issues.apache.org/jira/browse/CSV-51
             Project: Commons CSV
          Issue Type: Improvement
          Components: Parser
            Reporter: Emmanuel Bourg
            Assignee: Emmanuel Bourg
            Priority: Minor
             Fix For: 1.x


CSVParser doesn't recognizes the Unicode characters U+2028 (LINE SEPARATOR), U+2029 (PARAGRAPH SEPARATOR) and U+0085 (NEXT LINE) as line separators.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       
Reply | Threaded
Open this post in threaded view
|

[jira] [Commented] (CSV-51) Support unicode line separators

Gary D. Gregory (Jira)

    [ https://issues.apache.org/jira/browse/CSV-51?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13228058#comment-13228058 ]

Sebb commented on CSV-51:
-------------------------

Would these be used as an alternative to CR/LF?
Or would both be operative at the same time?

Are there any example files that use such separators?
               

> Support unicode line separators
> -------------------------------
>
>                 Key: CSV-51
>                 URL: https://issues.apache.org/jira/browse/CSV-51
>             Project: Commons CSV
>          Issue Type: Improvement
>          Components: Parser
>            Reporter: Emmanuel Bourg
>            Priority: Minor
>             Fix For: 1.x
>
>
> CSVParser doesn't recognizes the Unicode characters U+2028 (LINE SEPARATOR), U+2029 (PARAGRAPH SEPARATOR) and U+0085 (NEXT LINE) as line separators.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       
Reply | Threaded
Open this post in threaded view
|

[jira] [Commented] (CSV-51) Support unicode line separators

Gary D. Gregory (Jira)
In reply to this post by Gary D. Gregory (Jira)

    [ https://issues.apache.org/jira/browse/CSV-51?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13228079#comment-13228079 ]

Emmanuel Bourg commented on CSV-51:
-----------------------------------

They would be used in addition to the existing line separators. The Scanner and Pattern classes in the JDK do recognize these separators for example, but not BufferedReader.
               

> Support unicode line separators
> -------------------------------
>
>                 Key: CSV-51
>                 URL: https://issues.apache.org/jira/browse/CSV-51
>             Project: Commons CSV
>          Issue Type: Improvement
>          Components: Parser
>            Reporter: Emmanuel Bourg
>            Priority: Minor
>             Fix For: 1.x
>
>
> CSVParser doesn't recognizes the Unicode characters U+2028 (LINE SEPARATOR), U+2029 (PARAGRAPH SEPARATOR) and U+0085 (NEXT LINE) as line separators.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira