[jira] [Commented] (TEXT-89) Add UTF-32 surrogate pairs support for WordUtils.initials()


JIRA jira@apache.org

    [ https://issues.apache.org/jira/browse/TEXT-89?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16083731#comment-16083731 ]

ASF GitHub Bot commented on TEXT-89:
------------------------------------

Github user ecki commented on the issue:

    https://github.com/apache/commons-text/pull/49
 
    Not sure I understand the question: surrogate pairs only exist in UTF-16. UTF-8 uses a multi-byte encoding for code points outside the BMP, and UTF-32 uses 4 bytes per code point (and skips the high/low surrogate regions).
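
    To make the distinction concrete, here is a minimal Java sketch that reports how much space a single code point takes in each encoding. The class and method names (EncodingSizes, utf16Units, utf8Bytes, utf32Bytes) are made up for illustration and are not part of Commons Text; only standard JDK calls are used.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper for illustration only; not part of Commons Text.
final class EncodingSizes {

    // Number of 16-bit char units Java (UTF-16) needs for the code point:
    // 1 inside the BMP, 2 (a surrogate pair) outside it.
    static int utf16Units(int codePoint) {
        return Character.charCount(codePoint);
    }

    // Number of bytes the code point occupies when encoded as UTF-8.
    static int utf8Bytes(int codePoint) {
        String s = new String(Character.toChars(codePoint));
        return s.getBytes(StandardCharsets.UTF_8).length;
    }

    // UTF-32 stores every code point in one fixed 32-bit unit, so there are
    // no surrogates; the size is computed from the code point count.
    static int utf32Bytes(int codePoint) {
        String s = new String(Character.toChars(codePoint));
        return s.codePointCount(0, s.length()) * Integer.BYTES;
    }

    public static void main(String[] args) {
        int grinningFace = 0x1F600; // a code point outside the BMP
        System.out.println("UTF-16 units: " + utf16Units(grinningFace));
        System.out.println("UTF-8 bytes:  " + utf8Bytes(grinningFace));
        System.out.println("UTF-32 bytes: " + utf32Bytes(grinningFace));
    }
}
```

    For U+1F600 this should report 2 UTF-16 units, 4 UTF-8 bytes and 4 UTF-32 bytes, while a BMP character such as 'A' needs only 1 UTF-16 unit.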


> Add UTF-32 surrogate pairs support for WordUtils.initials()
> -----------------------------------------------------------
>
>                 Key: TEXT-89
>                 URL: https://issues.apache.org/jira/browse/TEXT-89
>             Project: Commons Text
>          Issue Type: Improvement
>            Reporter: Arun Vinud
>            Priority: Minor
>
> The current implementation of WordUtils.initials() doesn't support UTF-32. Refactor the code to provide support using UTF-16 surrogate pairs. The proposed improvement should support characters outside the BMP.
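
A rough sketch of what code-point-aware initials could look like is given below. It is not the actual WordUtils.initials() implementation or the change proposed in the pull request; the class CodePointInitials is hypothetical, and it assumes that words are separated by whitespace only. The idea is to step through the string by code point rather than by char, so that a surrogate pair is never split.

```java
// Hypothetical sketch for illustration; not the Commons Text implementation.
final class CodePointInitials {

    // Returns the first code point of every whitespace-delimited word.
    static String initials(String str) {
        if (str == null) {
            return null;
        }
        StringBuilder out = new StringBuilder();
        boolean atWordStart = true;
        for (int i = 0; i < str.length(); ) {
            // Reads a full code point, combining a surrogate pair if present.
            int cp = str.codePointAt(i);
            if (Character.isWhitespace(cp)) {
                atWordStart = true;
            } else if (atWordStart) {
                // Appends one or two chars, depending on the code point.
                out.appendCodePoint(cp);
                atWordStart = false;
            }
            // Advance by 1 char for BMP code points, 2 for supplementary ones.
            i += Character.charCount(cp);
        }
        return out.toString();
    }
}
```

With this sketch, initials("Ben John Lee") gives "BJL", and a word that starts with a character outside the BMP contributes both halves of its surrogate pair to the result instead of a lone high surrogate.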



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)