-
Notifications
You must be signed in to change notification settings - Fork 15
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Space toggle switch may be useful for Gurmukhi #98
Comments
The first comment in this issue contains text that will automatically appear in the Gurmukhi gap-analysis document as a subsection with the same title as this issue. Any edits made to that comment will be immediately available in the document. Proposals for changes or discussion of the content can be made in comments below this point. |
Is the expected behavior then that line breaks still occur at word boundaries, or can they occur at any aksara boundary? |
Order of priority
Word boundary, Sandhi vichchhed, Akshara boundary.
Sandhi is not even considered anywhere in the standards for Indic
languages. Moreover, some oversimplification efforts that are limited only
to the most popular Devanagari script has also got a lot of akshara
conjuncts broken in display by representing with Halant. That practically
introduces multiple akshara boundaries in the script whereas it is still an
unbreakable unit for a word in the language. This differentiation is lost
because the exact same is possible to generate by using ZWNJ.
In short, these can never be done correctly, no matter what is defined
within the existing standards.
…On Mon, Feb 17, 2020 at 1:45 PM Norbert Lindenberg ***@***.***> wrote:
Is the expected behavior then that line breaks still occur at word
boundaries, or can they occur at any aksara boundary?
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub
<#98?email_source=notifications&email_token=ABEELW54DKBMIPZFCIVCAJTRDJBSPA5CNFSM4KQFPIWKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEL5OJOA#issuecomment-586867896>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ABEELWYYDONS5GYCPGZ2FHLRDJBSPANCNFSM4KQFPIWA>
.
--
ବିବେକାନନ୍ଦ ପାଣୀ । विवेकानन्द पाणीVivekananda Pani
Research
+91-9449812397
[image: Address]
<https://www.google.com/maps/place/Reverie+Language+Technologies/@12.9213718,77.6642378,15z/data=!4m5!3m4!1s0x0:0x9ceafa1d8fa821f8!8m2!3d12.9213718!4d77.6642378>
<https://www.reverieinc.com/>
[image: facebook] <https://www.facebook.com/reverietech>
[image: twitter] <https://twitter.com/reverietech>
[image: linkedin]
<https://www.linkedin.com/company/reverie-language-technologies-pvt--ltd/>
--
The information contained in this e-mail message and/or attachments are
confidential or privileged information of Reverie Language Technologies
Pvt. Ltd. Unauthorized dissemination, use, review, distribution, printing
or copying of the information contained in this e-mail message and/or
attachments to it are strictly prohibited. If you have received this
communication in error, please notify us by reply e-mail or telephone and
immediately and permanently delete the message and any attachments.
|
@kulpreetchilana these were your words. Are you able to offer further details? |
Unclear to me what the space toggle switches from/to. ZWSP? Nothing? Something else? |
Certain older Gurmukhi texts do not contain space characters. This form is known as Larivaar and some website allow you to toggle it as a setting. It might make sense for this to be a transform property on the text, so that way we can preserve the words for line-breaking.
The text was updated successfully, but these errors were encountered: