Skip to main content
OCLC Support

Details from OCLC: Library of Congress scripts

Discover details and further information about issues displaying and validating certain characters in Library of Congress records.

The Library of Congress (LC) recently announced the expansion of non-Latin script input in bibliographic and authority records. OCLC is working closely with LC to ensure that records distributed to OCLC are correctly rendered in their original scripts. OCLC will roll out support for scripts as they are validated for use and as languages continue to be implemented through Spring 2026. 

Most scripts validate and export successfully. You may experience some minor character display irregularities (e.g., dotted circles, white rectangles, spacing issues) in certain interfaces. Five characters across two scripts do not validate successfully. Please review the Known Issues section below for additional details and recommended actions.

You should export records containing these scripts in UTF-8 format in Connexion 2.63, Connexion 3.1, WorldShare Record Manager, or WorldShare Collection Manager. MARC-8 export does not support these scripts.  

We are committed to full support of LC’s script expansion and will continue to work closely with LC. Our goal is to ensure that as libraries contribute records with scripts, they integrate seamlessly across OCLC’s systems.  

Language Support and Display Optimization

Record Manager and Connexion support numerous languages for cataloging. 

For optimal display in Connexion, we recommend using either Noto Sans or Nirmala UI font. Please review the recommendations below based on your Connexion client version and working language/script. 

Other fonts may provide a similar experience in Connexion, and OCLC's font recommendation does not suggest that other fonts would not work with a given language/script. 

Recommended font for Connexion 3.1 users for all languages/scripts: Noto Sans

Language Script Recommended font - Connexion 2.63
Hindi Devanagari Noto Sans
Marathi Devanagari Noto Sans
Sanskrit Devanagari Noto Sans
Nepali Devanagari Noto Sans
Prakrit Devanagari Noto Sans
Tamil Tamil Nirmala UI
Bengali Bengali Nirmala UI
Assamese Bengali Nirmala UI
Malayalam Malayalam Nirmala UI
Panjabi Gurmukhi Nirmala UI
Gujarati Guarti Nirmala UI
Sinhalese Sinhala Nirmala UI
Telugu Telugu Nirmala UI
Kannada Kannada Nirmala UI
Odia/Oriya Odia/Oriya Nirmala UI
Burmese Burmese Nirmala UI
Georgian Georgian Nirmala UI
Kurdish Arabic Nirmala UI

If you do not have Google Noto Sans, it is free to download. For additional information about installing Google Noto Sans please see Google Noto fonts for Windows.

Known Issues

OCLC has identified five characters that do not pass validation within Telugu and Sinhala. If you need to enter these characters into a bibliographic or authority record, please use the hex values provided.

Telugu - Visarga

Name: TELUGU SIGN VISARGA
Unicode: U+0C04
Hex value: ఼
Character: ఼
Title: తెలుగు-ఉర్దూ ఫ఼ారసీ పదకోశము
Where it appears: In ఖ఼ాన్ (author name) and ఫ఼ారసీ (Fārsī in title)

Telugu - Nukta

Name: TELEUGU SIGN NUKTA
Unicode: U+0C3C
Hex value: ఼
Character:
Title: తెలుగు-ఉర్దూ ఫ఼ారసీ పదకోశము
Where it appears: In ఖ఼ాన్ (author name) and ఫ఼ారసీ (Fārsī in title)

Telugu - Nakaara Pollu

Name: TELUGU LETTER NAKAARA POLLU
Unicode:  U+0C5D
Hex value: ౝ
Character:
Title: శ్రీ కృష్దేవమహారాయల ప్రభుత్వము
Where it appears: In కృష్దేవ (archaic spelling of Kṛṣṇadēva)

Telugu - Siddham

Name: TELEUGU SIGN SIDDHAM
Unicode: U+0C77
Hex value: ౷
Character:
Title: శ్రీ లలితా సహస్రనామ స్తోత్రం
Where it appears: At the end of the title as a sacred marker

Sinhala - Candrabindu

Name: SINHALA SIGN CANDRABINDU
Unicode: U+0D81
Hex value: ඁ
Character: ඁ
Title: සංස්කෘත-සිංහල ශබ්දකෝෂය
Where it appears: At the end of කෝෂය (kōṣayaṁ)