OCLC Support

Catalog using Armenian script

Discover how to catalog using Armenian script in Connexion client.

Overview

Use Armenian script data for cataloging items in languages that use the Armenian script (e.g., Eastern Armenian, Western Armenian). Use Armenian script data the same way you use other non-Latin script data in the client.

See Work with international records and Guidelines for contributing non-Latin script bibliographic records to WorldCat for details specific to non-Latin scripts. See also general procedures describing how to:

Tools for using non-Latin scripts

The client provides the following general tools to help you catalog using non-Latin scripts:

  • Link/unlink fields (Edit > Linking Fields > Link [or Unlink]) - Visually link non-Latin script data fields with equivalent romanized data fields.
  • Export options for data fields (Tools > Options > International) - Determine:
    • Whether to export both Latin-script-equivalent (romanized) data and non-Latin script data or only one or the other
    • Position of data if both
    • Sort order

     Caution: MARC-8 character verification (Edit > MARC-8 Characters > Verify) is not appropriate for verifying Armenian characters. There is no MARC-8 character set for Armenian. Using this command for Armenian results in marking all Armenian characters as invalid. The OCLC system validates Armenian characters when you validate a record.

See Use non-Latin scripts for cataloging bibliographic records for more specific procedures for working with these tools.

UTF-8 Unicode export and import required for Armenian records

Because Armenian script is not included in MARC-8 character sets, you must export and import records in Unicode format (settings are in Tools > Options > Export and in File > Import Records/Options button).

About Unicode

Unicode is the universal character encoding scheme for written characters and text. It defines a consistent way of encoding multi-script text that enables the exchange of text data internationally.

Unicode provides for three encoding forms: a 32-bit form (UTF-32), a 16-bit form (UTF-16), and an 8-bit form (UTF-8, designed for use with ASCII-based systems).
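As an illustration of the three encoding forms, the following minimal Python sketch encodes a single Armenian letter in each form and counts the resulting bytes. It is not part of the client; the function name encoded_sizes is hypothetical, and the big-endian codec variants are used only so that no byte order mark is added to the count.

```python
# Illustrative sketch only: compare the size of a string in each
# Unicode encoding form. The helper name is hypothetical.

def encoded_sizes(text: str) -> dict:
    """Return the number of bytes needed for `text` in each encoding form."""
    return {
        "utf-8": len(text.encode("utf-8")),
        # The "-be" variants fix the byte order so no BOM is prepended.
        "utf-16": len(text.encode("utf-16-be")),
        "utf-32": len(text.encode("utf-32-be")),
    }

armenian_small_da = "\u0564"  # ARMENIAN SMALL LETTER DA
print(encoded_sizes(armenian_small_da))
print(encoded_sizes("a"))  # an ASCII letter, for comparison
```

An ASCII letter occupies one byte in UTF-8, while a letter from the Armenian block occupies two, which is why UTF-8 suits ASCII-based systems that also need to carry non-Latin scripts.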

Connexion client began supporting Armenian script with Unicode version 6.2.0.

Armenian script entry and character sets

Script entry method

If your system default language is not Armenian, you can install Armenian as an input language in Windows, which provides a keyboard for entering Armenian script. See Get started and Input methods for languages that use non-Latin scripts.

Character set supported

Armenian characters are defined in Unicode 6.2 (coded in the range U+0530 to U+058F).
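If you process exported data outside the client, a range check like the following can flag text that contains Armenian characters and therefore cannot be represented in MARC-8. This is a minimal Python sketch, not a client feature; the names ARMENIAN_RANGE and contains_armenian are hypothetical, and the check covers only the U+0530 to U+058F range stated above, including any unassigned code points inside it.

```python
# Illustrative sketch only: detect characters in the Armenian range
# U+0530..U+058F. Names are hypothetical, not part of Connexion client.

ARMENIAN_RANGE = range(0x0530, 0x0590)  # upper bound is exclusive

def contains_armenian(text: str) -> bool:
    """Return True if any character in `text` falls in the Armenian range."""
    return any(ord(ch) in ARMENIAN_RANGE for ch in text)

print(contains_armenian("\u0540\u0561\u0575"))  # Armenian letters
print(contains_armenian("Armenia"))             # Latin letters only
```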

Script identifier in records

The client adds the following data to field 066 ‡c in Armenian records to indicate the presence of Armenian characters:

  • Armn

Romanized data

See the ALA-LC Romanization Table for Armenian on the Library of Congress website for more information.

Indexing for Armenian script searches

Notes on searching

  • Use word or phrase search indexes and browse indexes. 
  • Word searches find the data string you enter anywhere in the indexed field. Phrase searches find the data string starting with the first character in a field or subfield and including each character in exact order. Browsing scans an index for the closest match to the character string followed by any other data. 
  • If you use qualifiers to limit a search, type them in Latin script. 
  • Do not use derived searching. 
  • You can truncate searches (asterisk (*) at the end of a search term) or use browsing for automatic truncation (enter only as many characters as needed for a match, without using an asterisk). 
  • If you want to retrieve all Armenian script records or see sample records, use the Character Sets Present WorldCat search index (label vp:) with the assigned code arm.
    • To find all Armenian script records, enter vp:arm in the command line of the Search WorldCat window (Cataloging > Search > WorldCat).

       Note: If a search for all Armenian script records alone retrieves too many WorldCat records (limit 1,500 records), you must limit the search and try again (e.g., vp:arm/1991-; vp:arm and mt:bks; etc.).

See general procedures and search techniques for searching WorldCat.

Armenian character indexing specifics

Capital (uppercase) forms of Armenian letters are indexed the same as the corresponding small (lowercase) forms.

For example, Armenian capital letter da (Դ, U+0534) is indexed the same as Armenian small letter da (դ, U+0564).

Enter either the capital or the small form of an Armenian letter in a search to retrieve results for both.

Armenian modifier letters (Left half ring, Apostrophe, Emphasis mark, Exclamation mark, Comma and Question mark) are ignored in indexing.
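The effect of these indexing rules can be approximated with the Python sketch below. It is an assumption-laden illustration, not the OCLC indexing implementation: the name index_key is hypothetical, case folding relies on Python's built-in str.lower(), and the ignored characters are assumed to be the six code points U+0559 through U+055E, which correspond to the marks listed above.

```python
# Illustrative sketch only: approximate how a search term might be
# normalized under the rules above. Not the actual OCLC indexing code.

# Assumed code points for the ignored marks: left half ring (U+0559),
# apostrophe (U+055A), emphasis mark (U+055B), exclamation mark (U+055C),
# comma (U+055D), and question mark (U+055E).
IGNORED_MARKS = {chr(cp) for cp in range(0x0559, 0x055F)}

def index_key(term: str) -> str:
    """Fold capital letters to small letters and drop the ignored marks."""
    return "".join(ch for ch in term.lower() if ch not in IGNORED_MARKS)

capital_da = "\u0534"  # ARMENIAN CAPITAL LETTER DA
small_da = "\u0564"    # ARMENIAN SMALL LETTER DA
print(index_key(capital_da) == index_key(small_da))
print(index_key(small_da + "\u055B") == small_da)  # emphasis mark dropped
```

Under these assumptions, two search terms that differ only in letter case or in the presence of the listed marks produce the same key.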

 
