i18N Inc. logo and banner i18N Inc. banner
|   Home    |   About Us    |   Contact Us    |   Français   |   
Services
Workshops
Overview
Agenda
Program
Events
Publications
Resources
Customers
Partners
Program   
 

Unicode, Multilingual Databases and Asian Character Sets
Establishing The Proper Foundation For Your Global Product
 

Welcome and Introduction

  • Who's who?
  • What do we want from this workshop?

Coded character sets: Past, Present and Future

  • Coded character sets
    • Controls vs. Graphics
    • Glyphs
  • A short history of character sets
    • Morse code
    • Baudot code
    • ISO 646, ISO 2022 and ISO 8859
    • Windows code pages
    • Shift-JIS
    • GBK
  • Unicode
    • Unicode 1.0
    • High-Level structure
    • Unicode 4.0

Unicode character set and standards

  • Overview
  • The Unicode character set
    • Notation
    • The 10 Unicode design principles
    • Special characters
    • Special non-characters
  • The Unicode standard
    • Elements of the standard
  • Unicode vs. ISO 10646
    • Main differences
    • Unicode conformance

Representing Unicode: choosing the proper form

  • Unicode encodings
    • UTF-32
    • UTF-16
    • UTF-8
    • CESU-8
  • Compression schemes
    • SCSU
    • BOCU-1
  • Normalization Forms

Unicode implementation

  • Reference i18n model
  • Transcoding
  • Text processing
    • Case handling
    • Case mapping, folding
  • Text boundaries
    • Grapheme clusters
    • Words
    • Lines
    • Sentences
  • Collation
  • Sorting and searching

Database issues

  • Overview
  • Unicode support
    • UTF-8
    • UTF-16
  • Multilingual schema design
    • Stringtables per column
    • Stringtables per table
    • Database global stringtable
  • Database migration to Unicode
    • Migration concerns
    • How to migrate

Asian character sets

  • GB 18030-2000
    • Background
    • Properties
    • Encoding
    • Conformance
  • HKSCS-2001
    • Background
    • Encoding
    • HKSCS & Unicode
    • HKSCS & Big-5
  • JIS 0213:2000
    • Background
    • Properties
    • Encoding
    • Conformance
  • Korean character sets
    • The Korean writing system
    • Jamo
    • Hangul syllables
    • Hangul - Implementation
    • KS X 1001 character set
    • KS X 1001 encoding
    • Microsoft code page 949
    • Unicode

  top of page
i18N Inc. © 2001-2010  |   Email: info@i18n.ca