a746898dff
libstdc++-v3: 2020-12-25 Gerald Pfeifer <gerald@pfeifer.com> * doc/xml/manual/messages.xml: Update link to Java docs. * doc/html/manual/facets.html: Ditto.
569 lines
16 KiB
XML
569 lines
16 KiB
XML
<section xmlns="http://docbook.org/ns/docbook" version="5.0"
|
|
xml:id="std.localization.facet.messages" xreflabel="Messages">
|
|
<?dbhtml filename="messages.html"?>
|
|
|
|
<info><title>messages</title>
|
|
<keywordset>
|
|
<keyword>ISO C++</keyword>
|
|
<keyword>messages</keyword>
|
|
</keywordset>
|
|
</info>
|
|
|
|
|
|
|
|
<para>
|
|
The <classname>std::messages</classname> facet implements message retrieval functionality
|
|
equivalent to Java's <classname>java.text.MessageFormat</classname> using either GNU <function>gettext</function>
|
|
or IEEE 1003.1-200 functions.
|
|
</para>
|
|
|
|
<section xml:id="facet.messages.req"><info><title>Requirements</title></info>
|
|
|
|
|
|
<para>
|
|
The <classname>std::messages</classname> facet is probably the most vaguely defined facet in
|
|
the standard library. It's assumed that this facility was built into
|
|
the standard library in order to convert string literals from one
|
|
locale to the other. For instance, converting the "C" locale's
|
|
<code>const char* c = "please"</code> to a German-localized <code>"bitte"</code>
|
|
during program execution.
|
|
</para>
|
|
|
|
<blockquote>
|
|
<para>
|
|
22.2.7.1 - Template class messages [lib.locale.messages]
|
|
</para>
|
|
</blockquote>
|
|
|
|
<para>
|
|
This class has three public member functions, which directly
|
|
correspond to three protected virtual member functions.
|
|
</para>
|
|
|
|
<para>
|
|
The public member functions are:
|
|
</para>
|
|
|
|
<para>
|
|
<code>catalog open(const string&, const locale&) const</code>
|
|
</para>
|
|
|
|
<para>
|
|
<code>string_type get(catalog, int, int, const string_type&) const</code>
|
|
</para>
|
|
|
|
<para>
|
|
<code>void close(catalog) const</code>
|
|
</para>
|
|
|
|
<para>
|
|
While the virtual functions are:
|
|
</para>
|
|
|
|
<para>
|
|
<code>catalog do_open(const string& name, const locale& loc) const</code>
|
|
</para>
|
|
<blockquote>
|
|
<para>
|
|
<emphasis>
|
|
-1- Returns: A value that may be passed to <code>get()</code> to retrieve a
|
|
message, from the message catalog identified by the string <code>name</code>
|
|
according to an implementation-defined mapping. The result can be used
|
|
until it is passed to <code>close()</code>. Returns a value less than 0 if no such
|
|
catalog can be opened.
|
|
</emphasis>
|
|
</para>
|
|
</blockquote>
|
|
|
|
<para>
|
|
<code>string_type do_get(catalog cat, int set , int msgid, const string_type& dfault) const</code>
|
|
</para>
|
|
<blockquote>
|
|
<para>
|
|
<emphasis>
|
|
-3- Requires: A catalog <code>cat</code> obtained from <code>open()</code> and not yet closed.
|
|
-4- Returns: A message identified by arguments <code>set</code>, <code>msgid</code>, and <code>dfault</code>,
|
|
according to an implementation-defined mapping. If no such message can
|
|
be found, returns <code>dfault</code>.
|
|
</emphasis>
|
|
</para>
|
|
</blockquote>
|
|
|
|
<para>
|
|
<code>void do_close(catalog cat) const</code>
|
|
</para>
|
|
<blockquote>
|
|
<para>
|
|
<emphasis>
|
|
-5- Requires: A catalog cat obtained from <code>open()</code> and not yet closed.
|
|
-6- Effects: Releases unspecified resources associated with <code>cat</code>.
|
|
-7- Notes: The limit on such resources, if any, is implementation-defined.
|
|
</emphasis>
|
|
</para>
|
|
</blockquote>
|
|
|
|
|
|
</section>
|
|
|
|
<section xml:id="facet.messages.design"><info><title>Design</title></info>
|
|
|
|
|
|
<para>
|
|
A couple of notes on the standard.
|
|
</para>
|
|
|
|
<para>
|
|
First, why is <code>messages_base::catalog</code> specified as a typedef
|
|
to int? This makes sense for implementations that use
|
|
<code>catopen</code> and define <code>nl_catd</code> as int, but not for
|
|
others. Fortunately, it's not heavily used and so only a minor irritant.
|
|
This has been reported as a possible defect in the standard (LWG 2028).
|
|
</para>
|
|
|
|
<para>
|
|
Second, by making the member functions <code>const</code>, it is
|
|
impossible to save state in them. Thus, storing away information used
|
|
in the 'open' member function for use in 'get' is impossible. This is
|
|
unfortunate.
|
|
</para>
|
|
|
|
<para>
|
|
The 'open' member function in particular seems to be oddly
|
|
designed. The signature seems quite peculiar. Why specify a <code>const
|
|
string& </code> argument, for instance, instead of just <code>const
|
|
char*</code>? Or, why specify a <code>const locale&</code> argument that is
|
|
to be used in the 'get' member function? How, exactly, is this locale
|
|
argument useful? What was the intent? It might make sense if a locale
|
|
argument was associated with a given default message string in the
|
|
'open' member function, for instance. Quite murky and unclear, on
|
|
reflection.
|
|
</para>
|
|
|
|
<para>
|
|
Lastly, it seems odd that messages, which explicitly require code
|
|
conversion, don't use the codecvt facet. Because the messages facet
|
|
has only one template parameter, it is assumed that ctype, and not
|
|
codecvt, is to be used to convert between character sets.
|
|
</para>
|
|
|
|
<para>
|
|
It is implicitly assumed that the locale for the default message
|
|
string in 'get' is in the "C" locale. Thus, all source code is assumed
|
|
to be written in English, so translations are always from "en_US" to
|
|
other, explicitly named locales.
|
|
</para>
|
|
|
|
</section>
|
|
|
|
<section xml:id="facet.messages.impl"><info><title>Implementation</title></info>
|
|
|
|
|
|
<section xml:id="messages.impl.models"><info><title>Models</title></info>
|
|
|
|
<para>
|
|
This is a relatively simple class, on the face of it. The standard
|
|
specifies very little in concrete terms, so generic
|
|
implementations that are conforming yet do very little are the
|
|
norm. Adding functionality that would be useful to programmers and
|
|
comparable to Java's java.text.MessageFormat takes a bit of work,
|
|
and is highly dependent on the capabilities of the underlying
|
|
operating system.
|
|
</para>
|
|
|
|
<para>
|
|
Three different mechanisms have been provided, selectable via
|
|
configure flags:
|
|
</para>
|
|
|
|
<itemizedlist>
|
|
<listitem>
|
|
<para>
|
|
generic
|
|
</para>
|
|
<para>
|
|
This model does very little, and is what is used by default.
|
|
</para>
|
|
</listitem>
|
|
|
|
<listitem>
|
|
<para>
|
|
gnu
|
|
</para>
|
|
<para>
|
|
The gnu model is complete and fully tested. It's based on the
|
|
GNU gettext package, which is part of glibc. It uses the
|
|
functions <code>textdomain, bindtextdomain, gettext</code> to
|
|
implement full functionality. Creating message catalogs is a
|
|
relatively straight-forward process and is lightly documented
|
|
below, and fully documented in gettext's distributed
|
|
documentation.
|
|
</para>
|
|
</listitem>
|
|
|
|
<listitem>
|
|
<para>
|
|
ieee_1003.1-200x
|
|
</para>
|
|
<para>
|
|
This is a complete, though untested, implementation based on
|
|
the IEEE standard. The functions <code>catopen, catgets,
|
|
catclose</code> are used to retrieve locale-specific messages
|
|
given the appropriate message catalogs that have been
|
|
constructed for their use. Note, the script <code>
|
|
po2msg.sed</code> that is part of the gettext distribution can
|
|
convert gettext catalogs into catalogs that
|
|
<code>catopen</code> can use.
|
|
</para>
|
|
</listitem>
|
|
</itemizedlist>
|
|
|
|
<para>
|
|
A new, standards-conformant non-virtual member function signature was
|
|
added for 'open' so that a directory could be specified with a given
|
|
message catalog. This simplifies calling conventions for the gnu
|
|
model.
|
|
</para>
|
|
|
|
</section>
|
|
|
|
<section xml:id="messages.impl.gnu"><info><title>The GNU Model</title></info>
|
|
|
|
|
|
<para>
|
|
The messages facet, because it is retrieving and converting
|
|
between characters sets, depends on the ctype and perhaps the
|
|
codecvt facet in a given locale. In addition, underlying "C"
|
|
library locale support is necessary for more than just the
|
|
<code>LC_MESSAGES</code> mask: <code>LC_CTYPE</code> is also
|
|
necessary. To avoid any unpleasantness, all bits of the "C" mask
|
|
(i.e. <code>LC_ALL</code>) are set before retrieving messages.
|
|
</para>
|
|
|
|
<para>
|
|
Making the message catalogs can be initially tricky, but become
|
|
quite simple with practice. For complete info, see the gettext
|
|
documentation. Here's an idea of what is required:
|
|
</para>
|
|
|
|
<itemizedlist>
|
|
<listitem>
|
|
<para>
|
|
Make a source file with the required string literals that need
|
|
to be translated. See <code>intl/string_literals.cc</code> for
|
|
an example.
|
|
</para>
|
|
</listitem>
|
|
|
|
<listitem>
|
|
<para>
|
|
Make initial catalog (see "4 Making the PO Template File" from
|
|
the gettext docs).</para>
|
|
<para>
|
|
<code> xgettext --c++ --debug string_literals.cc -o libstdc++.pot </code>
|
|
</para>
|
|
</listitem>
|
|
|
|
<listitem>
|
|
<para>Make language and country-specific locale catalogs.</para>
|
|
<para>
|
|
<code>cp libstdc++.pot fr_FR.po</code>
|
|
</para>
|
|
<para>
|
|
<code>cp libstdc++.pot de_DE.po</code>
|
|
</para>
|
|
</listitem>
|
|
|
|
<listitem>
|
|
<para>
|
|
Edit localized catalogs in emacs so that strings are
|
|
translated.
|
|
</para>
|
|
<para>
|
|
<code>emacs fr_FR.po</code>
|
|
</para>
|
|
</listitem>
|
|
|
|
<listitem>
|
|
<para>Make the binary mo files.</para>
|
|
<para>
|
|
<code>msgfmt fr_FR.po -o fr_FR.mo</code>
|
|
</para>
|
|
<para>
|
|
<code>msgfmt de_DE.po -o de_DE.mo</code>
|
|
</para>
|
|
</listitem>
|
|
|
|
<listitem>
|
|
<para>Copy the binary files into the correct directory structure.</para>
|
|
<para>
|
|
<code>cp fr_FR.mo (dir)/fr_FR/LC_MESSAGES/libstdc++.mo</code>
|
|
</para>
|
|
<para>
|
|
<code>cp de_DE.mo (dir)/de_DE/LC_MESSAGES/libstdc++.mo</code>
|
|
</para>
|
|
</listitem>
|
|
|
|
<listitem>
|
|
<para>Use the new message catalogs.</para>
|
|
<para>
|
|
<code>locale loc_de("de_DE");</code>
|
|
</para>
|
|
<para>
|
|
<code>
|
|
use_facet<messages<char> >(loc_de).open("libstdc++", locale(), dir);
|
|
</code>
|
|
</para>
|
|
</listitem>
|
|
</itemizedlist>
|
|
|
|
</section>
|
|
</section>
|
|
|
|
<section xml:id="facet.messages.use"><info><title>Use</title></info>
|
|
|
|
<para>
|
|
A simple example using the GNU model of message conversion.
|
|
</para>
|
|
|
|
<programlisting>
|
|
#include <iostream>
|
|
#include <locale>
|
|
using namespace std;
|
|
|
|
void test01()
|
|
{
|
|
typedef messages<char>::catalog catalog;
|
|
const char* dir =
|
|
"/mnt/egcs/build/i686-pc-linux-gnu/libstdc++/po/share/locale";
|
|
const locale loc_de("de_DE");
|
|
const messages<char>& mssg_de = use_facet<messages<char> >(loc_de);
|
|
|
|
catalog cat_de = mssg_de.open("libstdc++", loc_de, dir);
|
|
string s01 = mssg_de.get(cat_de, 0, 0, "please");
|
|
string s02 = mssg_de.get(cat_de, 0, 0, "thank you");
|
|
cout << "please in german:" << s01 << '\n';
|
|
cout << "thank you in german:" << s02 << '\n';
|
|
mssg_de.close(cat_de);
|
|
}
|
|
</programlisting>
|
|
|
|
</section>
|
|
|
|
<section xml:id="facet.messages.future"><info><title>Future</title></info>
|
|
|
|
|
|
<itemizedlist>
|
|
<listitem>
|
|
<para>
|
|
Things that are sketchy, or remain unimplemented:
|
|
</para>
|
|
<itemizedlist>
|
|
<listitem>
|
|
<para>
|
|
_M_convert_from_char, _M_convert_to_char are in flux,
|
|
depending on how the library ends up doing character set
|
|
conversions. It might not be possible to do a real character
|
|
set based conversion, due to the fact that the template
|
|
parameter for messages is not enough to instantiate the
|
|
codecvt facet (1 supplied, need at least 2 but would prefer
|
|
3).
|
|
</para>
|
|
</listitem>
|
|
|
|
<listitem>
|
|
<para>
|
|
There are issues with gettext needing the global locale set
|
|
to extract a message. This dependence on the global locale
|
|
makes the current "gnu" model non MT-safe. Future versions
|
|
of glibc, i.e. glibc 2.3.x will fix this, and the C++ library
|
|
bits are already in place.
|
|
</para>
|
|
</listitem>
|
|
</itemizedlist>
|
|
</listitem>
|
|
|
|
<listitem>
|
|
<para>
|
|
Development versions of the GNU "C" library, glibc 2.3 will allow
|
|
a more efficient, MT implementation of std::messages, and will
|
|
allow the removal of the _M_name_messages data member. If this is
|
|
done, it will change the library ABI. The C++ parts to support
|
|
glibc 2.3 have already been coded, but are not in use: once this
|
|
version of the "C" library is released, the marked parts of the
|
|
messages implementation can be switched over to the new "C"
|
|
library functionality.
|
|
</para>
|
|
</listitem>
|
|
<listitem>
|
|
<para>
|
|
At some point in the near future, std::numpunct will probably use
|
|
std::messages facilities to implement truename/falsename
|
|
correctly. This is currently not done, but entries in
|
|
libstdc++.pot have already been made for "true" and "false" string
|
|
literals, so all that remains is the std::numpunct coding and the
|
|
configure/make hassles to make the installed library search its
|
|
own catalog. Currently the libstdc++.mo catalog is only searched
|
|
for the testsuite cases involving messages members.
|
|
</para>
|
|
</listitem>
|
|
|
|
<listitem>
|
|
<para> The following member functions:</para>
|
|
|
|
<para>
|
|
<code>
|
|
catalog
|
|
open(const basic_string<char>& __s, const locale& __loc) const
|
|
</code>
|
|
</para>
|
|
|
|
<para>
|
|
<code>
|
|
catalog
|
|
open(const basic_string<char>&, const locale&, const char*) const;
|
|
</code>
|
|
</para>
|
|
|
|
<para>
|
|
Don't actually return a "value less than 0 if no such catalog
|
|
can be opened" as required by the standard in the "gnu"
|
|
model. As of this writing, it is unknown how to query to see
|
|
if a specified message catalog exists using the gettext
|
|
package.
|
|
</para>
|
|
</listitem>
|
|
</itemizedlist>
|
|
|
|
</section>
|
|
|
|
<bibliography xml:id="facet.messages.biblio"><info><title>Bibliography</title></info>
|
|
|
|
|
|
<biblioentry>
|
|
<citetitle>
|
|
The GNU C Library
|
|
</citetitle>
|
|
<author><personname><surname>McGrath</surname><firstname>Roland</firstname></personname></author>
|
|
<author><personname><surname>Drepper</surname><firstname>Ulrich</firstname></personname></author>
|
|
<copyright>
|
|
<year>2007</year>
|
|
<holder>FSF</holder>
|
|
</copyright>
|
|
<pagenums>Chapters 6 Character Set Handling, and 7 Locales and Internationalization
|
|
</pagenums>
|
|
</biblioentry>
|
|
|
|
<biblioentry>
|
|
<citetitle>
|
|
Correspondence
|
|
</citetitle>
|
|
<author><personname><surname>Drepper</surname><firstname>Ulrich</firstname></personname></author>
|
|
<copyright>
|
|
<year>2002</year>
|
|
<holder/>
|
|
</copyright>
|
|
</biblioentry>
|
|
|
|
<biblioentry>
|
|
<citetitle>
|
|
ISO/IEC 14882:1998 Programming languages - C++
|
|
</citetitle>
|
|
<copyright>
|
|
<year>1998</year>
|
|
<holder>ISO</holder>
|
|
</copyright>
|
|
</biblioentry>
|
|
|
|
<biblioentry>
|
|
<citetitle>
|
|
ISO/IEC 9899:1999 Programming languages - C
|
|
</citetitle>
|
|
|
|
<copyright>
|
|
<year>1999</year>
|
|
<holder>ISO</holder>
|
|
</copyright>
|
|
</biblioentry>
|
|
|
|
<biblioentry>
|
|
<title>
|
|
<link xmlns:xlink="http://www.w3.org/1999/xlink"
|
|
xlink:href="https://pubs.opengroup.org/onlinepubs/9699919799/">
|
|
System Interface Definitions, Issue 7 (IEEE Std. 1003.1-2008)
|
|
</link>
|
|
</title>
|
|
<copyright>
|
|
<year>2008</year>
|
|
<holder>
|
|
The Open Group/The Institute of Electrical and Electronics
|
|
Engineers, Inc.
|
|
</holder>
|
|
</copyright>
|
|
</biblioentry>
|
|
|
|
<biblioentry>
|
|
<citetitle>
|
|
The C++ Programming Language, Special Edition
|
|
</citetitle>
|
|
<author><personname><surname>Stroustrup</surname><firstname>Bjarne</firstname></personname></author>
|
|
<copyright>
|
|
<year>2000</year>
|
|
<holder>Addison Wesley, Inc.</holder>
|
|
</copyright>
|
|
<pagenums>Appendix D</pagenums>
|
|
<publisher>
|
|
<publishername>
|
|
Addison Wesley
|
|
</publishername>
|
|
</publisher>
|
|
</biblioentry>
|
|
|
|
<biblioentry>
|
|
<citetitle>
|
|
Standard C++ IOStreams and Locales
|
|
</citetitle>
|
|
<subtitle>
|
|
Advanced Programmer's Guide and Reference
|
|
</subtitle>
|
|
<author><personname><surname>Langer</surname><firstname>Angelika</firstname></personname></author>
|
|
<author><personname><surname>Kreft</surname><firstname>Klaus</firstname></personname></author>
|
|
<copyright>
|
|
<year>2000</year>
|
|
<holder>Addison Wesley Longman, Inc.</holder>
|
|
</copyright>
|
|
<publisher>
|
|
<publishername>
|
|
Addison Wesley Longman
|
|
</publishername>
|
|
</publisher>
|
|
</biblioentry>
|
|
|
|
<biblioentry>
|
|
<title>
|
|
<link xmlns:xlink="http://www.w3.org/1999/xlink"
|
|
xlink:href="https://docs.oracle.com/en/java/">
|
|
API Specifications, Java Platform
|
|
</link>
|
|
</title>
|
|
|
|
<pagenums>java.util.Properties, java.text.MessageFormat,
|
|
java.util.Locale, java.util.ResourceBundle
|
|
</pagenums>
|
|
</biblioentry>
|
|
|
|
|
|
<biblioentry>
|
|
<title>
|
|
<link xmlns:xlink="http://www.w3.org/1999/xlink"
|
|
xlink:href="https://www.gnu.org/software/gettext/">
|
|
GNU gettext tools, version 0.10.38, Native Language Support
|
|
Library and Tools.
|
|
</link>
|
|
</title>
|
|
|
|
</biblioentry>
|
|
|
|
</bibliography>
|
|
|
|
</section>
|