carb::RString
Defined in carb/RString.h
-
class RString : public carb::detail::RStringTraits<false>
Carbonite registered strings.
The Carbonite framework has a rich string-interning interface that is very easily used through the RString (and other) classes. This implements a Flyweight pattern for strings. The registered string interface is fully ABI-safe due to versioning, and can even be used in an application prior to the
main()
,WinMain()
orDllMain()
functions being called. Furthermore, the API is fully thread-safe.Registered strings have pre-computed hashes which make them ideal for identifiers and map keys, and string (in-)equality checks are O(1) constant time. For ordered containers, registered strings have an
owner_before()
function that can be used for stable (though not lexicographical) ordering. If lexicographical ordering is desired, O(n)compare()
functions are provided.Variations exist around case-sensitivity. The RStringU class (the U stands for “un-cased” which is used in this API to denote case-insensitivity) is used to register a string that will compare in a case-insensitive manner. Although RString and RStringU cannot be directly compared for equality, RString::toUncased() exists to explicitly create a case-insensitive RStringU from an RString which can then be compared.
Variations also exist around using registered strings as a key value. It can be useful to have an associated number to denote multiple instances of a registered string: hence the RStringKey and RStringUKey classes.
To register a string, pass a string to the RString constructor RAII-style. Strings that are registered stay as such for the entire run of the application; strings are never unregistered. Registered strings are stored in a named section of shared memory accessible by all modules loaded by an application. The memory for registered strings is allocated directly from the operating system to avoid cross-DLL heap issues.
Variations:
RStringU - an “un-cased” (i.e. case-insensitive) registered string
RStringKey - Adds a numeric component to RString to create an identifier or key.
RStringUKey - Adds a numeric component to RStringU to create an identifier or key that is case-insensitive.
Note
Registered strings are a limited resource, but there exists slots for approximately two million strings.
Public Functions
-
constexpr RString() noexcept
Default constructor.
isEmpty() will report
true
.
-
constexpr RString(eRString staticString) noexcept
Initializes this registered string to one of the static pre-defined registered strings.
- Parameters
staticString – The pre-defined registered string to use.
-
explicit RString(const char *str, RStringOp op = RStringOp::eRegister)
Finds or registers a new string.
- Parameters
str – The string to find or register.
op – The operation to perform. If directed to RStringOp::eFindExisting and the string has not been previously registered,
*this
is initialized as if with the default constructor.
-
explicit RString(const char *str, size_t len, RStringOp op = RStringOp::eRegister)
Finds or registers a new counted string.
Note
While generally not recommended, passing
len
allows the given string to contain embedded NUL (‘\0’) characters.- Parameters
str – The string to find or register.
len – The number of characters of
str
to include.op – The operation to perform. If directed to RStringOp::eFindExisting and the string has not been previously registered,
*this
is initialized as if with the default constructor.
-
explicit RString(const std::string &str, RStringOp op = RStringOp::eRegister)
Finds or registers a new
std::string
.Note
If
str
contains embedded NUL (‘\0’) characters, the RString will contain the embedded NUL characters as well.- Parameters
str – The
std::string
to find or register.op – The operation to perform. If directed to RStringOp::eFindExisting and the string has not been previously registered,
*this
is initialized as if with the default constructor.
-
explicit RString(const RStringKey &other) noexcept
Truncates RStringKey into only the registered string portion.
- Parameters
other – The RStringKey to truncate.
-
RStringU toUncased() const noexcept
Converts this registered string into an “un-cased” (i.e.
case-insensitive) registered string.
Note
The returned string may differ in case to
*this
when retrieved with c_str() or toString().- Returns
An “un-cased” (i.e. case-insensitive) string that matches
*this
when compared in a case-insensitive manner.
-
RString truncate() const noexcept
Returns a copy of this registered string.
Note
This function exists for compatibility with the RStringKey interface.
- Returns
*this
since this string already has no number component.
-
RStringKey toRStringKey(int32_t number = 0) const
Appends a number to the registered string to form a RStringKey.
- Parameters
number – An optional number to append (default =
0
).- Returns
An RStringKey based on
*this
and the provided number.
-
bool operator==(const RString &other) const noexcept
Equality comparison between this registered string and another.
- Parameters
other – Another registered string.
- Returns
true
if*this
andother
represent the same registered string;false
otherwise.
-
bool operator!=(const RString &other) const noexcept
Inequality comparison between this registered string and another.
- Parameters
other – Another registered string.
- Returns
false
if*this
andother
represent the same registered string;true
otherwise.
-
bool owner_before(const RString &other) const noexcept
Checks whether this registered string is stably (but not lexicographically) ordered before another registered string.
This ordering is to make registered strings usable as keys in ordered associative containers in O(1) time.
Note
This is NOT a lexicographical comparison; for that use one of the compare() functions. To reduce ambiguity between a strict ordering and lexicographical comparison there is no
operator<
function for this string class. While a lexicographical comparison would be O(n), this comparison is O(1).- Parameters
other – Another registered string.
- Returns
true
if*this
should be ordered-beforeother
;false
otherwise.
-
bool isValid() const noexcept
Checks to see if this registered string has been corrupted.
Note
It is not possible for this registered string to become corrupted through normal use of the API. It could be caused by bad casts or use-after-free.
- Returns
true
if*this
represents a valid registered string;false
if*this
is corrupted.
-
constexpr bool isEmpty() const noexcept
Checks to see if this registered string represents the “” (empty) value.
- Returns
true
if*this
is default-initialized or initialized to eRString::Empty;false
otherwise.
-
constexpr bool isUncased() const noexcept
Checks to see if this registered string represents an “un-cased” (i.e.
case-insensitive) registered string.
- Returns
true
if*this
is “un-cased” (i.e. case-insensitive);false
if case-sensitive.
-
constexpr uint32_t getStringId() const noexcept
Returns the registered string ID.
This ID is only useful for debugging purposes and should not be used for comparisons.
- Returns
The string ID for this registered string.
-
size_t getHash() const
Returns the hash value as by
carb::hashString(this->c_str())
.Note
This value is computed once for a registered string and cached, so this operation is generally very fast.
- Returns
The hash value as computed by
carb::hashString(this->c_str())
.
-
size_t getUncasedHash() const noexcept
Returns the hash value as by
carb::hashLowercaseString(this->c_str())
.Note
This value is pre-computed for registered strings and cached, so this operation is always O(1).
- Returns
The hash value as computed by
carb::hashLowercaseString(this->c_str())
.
-
const char *c_str() const noexcept
Resolves this registered string to a C-style NUL-terminated string.
Note
This operation is O(1).
- Returns
The C-style string previously registered.
-
const char *data() const noexcept
An alias for c_str(); resolves this registered string to a C-style NUL-terminated string.
Note
This operation is O(1).
- Returns
The C-style string previously registered.
-
size_t length() const noexcept
Returns the length of the registered string.
If the string contains embedded NUL (‘\0’) characters this may differ from
std::strlen(c_str())
.Note
This operation is O(1).
- Returns
The length of the registered string not including the NUL terminator.
-
bool operator==(const RStringTraits &other) const
Equality comparison between this registered string and another.
- Parameters
other – Another registered string.
- Returns
true
if*this
andother
represent the same registered string;false
otherwise.
-
bool operator!=(const RStringTraits &other) const
Inequality comparison between this registered string and another.
- Parameters
other – Another registered string.
- Returns
false
if*this
andother
represent the same registered string;true
otherwise.
-
bool owner_before(const RStringTraits &other) const
Checks whether this registered string is stably (but not lexicographically) ordered before another registered string.
This ordering is to make registered strings usable as keys in ordered associative containers in O(1) time.
Note
This is NOT a lexicographical comparison; for that use one of the compare() functions. To reduce ambiguity between a strict ordering and lexicographical comparison there is no
operator<
function for this string class. While a lexicographical comparison would be O(n), this comparison is O(1).- Parameters
other – Another registered string.
- Returns
true
if*this
should be ordered-beforeother
;false
otherwise.
-
int compare(const RStringTraits<OtherUncased, OtherBase> &other) const
Lexicographically compares this registered string with another.
Note
If either
*this
orother
is “un-cased” (i.e. case-insensitive), a case-insensitive compare is performed.- Template Parameters
OtherUncased –
true
ifother
is “un-cased” (i.e. case-insensitive);false
otherwise.- Parameters
other – Another registered string to compare against.
- Returns
0
if the strings are equal,>0
ifother
is lexicographically ordered before*this
, or<0
if*this
is lexicographically ordered beforeother
. See note above regarding case-sensitivity.
-
int compare(const char *s) const
Lexicographically compares this registered string with a C-style string.
Note
If
*this
is “un-cased” (i.e. case-insensitive), a case-insensitive compare is performed.- Parameters
s – A C-style string to compare against.
- Returns
0
if the strings are equal,>0
ifs
is lexicographically ordered before*this
, or<0
if*this
is lexicographically ordered befores
. See note above regarding case-sensitivity.
-
int compare(size_t pos, size_t count, const char *s) const
Lexicographically compares a substring of this registered string with a C-style string.
Note
If
*this
is “un-cased” (i.e. case-insensitive), a case-insensitive compare is performed.- Parameters
pos – The starting offset of the registered string represented by
*this
. Must less-than-or-equal-to the length of the registered string.count – The length from
pos
to use in the comparison. This value is automatically clamped to the end of the registered string.s – A C-style string to compare against.
- Returns
0
if the strings are equal,>0
ifs
is lexicographically ordered before the substring of*this
, or<0
if the substring of*this
is lexicographically ordered befores
. See note above regarding case-sensitivity.
-
int compare(size_t pos, size_t count, const char *s, size_t len) const
Lexicographically compares a substring of this registered string with a C-style string.
Note
If
*this
is “un-cased” (i.e. case-insensitive), a case-insensitive compare is performed.- Parameters
pos – The starting offset of the registered string represented by
*this
. Must less-than-or-equal-to the length of the registered string.count – The length from
pos
to use in the comparison. This value is automatically clamped to the end of the registered string.s – A C-style string to compare against.
len – The number of characters of
s
to compare against.
- Returns
0
if the strings are equal,>0
ifs
is lexicographically ordered before the substring of*this
, or<0
if the substring of*this
is lexicographically ordered befores
. See note above regarding case-sensitivity.
-
int compare(const std::string &s) const
Lexicographically compares this registered string with a string.
Note
If
*this
is “un-cased” (i.e. case-insensitive), a case-insensitive compare is performed.- Parameters
s – A string to compare against.
- Returns
0
if the strings are equal,>0
ifs
is lexicographically ordered before*this
, or<0
if*this
is lexicographically ordered befores
. See note above regarding case-sensitivity.
-
int compare(size_t pos, size_t count, const std::string &s) const
Lexicographically compares a substring of this registered string with a string.
Note
If
*this
is “un-cased” (i.e. case-insensitive), a case-insensitive compare is performed.- Parameters
pos – The starting offset of the registered string represented by
*this
. Must less-than-or-equal-to the length of the registered string.count – The length from
pos
to use in the comparison. This value is automatically clamped to the end of the registered string.s – A string to compare against.
- Returns
0
if the strings are equal,>0
ifs
is lexicographically ordered before the substring of*this
, or<0
if the substring of*this
is lexicographically ordered befores
. See note above regarding case-sensitivity.
Public Static Attributes
-
static constexpr bool IsUncased
Constant that indicates whether this is “un-cased” (i.e.
case-insensitive).