stringi-search {stringi} | R Documentation |
This man page instructs how to perform string search-based operations in stringi.
The following independent string searching “engines” are available in stringi.
stri_*_regex
– ICU's regular expressions,
see stringi-search-regex,
stri_*_fixed
– locale-independent bytewise pattern matching,
see stringi-search-fixed,
stri_*_coll
– ICU's StringSearch
,
locale-sensitive, Collator-based pattern search,
useful for natural language processing tasks,
see stringi-search-coll,
stri_*_charclass
– character classes search,
e.g. Unicode General Categories or Binary Properties,
see stringi-search-charclass,
stri_*_boundaries
– text boundary analysis,
see stringi-search-boundaries
Each “engine” is able to perform many search-based operations. These may include:
stri_detect_*
- detect if a pattern occurs in a string,
see e.g. stri_detect
,
stri_count_*
- count the number of pattern occurrences,
see e.g. stri_count
,
stri_locate_*
- locate all, first, or last occurrences
of a pattern, see e.g. stri_locate
,
stri_extract_*
- extract all, first, or last occurrences
of a pattern, see e.g. stri_extract
and, in case of regexes, stri_match
,
stri_replace_*
- replace all, first, or last occurrences
of a pattern, see e.g. stri_replace
and also stri_trim
,
stri_split_*
- split a string into chunks indicated
by occurrences of a pattern,
see e.g. stri_split
,
stri_startswith_*
and stri_endswith_*
detect
if a string starts or ends with a pattern match, see
e.g. stri_startswith
,
stri_subset_*
- return a subset of a character vector
with strings that match a given pattern, see e.g. stri_subset
.
Other search_charclass: stri_trim
,
stri_trim
, stri_trim_both
,
stri_trim_left
,
stri_trim_right
;
stringi-search-charclass
Other search_coll: stri_opts_collator
;
stringi-search-coll
Other search_count: stri_count_boundaries
,
stri_count_words
; stri_count
,
stri_count_charclass
,
stri_count_coll
,
stri_count_fixed
,
stri_count_regex
Other search_detect: stri_detect
,
stri_detect_charclass
,
stri_detect_coll
,
stri_detect_fixed
,
stri_detect_regex
;
stri_endswith
,
stri_endswith_charclass
,
stri_endswith_coll
,
stri_endswith_fixed
,
stri_startswith
,
stri_startswith_charclass
,
stri_startswith_coll
,
stri_startswith_fixed
Other search_extract: stri_extract_all_boundaries
,
stri_extract_all_words
,
stri_extract_first_boundaries
,
stri_extract_first_words
,
stri_extract_last_boundaries
,
stri_extract_last_words
;
stri_extract
,
stri_extract_all
,
stri_extract_all_charclass
,
stri_extract_all_coll
,
stri_extract_all_fixed
,
stri_extract_all_regex
,
stri_extract_first
,
stri_extract_first_charclass
,
stri_extract_first_coll
,
stri_extract_first_fixed
,
stri_extract_first_regex
,
stri_extract_last
,
stri_extract_last_charclass
,
stri_extract_last_coll
,
stri_extract_last_fixed
,
stri_extract_last_regex
;
stri_match
, stri_match_all
,
stri_match_all_regex
,
stri_match_first
,
stri_match_first_regex
,
stri_match_last
,
stri_match_last_regex
Other search_fixed: stri_opts_fixed
;
stringi-search-fixed
Other search_locate: stri_locate_all_boundaries
,
stri_locate_all_words
,
stri_locate_first_boundaries
,
stri_locate_first_words
,
stri_locate_last_boundaries
,
stri_locate_last_words
;
stri_locate
, stri_locate_all
,
stri_locate_all_charclass
,
stri_locate_all_coll
,
stri_locate_all_fixed
,
stri_locate_all_regex
,
stri_locate_first
,
stri_locate_first_charclass
,
stri_locate_first_coll
,
stri_locate_first_fixed
,
stri_locate_first_regex
,
stri_locate_last
,
stri_locate_last_charclass
,
stri_locate_last_coll
,
stri_locate_last_fixed
,
stri_locate_last_regex
Other search_regex: stri_opts_regex
;
stringi-search-regex
Other search_replace: stri_replace_na
;
stri_replace
,
stri_replace_all
,
stri_replace_all_charclass
,
stri_replace_all_coll
,
stri_replace_all_fixed
,
stri_replace_all_regex
,
stri_replace_first
,
stri_replace_first_charclass
,
stri_replace_first_coll
,
stri_replace_first_fixed
,
stri_replace_first_regex
,
stri_replace_last
,
stri_replace_last_charclass
,
stri_replace_last_coll
,
stri_replace_last_fixed
,
stri_replace_last_regex
;
stri_trim
, stri_trim
,
stri_trim_both
,
stri_trim_left
,
stri_trim_right
Other search_split: stri_split_boundaries
;
stri_split_lines
,
stri_split_lines1
,
stri_split_lines1
;
stri_split
,
stri_split_charclass
,
stri_split_coll
,
stri_split_fixed
,
stri_split_regex
Other search_subset: stri_subset
,
stri_subset_charclass
,
stri_subset_coll
,
stri_subset_fixed
,
stri_subset_regex
Other stringi_general_topics: stringi-arguments
;
stringi-encoding
;
stringi-locale
;
stringi-search-boundaries
;
stringi-search-charclass
;
stringi-search-coll
;
stringi-search-fixed
;
stringi-search-regex
;
stringi
, stringi-package
Other text_boundaries: stri_count_boundaries
,
stri_count_words
;
stri_extract_all_boundaries
,
stri_extract_all_words
,
stri_extract_first_boundaries
,
stri_extract_first_words
,
stri_extract_last_boundaries
,
stri_extract_last_words
;
stri_locate_all_boundaries
,
stri_locate_all_words
,
stri_locate_first_boundaries
,
stri_locate_first_words
,
stri_locate_last_boundaries
,
stri_locate_last_words
;
stri_opts_brkiter
;
stri_split_boundaries
;
stri_split_lines
,
stri_split_lines1
,
stri_split_lines1
;
stri_trans_tolower
,
stri_trans_totitle
,
stri_trans_toupper
;
stri_wrap
;
stringi-search-boundaries