Mô đun này được đánh giá là sẵn sàng cho sử dụng chung phổ biến rộng rãi. Nó đã đạt đến mức độ hoàn thiện và được cho là không có lỗi và sẵn sàng để sử dụng bất cứ khi nào thích hợp. Nó đã sẵn sàng để đề cập trên các trang trợ giúp và các tài nguyên Wikipedia khác như một tùy chọn cho người dùng mới tìm hiểu. Để giảm tải máy chủ và đầu ra xấu, cần cải thiện nó bằng chỗ thử thay vì lặp đi lặp lại chỉnh sửa thử và bị lỗi. |
The module String2 contains five case-related calls that convert strings to upper, lower, sentence or title case and two calls that are useful for working with substrings. There are two utility calls that strip leading zeros from padded numbers and transform text so that it is not interpreted as wikitext.
The upper function simply converts all characters to upper case.
The lower function simply converts all characters to lower case.
The title function capitalises the first letter of each word in the text, apart from a number of short words recommended by The U.S. Government Printing Office Style Manual: a, an, the, at, by, for, in, of, on, to, up, and, as, but, or, and nor.
The sentence function finds the first letter and capitalises it, then renders the rest of the text in lower case. It works properly with text containing wiki-markup. Compare {{#invoke:String2|sentence|[[action game]]}}
-> Action game with {{ucfirst:{{lc:[[action game]]}}}}
-> action game. Piped wiki-links are handled as well:
{{#invoke:String2|sentence|[[trimix (breathing gas)|trimix]]}}
-> TrimixSo are lists:
{{#invoke:String2 |sentence |{{hlist ||[[apples]] |[[pears]] |[[oranges]]}}}}
→ '"`uniq--templatestyles-00000008-qinu`"'The ucfirst function is similar to sentence; it renders the first alphabetical character in upper case, but leaves the capitalisation of the rest of the text unaltered. This is useful if the text contains proper nouns, but it will not regularise sentences that are ALLCAPS, for example. It also works with text containing piped wiki-links and with html lists. (now obselete)
label
is an alias to this function for backward compatibility.
One potential issue is that using Lua special pattern characters (^$()%.[]*+-?
) as the separator will probably cause problems.
Case | Wikitext | Output |
---|---|---|
Normal usage | {{#invoke:String2 |findlast | 5, 932, 992,532, 6,074,702, 6,145,291}} |
6,145,291 |
Space as separator | {{#invoke:String2 |findlast | 5 932 992,532 6,074,702 6,145,291 }} |
5 932 992,532 6,074,702 6,145,291 |
One item list | {{#invoke:String2 |findlast | 6,074,702 }} |
6,074,702 |
Separator not found | {{#invoke:String2 |findlast | 5, 932, 992,532, 6,074,702, 6,145,291 |;}} |
5, 932, 992,532, 6,074,702, 6,145,291 |
List missing | {{#invoke:String2 |findlast |}} |
|source=
), which is trimmed.|target=
), which is trimmed and any double quotes " are stripped out. That allows spaces at the beginning or end of the match string to be included in a consistent manner.|plain=
), which is trimmed. If it's set to false, then the search accepts Lua pattern-matching for the target, otherwise a plain search is used.|nomatch=
), which is trimmed. This value is returned if no match occurs. Setting |nomatch=0
makes the output compatible with the find function in Module:String.{{#invoke:String2 |posnq |This is a piece of text to be searched |ext}}
→ 21{{#invoke:String2 |posnq |This is a piece of text to be searched |ent}}
→{{#invoke:String2 |posnq |This is a piece of text to be searched |" pie"}}
→ 10{{#invoke:String2 |posnq |This is a piece of text to be searched |" ece"}}
→{{#invoke:String2 |posnq |source=This is a piece of text |target=ece}}
→ 13{{#invoke:String2 |posnq |source=This is a piece of text |target=%s |plain=true}}
→{{#invoke:String2 |posnq |source=This is a piece of text |target=%s |plain=false}}
→ 5{{#invoke:String2 |posnq |source=This is a piece of text |target=ece |nomatch=0}}
→ 13{{#invoke:String2 |posnq |source=This is a piece of text |target=xyz |nomatch=0}}
→ 0{{#invoke:String2 |posnq |This is a piece of text |" of" |true |0}}
→ 16{{#invoke:String2 |posnq |This is a piece of text |" of" |true |0}}
→ 0{{#invoke:String2 |posnq |source=Meet at Café Nero |target=afé}}
→ 10The split function splits text at boundaries specified by separator and returns the chunk for the index idx (starting at 1). It can use positional parameters or named parameters (but these should not be mixed):
{{#invoke:String2 |split |text |separator |index |true/false}}
{{#invoke:String2 |split |txt=text |sep=separator |idx=index |plain=true/false}}
Any double quotes (") in the separator parameter are stripped out, which allows spaces and wikitext like ["[
to be passed. Use {{!}}
for the pipe character |
.
If the optional plain parameter is set to false / no / 0
then separator is treated as a Lua pattern. The default is plain=true, i.e. normal text matching.
The index parameter is optional; it defaults to the first chunk of text.
The Template:Stringsplit is a convenience wrapper for the split function.
The stripZeros functions finds the first number in a string of text and strips leading zeros, but retains a zero which is followed by a decimal point. For example: "0940" -> "940"; "Year: 0023" -> "Year: 23"; "00.12" -> "0.12"
The nowiki function ensures that a string of text is treated by the MediaWiki software as just a string, not code. It trims leading and trailing whitespace.
The val2percent functions scans through a string, passed as either the first unnamed parameter or |txt=, and converts each number it finds into a percentage, then returns the resulting string.
The one2a function scans through a string, passed as either the first unnamed parameter or |txt=, and converts each occurrence of 'one ' into either 'a ' or 'an ', then returns the resultant string.
The Template:One2a is a convenience wrapper for the one2a function.
The findpagetext function returns the position of a piece of text in the wikitext source of a page. It takes up to four parameters:
{{#invoke:String2 |findpagetext |text=Youghiogheny}}
→{{#invoke:String2 |findpagetext |text=Youghiogheny |nomatch=not found}}
→ not found{{#invoke:String2 |findpagetext |text=Youghiogheny |title=Boston Bridge |nomatch=not found}}
→ Lỗi Lua: bad argument #1 to 'find' (string expected, got nil).{{#invoke:String2 |findpagetext |text=river |title=Boston Bridge |nomatch=not found}}
→ Lỗi Lua: bad argument #1 to 'find' (string expected, got nil).{{#invoke:String2 |findpagetext |text=[Rr]iver |title=Boston Bridge |plain=false |nomatch=not found}}
→ Lỗi Lua: bad argument #1 to 'find' (string expected, got nil).{{#invoke:String2 |findpagetext |text=%[%[ |title=Boston Bridge |plain=f |nomatch=not found}}
→ Lỗi Lua: bad argument #1 to 'find' (string expected, got nil).{{#invoke:String2 |findpagetext |text=%{%{[Cc]oord |title=Boston Bridge |plain=f |nomatch=not found}}
→ Lỗi Lua: bad argument #1 to 'find' (string expected, got nil).The search is case-sensitive, so Lua pattern matching is needed to find river
or River
. The last example finds {{coord
and {{Coord
. The penultimate example finds a wiki-link.
The Template:Findpagetext is a convenience wrapper for this function.
{{#invoke:String2 | upper |…}}
- Shifts all characters to uppercase{{#invoke:String2 | lower |…}}
- Shifts all characters to lowercase{{#invoke:String2 | sentence |…}}
- Capitalizes the first character and shifts the rest to lowercase
{{ucfirst:}}
function, this call works even with piped wiki-links because it searches beyond leading brackets and other non-alphanumeric characters.<li>
) and any piped links that may be there.{{#invoke:String2 | ucfirst |…}}
- Capitalizes the first alphabetic character and leaves the rest unaltered
{{#invoke:String2 | title |…}}
- Capitalizes all words, except for a
, an
, the
, at
, by
, for
, in
, of
, on
, to
, up
, and
, as
, but
, or
, and nor
.{{#invoke:String2 | stripZeros |…}}
- Removes leading padding zeros from the first number it finds in the string{{#invoke:String2 | title |…}}
- Renders the string as plain text without wikicodeThese functions take one unnamed parameter comprising (or invoking as a string) the text to be manipulated:
Input | Output |
---|---|
{{#invoke:String2| upper | abcd }} | ABCD |
{{#invoke:String2| upper | abCD }} | ABCD |
{{#invoke:String2| upper | ABcd }} | ABCD |
{{#invoke:String2| upper | ABCD }} | ABCD |
{{#invoke:String2| upper | }} | |
{{#invoke:String2| lower | abcd }} | abcd |
{{#invoke:String2| lower | abCD }} | abcd |
{{#invoke:String2| lower | ABcd }} | abcd |
{{#invoke:String2| lower | ABCD }} | abcd |
{{#invoke:String2| lower | }} | |
{{#invoke:String2| ucfirst | abcd }} | Abcd |
{{#invoke:String2| ucfirst | abCD }} | AbCD |
{{#invoke:String2| ucfirst | ABcd }} | ABcd |
{{#invoke:String2| ucfirst | ABCD }} | ABCD |
{{#invoke:String2| ucfirst | 123abcd }} | 123Abcd |
{{#invoke:String2| ucfirst | }} | |
{{#invoke:String2| ucfirst | human X chromosome }} | Human X chromosome |
{{#invoke:String2| sentence | abcd }} | Abcd |
{{#invoke:String2| sentence | abCD }} | Abcd |
{{#invoke:String2| sentence | ABcd }} | Abcd |
{{#invoke:String2| sentence | ABCD }} | Abcd |
{{#invoke:String2| sentence | [[action game]] }} | Action game |
{{#invoke:String2| sentence | [[trimix (breathing gas)|trimix]] }} | Trimix |
{{#invoke:String2 | sentence | {{#invoke:WikidataIB |getValue |P136 |fetchwikidata=ALL |onlysourced=no |qid=Q1396889}} }} | Roman à clef, satirical fiction, ngụ ngôn, dystopian fiction |
{{#invoke:String2 | sentence | {{#invoke:WikidataIB |getValue |P106 |fetchwikidata=ALL |list=hlist |qid=Q453196}} }} | '"`uniq--templatestyles-00000048-qinu`"'
|
{{#invoke:String2| sentence | }} | |
{{#invoke:String2| title | abcd }} | Abcd |
{{#invoke:String2| title | abCD }} | Abcd |
{{#invoke:String2| title | ABcd }} | Abcd |
{{#invoke:String2| title | ABCD }} | Abcd |
{{#invoke:String2| title | }} | |
{{#invoke:String2| title | the vitamins are in my fresh california raisins}} | The Vitamins Are in My Fresh California Raisins |
Template:Posnq is a convenience wrapper for the posnq function.
{{Posnq |This is a piece of text to be searched |piece of }}
→ 11{{Posnq |This is a piece oftext to be searched |piece of }}
→ 11{{Posnq |This is a piece of text to be searched |"piece of "}}
→ 11{{Posnq |This is a piece oftext to be searched |"piece of "}}
→Template:Stringsplit is a convenience wrapper for the split function.
{{Stringsplit |This is a piece of text to be split |" "}}
→ This{{Stringsplit |This is a piece of text to be split |" "| 4}}
→ piece{{Stringsplit |This is a piece of text to be split |x| 2}}
→ t to be splitModules may return strings with | as separators like this: {{#invoke:carousel | main | name = WPDogs | switchsecs = 5 }}
→ (2)BIR Grupp 7- KORTHÅRIG VORSTEH, J Björnkärrets Hertzogin Aida (24208119306).jpg|German Shorthaired Pointer
{{Stringsplit |{{#invoke:carousel | main | name = WPDogs | switchsecs = 5 }}|{{!}}| 2}}
→ German Shorthaired PointerLua patterns can allow splitting at classes of characters such as punctuation:
{{Stringsplit |Apples, pears, oranges; Cats, dogs|"%p"| 2 |false}}
→ pears{{Stringsplit |Apples, pears, oranges; Cats, dogs|"%p"| 4 |false}}
→ CatsOr split on anything that isn't a letter (no is treated as false):
{{Stringsplit |Apples pears oranges; Cats dogs|"%A+"| 4 |no}}
→ CatsNamed parameters force the trimming of leading and trailing spaces in the parameters and are generally clearer when used:
{{Stringsplit | txt=Apples pears oranges; Cats dogs | sep="%A+" | idx=3 | plain=false }}
→ orangesTemplate:One2a is a convenience wrapper for the one2a function.
Capitalisation is kept. Aimed for usage with {{Convert}}.
{{one2a |One foot. One mile. One kilometer. One inch.One amp. one foot. one mile. one inch. Alone at last. Onely the lonely. ONE ounce. One monkey.}}
→{{convert|1|ft|spell=on}}
→ một foot (không phẩy ba không mét){{one2a|{{convert|1|ft|spell=on}}}}
→ một foot (không phẩy ba không mét){{convert|2.54|cm|0|disp=out|spell=on}}
→ một inch[chuyển đổi: số không hợp lệ]{{one2a|{{convert|2.54|cm|0|disp=out|spell=on}}}}
→ một inch[chuyển đổi: số không hợp lệ]Module:String for the following functions:
Templates and modules related to capitalization
Magic words that rewrite the output (copy-paste will get the text as displayed, not as entered):
{{lc:}}
– lower case output of the full text{{uc:}}
– upper case output of the full text{{lcfirst:}}
– lower case output of the first character only{{ucfirst:}}
– upper case output of the first character only
Templates that implement <nowiki>
local p = {}
p.upper = function(frame)
local s = mw.text.trim(frame.args[1] or "")
return string.upper(s)
end
p.lower = function(frame)
local s = mw.text.trim(frame.args[1] or "")
return string.lower(s)
end
p.sentence = function (frame )
frame.args[1] = string.lower(frame.args[1])
return p.ucfirst(frame)
end
p.ucfirst = function (frame )
local s = mw.text.trim( frame.args[1] or "" )
local s1 = ""
-- if it's a list chop off and (store as s1) everything up to the first <li>
local lipos = mw.ustring.find(s, "<li>" )
if lipos then
s1 = mw.ustring.sub(s, 1, lipos + 3)
s = mw.ustring.sub(s, lipos + 4)
end
-- s1 is either "" or the first part of the list markup, so we can continue
-- and prepend s1 to the returned string
local letterpos
if mw.ustring.find(s, "^%[%[[^|]+|[^%]]+%]%]") then
-- this is a piped wikilink, so we capitalise the text, not the pipe
local _
_, letterpos = mw.ustring.find(s, "|%A*%a") -- find the first letter after the pipe
else
letterpos = mw.ustring.find(s, '%a')
end
if letterpos then
local first = mw.ustring.sub(s, 1, letterpos - 1)
local letter = mw.ustring.sub(s, letterpos, letterpos)
local rest = mw.ustring.sub(s, letterpos + 1)
return s1 .. first .. mw.ustring.upper(letter) .. rest
else
return s1 .. s
end
end
p.title = function (frame )
-- http://grammar.yourdictionary.com/capitalization/rules-for-capitalization-in-titles.html
-- recommended by The U.S. Government Printing Office Style Manual:
-- "Capitalize all words in titles of publications and documents,
-- except a, an, the, at, by, for, in, of, on, to, up, and, as, but, or, and nor."
local alwayslower = {['a'] = 1, ['an'] = 1, ['the'] = 1,
['and'] = 1, ['but'] = 1, ['or'] = 1, ['for'] = 1,
['nor'] = 1, ['on'] = 1, ['in'] = 1, ['at'] = 1, ['to'] = 1,
['from'] = 1, ['by'] = 1, ['of'] = 1, ['up'] = 1 }
local res = ''
local s = mw.text.trim( frame.args[1] or "" )
local words = mw.text.split( s, " ")
for i, s in ipairs(words) do
s = string.lower( s )
if( i > 1 and alwayslower[s] == 1) then
-- leave in lowercase
else
s = mw.getContentLanguage():ucfirst(s)
end
words[i] = s
end
return table.concat(words, " ")
end
-- findlast finds the last item in a list
-- the first unnamed parameter is the list
-- the second, optional unnamed parameter is the list separator (default = comma space)
-- returns the whole list if separator not found
p.findlast = function(frame)
local s = mw.text.trim( frame.args[1] or "" )
local sep = frame.args[2] or ""
if sep == "" then sep = ", " end
local pattern = ".*" .. sep .. "(.*)"
a, b, last = s:find(pattern)
if a then
return last
else
return s
end
end
-- stripZeros finds the first number and strips leading zeros (apart from units)
-- e.g "0940" -> "940"; "Year: 0023" -> "Year: 23"; "00.12" -> "0.12"
p.stripZeros = function(frame)
local s = mw.text.trim(frame.args[1] or "")
local n = tonumber( string.match( s, "%d+" ) ) or ""
s = string.gsub( s, "%d+", n, 1 )
return s
end
-- nowiki ensures that a string of text is treated by the MediaWiki software as just a string
-- it takes an unnamed parameter and trims whitespace, then removes any wikicode
p.nowiki = function(frame)
local str = mw.text.trim(frame.args[1] or "")
return mw.text.nowiki(str)
end
-- posnq (position, no quotes) returns the numerical start position of the first occurrence
-- of one piece of text ("match") inside another ("str").
-- It returns nil if no match is found, or if either parameter is blank.
-- It takes the text to be searched in as the first unnamed parameter, which is trimmed.
-- It takes the text to match as the second unnamed parameter, which is trimmed and
-- any double quotes " are stripped out.
p.posnq = function(frame)
local args = frame.args
local pargs = frame:getParent().args
for k, v in pairs(pargs) do
args[k] = v
end
local str = mw.text.trim(args[1] or args.source or "")
local match = mw.text.trim(args[2] or args.target or ""):gsub('"', '')
if str == "" or match == "" then return nil end
local plain = mw.text.trim(args[3] or args.plain or "")
if plain == "false" then plain = false else plain = true end
local nomatch = mw.text.trim(args[4] or args.nomatch or "")
-- just take the start position
local pos = mw.ustring.find(str, match, 1, plain) or nomatch
return pos
end
-- split splits text at boundaries specified by separator
-- and returns the chunk for the index idx (starting at 1)
-- #invoke:String2 |split |text |separator |index |true/false
-- #invoke:String2 |split |txt=text |sep=separator |idx=index |plain=true/false
-- if plain is false/no/0 then separator is treated as a Lua pattern - defaults to plain=true
p.split = function(frame)
local args = frame.args
if not(args[1] or args.txt) then args = frame:getParent().args end
local txt = args[1] or args.txt or ""
if txt == "" then return nil end
local sep = (args[2] or args.sep or ""):gsub('"', '')
local idx = tonumber(args[3] or args.idx) or 1
local plain = (args[4] or args.plain or "true"):sub(1,1)
plain = (plain ~= "f" and plain ~= "n" and plain ~= "0")
local splittbl = mw.text.split( txt, sep, plain )
if idx < 0 then idx = #splittbl + idx + 1 end
return splittbl[idx]
end
-- val2percent scans through a string, passed as either the first unnamed parameter or |txt=
-- it converts each number it finds into a percentage and returns the resultant string.
p.val2percent = function(frame)
local args = frame.args
if not(args[1] or args.txt) then args = frame:getParent().args end
local txt = mw.text.trim(args[1] or args.txt or "")
if txt == "" then return nil end
local function v2p (x)
x = (tonumber(x) or 0) * 100
if x == math.floor(x) then x = math.floor(x) end
return x .. "%"
end
txt = txt:gsub("%d[%d%.]*", v2p) -- store just the string
return txt
end
-- one2a scans through a string, passed as either the first unnamed parameter or |txt=
-- it converts each occurrence of 'one ' into either 'a ' or 'an ' and returns the resultant string.
p.one2a = function(frame)
local args = frame.args
if not(args[1] or args.txt) then args = frame:getParent().args end
local txt = mw.text.trim(args[1] or args.txt or "")
if txt == "" then return nil end
txt = txt:gsub(" one ", " a "):gsub("^one", "a"):gsub("One ", "A "):gsub("a ([aeiou])", "an %1"):gsub("A ([aeiou])", "An %1")
return txt
end
-- findpagetext returns the position of a piece of text in a page
-- First positional parameter or |text is the search text
-- Optional parameter |title is the page title, defaults to current page
-- Optional parameter |plain is either true for plain search (default) or false for Lua pattern search
-- Optional parameter |nomatch is the return value when no match is found; default is nil
p._findpagetext = function(args)
-- process parameters
local nomatch = args.nomatch or ""
if nomatch == "" then nomatch = nil end
--
local text = mw.text.trim(args[1] or args.text or "")
if text == "" then return nil end
--
local title = args.title or ""
local titleobj
if title == "" then
titleobj = mw.title.getCurrentTitle()
else
titleobj = mw.title.new(title)
end
--
local plain = args.plain or ""
if plain:sub(1, 1) == "f" then plain = false else plain = true end
-- get the page content and look for 'text' - return position or nomatch
content = titleobj:getContent()
return mw.ustring.find(content, text, 1, plain) or nomatch -- returns multiple values
end
p.findpagetext = function(frame)
local args = frame.args
local pargs = frame:getParent().args
for k, v in pairs(pargs) do
args[k] = v
end
if not (args[1] or args.text) then return nil end
-- just the first value
return (p._findpagetext(args))
end
return p