加入收藏 | 设为首页 | 会员中心 | 我要投稿 李大同 (https://www.lidatong.com.cn/)- 科技、建站、经验、云计算、5G、大数据,站长网!
当前位置: 首页 > 百科 > 正文

swift – 奇怪的String.unicodeScalars和CharacterSet行为

发布时间:2020-12-14 04:39:35 所属栏目:百科 来源:网络整理
导读:我正在尝试使用 Swift 3 CharacterSet来过滤字符串中的字符,但是我很早就陷入了困境. CharacterSet有一个名为contains的方法 func contains(_ member: UnicodeScalar) - Bool Test for membership of a particular UnicodeScalar in the CharacterSet. 但测
我正在尝试使用 Swift 3 CharacterSet来过滤字符串中的字符,但是我很早就陷入了困境. CharacterSet有一个名为contains的方法

func contains(_ member: UnicodeScalar) -> Bool
Test for membership of a particular UnicodeScalar in the CharacterSet.

但测试这不会产生预期的行为.

let characterSet = CharacterSet.capitalizedLetters

let capitalAString = "A"

if let capitalA = capitalAString.unicodeScalars.first {
    print("Capital A is (characterSet.contains(capitalA) ? "" : "not ")in the group of capital letters")
} else {
    print("Couldn't get the first element of capitalAString's unicode scalars")
}

我得到资本A不在大写字母组中,但我期望相反.

非常感谢.

解决方法

CharacterSet.capitalizedLetters
返回一个字符集,其中包含Unicode General Category Lt aka“Letter,titlecase”中的字符.那是
“包含大写字母后跟小写字母的连字符(例如,?,?,?和?)”(比较 Wikipedia: Unicode character property或
Unicode? Standard Annex #44 – Table 12. General_Category Values).

你可以在这里找到一个列表:Unicode Characters in the ‘Letter,Titlecase’ Category.

您也可以使用来自的代码
NSArray from NSCharacterset转储该字符的内容
组:

extension CharacterSet {
    func allCharacters() -> [Character] {
        var result: [Character] = []
        for plane: UInt8 in 0...16 where self.hasMember(inPlane: plane) {
            for unicode in UInt32(plane) << 16 ..< UInt32(plane + 1) << 16 {
                if let uniChar = UnicodeScalar(unicode),self.contains(uniChar) {
                    result.append(Character(uniChar))
                }
            }
        }
        return result
    }
}

let characterSet = CharacterSet.capitalizedLetters
print(characterSet.allCharacters())

// ["?","?","?","?","?","?","?","?","?","?","?","?","?","?","?","?","?","?","?","?","?","?","?","?","?","?","?","?","?","?","?"]

你可能想要的是CharacterSet.uppercaseLetters

Returns a character set containing the characters in Unicode General Category Lu and Lt.

(编辑:李大同)

【声明】本站内容均来自网络,其相关言论仅代表作者个人观点,不代表本站立场。若无意侵犯到您的权利,请及时与联系站长删除相关内容!

    推荐文章
      热点阅读