Jump to content

MediaWiki talk:Usernameblacklist: Difference between revisions

Page contents not supported in other languages.
From Wikipedia, the free encyclopedia
Content deleted Content added
Line 126: Line 126:
::If it's the very same character or sequence, 10 times is a huge number, why not 4 or 5? —[[User:Centrx|Centrx]]→[[User talk:Centrx|''talk'']] • 22:45, 29 March 2007 (UTC)
::If it's the very same character or sequence, 10 times is a huge number, why not 4 or 5? —[[User:Centrx|Centrx]]→[[User talk:Centrx|''talk'']] • 22:45, 29 March 2007 (UTC)
::Difficult I may say, as we really don't know what group to match. <sub>→[[User:AzaToth|<span style="color:#773">Aza</span>]][[User_talk:AzaToth|<span style="color:#359">Toth</span>]]</sub> 22:46, 29 March 2007 (UTC)
::Difficult I may say, as we really don't know what group to match. <sub>→[[User:AzaToth|<span style="color:#773">Aza</span>]][[User_talk:AzaToth|<span style="color:#359">Toth</span>]]</sub> 22:46, 29 March 2007 (UTC)

==For future requests==
Once the basic list has been established, does anyone think that it's worth it to create the page [[MediaWiki talk:Usernameblacklist/Requests for addition and removal]]? Thus, if there are any false positives, we can make a piped link to that page from [[MediaWiki:Blacklistedusernametext]], so that newbies can say "Hey, I have [insert perfectly acceptable name]" somewhere. Or if there's an influx of vandals with a certain offensive username pattern, it can be stopped. (Although the blacklist should not be used as a temporary solution to anything, I think.) [[User:Gracenotes|<font color="#960">Grace</font><font color="#000">notes</font>]]<sup>[[User talk:Gracenotes|<font color="#960">T</font>]]</sup> § 22:55, 29 March 2007 (UTC)

Revision as of 22:55, 29 March 2007

Be careful about names listed here that may prohibit unintended usernames. For example, "EmbarassedMonkey" because it matches "ass".

We've got an entire article on this situation: see Scunthorpe Problem. --Carnildo 18:50, 29 March 2007 (UTC)[reply]

Documentation: mw:Extension:Username_Blacklist



Working?

I'm not having much luck using this page (see my most recent username creations). If you have a chance, take a look, I must be doing something completely wrong. See here for documentation. alphachimp 05:24, 29 March 2007 (UTC)[reply]

Doesn't appear to work. --Jeffrey O. Gustafson - Shazaam! - <*> 05:35, 29 March 2007 (UTC)[reply]
As an admin, you can override it, but as anon, it works. AzaToth 17:26, 29 March 2007 (UTC)[reply]
Just tested this and confirmed it. alphachimp 17:31, 29 March 2007 (UTC)[reply]
Hmm... WP:POINT? :-P Maybe MediaWiki needs some time to refresh the content of this list. -- ReyBrujo 05:38, 29 March 2007 (UTC)[reply]

Proposed edits

The list we have here is great, but it doesn't take advantage of... shall we say, the magic of regexes. I suggest the following list, and also suggest that someone who has worked with regexes for longer than I have take a look before implementing this. This blacklist is extremely powerful, in the sense that it allows no exceptions. I have them numbered here, but that should be changed to bullets:

  1. \bass?!(ess|oc|yr|em|is)
    Perhaps add the British spelling, but similar provisions (for assess, association, assyrian, assembly, assistance, etc.) would have to be made. Either that, or we could take the approach of only filtering out certain permutations -- when followed by nothing, "hole", "hat" (maybe), etc. GracenotesT § 19:49, 29 March 2007 (UTC)[reply]
    "ass" is just too short and simple. Even with \bass\b it could be appropriate to appear some where ("Bob the ass assin"?) —Centrxtalk • 21:37, 29 March 2007 (UTC)[reply]
    That name might be blocked for containing sexual innuendo, even unintentional. With what's here, "assassin" would not be allowed, but that can be fixed by adding "|assin". GracenotesT § 21:48, 29 March 2007 (UTC)[reply]
    There are many acceptable usernames that might contain the string "ass", for any letter of the alphabet that might follow, enough that \bass\b would be the only possibly appropriate blacklist entry. —Centrxtalk • 21:56, 29 March 2007 (UTC)[reply]
    Hm. As a compromise, how about \bass(hole|hat|es)?\b ? GracenotesT § 22:01, 29 March 2007 (UTC)[reply]
    That seems pretty safe. —Centrxtalk • 22:17, 29 March 2007 (UTC)[reply]
  2. fu[c(]k
    sh[ia]t - William Shatner :) --Conti| 19:38, 29 March 2007 (UTC)[reply]
    \bsh[ia]t(ter|ting|e)?\b GracenotesT § 19:47, 29 March 2007 (UTC)[reply]
    Gah, that means that "shatter" won't work. Getting rid of [ia], and replacing it with "i", may be good enough. Failing that, there can just be two items. GracenotesT § 20:15, 29 March 2007 (UTC)[reply]
    "Shatter" is a perfectly fine word, anyways. ;) --Conti| 20:25, 29 March 2007 (UTC)[reply]
    \ban(us|al) - Analog, analyze.. -- Conti| 19:38, 29 March 2007 (UTC)[reply]
    \ban(us|al)\b -- GracenotesT § 19:47, 29 March 2007 (UTC)[reply]
    Well, there's always the expression "Don't be anal", but I think we can live with not allowing that. --Conti| 19:49, 29 March 2007 (UTC)[reply]
  3. \ban(us|al)\b
  4. phall(us|ic)
    fag - Too much potential collateral damage. --Conti| 19:38, 29 March 2007 (UTC)[reply]
  5. faggot
  6. dick(s|head)?\b
  7. cunt
  8. slut
    suck - Mostly harmless. --Conti| 19:38, 29 March 2007 (UTC)[reply]
  9. \blub(ric|es?\b)
    cock - See this. --Conti| 19:38, 29 March 2007 (UTC)[reply]
  10. cock(s|sucker)?\b
  11. vaginal?
  12. scrotum
    dildo - Not offensive, IMHO. We should only cover clear policy violations here. --Conti| 19:38, 29 March 2007 (UTC)[reply]
    "Dildo" is currently on the list. I was thinking of adding it in order to prevent attack usernames, i.e. "Fuck Gracenotes with a dildo", or similar. Plus, if someone is thinking about sex as they create a user account... well, no solid conclusions there, but still. The username policy restricts even sexual innuendo... this is a sex toy; rather explicit. Compare this to the spam blacklist. There's only one way to link to a site, but here, there are many many usernames to choose from. GracenotesT § 20:48, 29 March 2007 (UTC)[reply]
  13. Gracenotes is not a sick individual
  14. on wheels
    colbert - Could be someone's name. --Conti| 19:38, 29 March 2007 (UTC)[reply]
    admin - (collateral - User:Padminiraman, User:BadmintonL, User:Breadmine?, User:Toadminor?, User:Dreadminus ...)
    Thank for pointing that out. The word can be isolated, then: \badmin(istrator)?\b. GracenotesT § 20:21, 29 March 2007 (UTC)[reply]
  15. \badmin(istrator)?\b
  16. banned
  17. sysop
  18. steward
  19. username policy
    idiot - User:I'm not an idiot would be allowed. --Conti| 19:38, 29 March 2007 (UTC)[reply]
    stupid - User:I'm not stupid would be allowed. --Conti| 19:38, 29 March 2007 (UTC)[reply]
  20. \bhate
  21. \.(com|org|co.uk|net|info)(\b|/)
    By the way: right now we have \.(com|org|co\.uk|net|info)\b. However, someone would still be able to register an account with the name www.somespam.com/main.php. This is why there's the (\b|/), or even [\b/]. GracenotesT § 22:21, 29 March 2007 (UTC)[reply]
    Also, I suggest these 20:52, 29 March 2007 (UTC), per WP:U#Wikipedia:
  22. Wikipedia
  23. Wikiquote
  24. Wiktionary
  25. Wikibooks
  26. Wikiversity
  27. Wikisource
  28. Wikinews

That's it... any other suggestions would be good. GracenotesT § 19:23, 29 March 2007 (UTC)[reply]

I've commented a few of those out for numerous reasons, mostly since there'd be some collateral damage. --Conti| 19:38, 29 March 2007 (UTC)[reply]
How about changing "cock" to "cock(?!er)", since "cocker" and "cockerspaniel" are legit? GracenotesT § 19:41, 29 March 2007 (UTC)[reply]
Oof. then there's cockatoo, cockle, cockroach... meh. Maybe just have it restricted to "\bcock(s|sucker)?\b", then. GracenotesT § 19:45, 29 March 2007 (UTC)[reply]
I would appreciate it if you improved them rather than removed 'em. Namely, "\ban(us|al)" could become "\ban(us|al)\b"... etc. with other items. GracenotesT § 19:42, 29 March 2007 (UTC)[reply]
Sorry, I don't know much about regular expression (but I'm learning fast :)), so I didn't knew how to correctly create such expressions. "cock(?!er)" would mean that everything that includes "cock" would be blacklisted, unless it's "cocker", right? If so, that's fine by me. Okay, it's not, just saw the other examples above. --Conti| 19:48, 29 March 2007 (UTC)[reply]
Yep! And this may of interest to you, if that's the sort of thing you're interested in. GracenotesT § 19:51, 29 March 2007 (UTC)[reply]
Thanks! Well, I'm mostly interested in creating as less collateral damage as possible with this list. ;) --Conti| 19:53, 29 March 2007 (UTC)[reply]
Me too. By the way, "cock(s|sucker)?\b" will restrict "cock", "cocks", and "cocksucker". GracenotesT § 19:54, 29 March 2007 (UTC)[reply]

bot?

Someone want to add bot to the list?--VectorPotentialTalk 20:26, 29 March 2007 (UTC)[reply]

That would be an effective way of stopping legitimate bots from getting accounts :) GracenotesT § 20:28, 29 March 2007 (UTC)[reply]
And here I was, thinking about words like "Abbot" and "botanic".. :) --Conti| 20:30, 29 March 2007 (UTC)[reply]
Ok, so I didn't think that through very far--VectorPotentialTalk 20:33, 29 March 2007 (UTC)[reply]

Good resource

Check out User:Lupin/badwords for a pretty exhaustive list of profanity. alphachimp 22:29, 29 March 2007 (UTC)[reply]

It is extremely exhaustive, except I don't think we need to restrict burp(er|ing)s? :) GracenotesT § 22:36, 29 March 2007 (UTC)[reply]
Unless people are actively creating usernames with these words, I think it would be a waste of effort (and possibly server processing time) to add them all. —Centrxtalk • 22:38, 29 March 2007 (UTC)[reply]
Heh, it's just a nice starting point, I think. alphachimp 22:40, 29 March 2007 (UTC)[reply]

Case sensitive

A note, the regex are all case sensitive. you can force a particular regex as case insensitive. Extracted from http://se.php.net/manual/en/reference.pcre.pattern.syntax.php AzaToth 20:28, 29 March 2007 (UTC)[reply]

I would assume that it was already case insensitive, as two equivalent usernames (ignoring case) can't be registered. Maybe that caveat wasn't caught, however, in the code for this extension. GracenotesT § 20:30, 29 March 2007 (UTC)[reply]
The code specifies: return count( $groups ) ? '/(' . implode( '|', $groups ) . ')/u' : false; AzaToth 20:33, 29 March 2007 (UTC)[reply]

Repetitions in usernames

How about adding this:

(.+)\1{9}

A character (or sequence of characters) repeated at least 10 times. Миша13 22:21, 29 March 2007 (UTC)[reply]

If it's the very same character or sequence, 10 times is a huge number, why not 4 or 5? —Centrxtalk • 22:45, 29 March 2007 (UTC)[reply]
Difficult I may say, as we really don't know what group to match. AzaToth 22:46, 29 March 2007 (UTC)[reply]

For future requests

Once the basic list has been established, does anyone think that it's worth it to create the page MediaWiki talk:Usernameblacklist/Requests for addition and removal? Thus, if there are any false positives, we can make a piped link to that page from MediaWiki:Blacklistedusernametext, so that newbies can say "Hey, I have [insert perfectly acceptable name]" somewhere. Or if there's an influx of vandals with a certain offensive username pattern, it can be stopped. (Although the blacklist should not be used as a temporary solution to anything, I think.) GracenotesT § 22:55, 29 March 2007 (UTC)[reply]