Evan Martin (evan) wrote in evan_tech,
Evan Martin
evan
evan_tech

two more wikipedia remarks

I was thinking about some toy projects for Wikipedia data again and I poked around some Wikipedia code. Two observations:
  1. Good'ol timwi strikes again:
    Revision 1.160 - (view) (download) (annotate) - [select for diffs]
    Sun Aug 1 01:57:26 2004 UTC (5 months, 1 week ago) by timwi
    Branch: MAIN
    Changes since 1.159: +4 -7 lines
    Diff to previous 1.159
    Since nobody commented on this, I assume all the changes are OK
    (http://mail.wikipedia.org/pipermail/wikitech-l/2004-July/011789.html)
    Use wfMsg's own replacement functionality instead of str_replace ( "$1", ... )
    What a great develoment model. :(
  2. From a recent change:
    BUG#1317 - External links in image caption not working
    Changed regexp to allow single ] inside of an image's alternative text
    I know it's against the spirit of wikipedia to actually have some careful design go into it, but they would've saved so much pain had they designed wikitext as a regular language instead of a pile of regular expressions. Since they allow HTML in entry bodies, the only way to properly display wiki data is with an HTML browser. :(
Tags: grumpy, wikipedia
Subscribe

  • dremel

    They published a paper on Dremel, my favorite previously-unpublished tool from the Google toolchest. Greg Linden discusses it: "[...] it is capable…

  • google ime

    Japanophiles might be interested to learn that Google released a Japanese IME. IME is the sort of NLP problem that Google is nearly uniquely…

  • ghc llvm

    I read this thesis on an LLVM backend for GHC, primarily because I was curious to learn more about GHC internals. The thesis serves well as an…

  • Post a new comment

    Error

    default userpic
    When you submit the form an invisible reCAPTCHA check will be performed.
    You must follow the Privacy Policy and Google Terms of use.
  • 14 comments

  • dremel

    They published a paper on Dremel, my favorite previously-unpublished tool from the Google toolchest. Greg Linden discusses it: "[...] it is capable…

  • google ime

    Japanophiles might be interested to learn that Google released a Japanese IME. IME is the sort of NLP problem that Google is nearly uniquely…

  • ghc llvm

    I read this thesis on an LLVM backend for GHC, primarily because I was curious to learn more about GHC internals. The thesis serves well as an…