Wikimedia blog

News from the Wikimedia Foundation and about the Wikimedia movement

Internationalization and localization

Getting ready for ULS everywhere

The Wikimedia Language Engineering team recently completed their latest development sprint, with a special focus on preparing for the upcoming deployment of the Universal Language Selector (ULS) extension on multiple wikis. The team also hosted a ULS-specific office hour on May 8, 2013 (logs).

ULS deployment prep

The Language Engineering team is working on refining several important features of the Universal Language Selector. This extension will provide an umbrella of services including selection of UI language, input tools and fonts. ULS will superannuate Narayam and Webfonts to provide a unified solution for configuring language settings for MediaWiki. During this development sprint, critical bugs related to positioning of ULS’ activation area and its “cog icon” label were fixed. These affected multiple MediaWiki skins and interlanguage wiki pages. The improved version will be deployed over several phases. More information about the upcoming deployment can be found in the deployment schedule.

ULS testing

ULS features are to be verified based on the test scenarios identified. These scenarios, based on the Cucumber framework, can be adapted for automatic as well as manual testing. The scenarios cover core features of ULS: triggers, language settings panel, display settings, font selection and input tools selection. These have been written in a simple “Given-When-Then” format and provide the steps for easy walkthroughs. The testing instance hosts all the latest updates that are being made. The team is looking for volunteers who can help us with testing and reporting bugs. Let us know if you would like to join and help (write to runa at wikimedia dot org or ping us on #mediawiki-i18n) .

What’s next

The team will be completing all feature changes and testing them by end of the current sprint to be ready for kicking-off the roll-out of phase 1 of ULS. Roll-out will be coordinated by Niklas Laxström with administrators of all scheduled wikis. The team will also be hosting a bug triage session on May 29, 2013 on freenode.net IRC on the #mediawiki-i18n channel.

ULS is live on Commons!

Meanwhile, based on consensus reached by the Commons community, Universal Language Selector and the Translate extensions have been enabled on Commons.

For more details about the Language Engineering projects and ways to participate, please write to me [runa at wikimedia dot org] or ping us on #mediawiki-i18n.

Runa Bhattacharjee, Outreach and QA coordinator, Language Engineering

Updates from Language engineering: changes to the Language Selector, new Extension Bundle release

In the recently concluded development sprint, the Wikimedia Language Engineering team made a new release of the Mediawiki Language Extension Bundle (MLEB), fixed bugs related to the Page Translation feature in Translate UX (TUX) and began work on design changes for the Universal Language Selector (ULS). The team also hosted a bug triage session that was well attended.

Input Settings from the ULS Language Settings Panel

Universal Language Selector Design Changes

Development and design changes have been initiated for the Universal Language Selector. The option to position the extension’s main panel in the sidebar was added and this feature is now being polished. Changes to the layout of the Language Settings dialog have been initiated, and usability tests for the proposed design changes were also done.

Using Wikimedia’s default GeoIP locator, ULS can now infer the user’s location and suggest language preferences.

MLEB Release

The April release for the Mediawiki Language Extension Bundle (MLEB) was announced by Amir Aharoni. Starting with this release, MLEB is no longer compatible with MediaWiki 1.19. MLEB 2013.04 and its later versions can only be used with MediaWiki version 1.20.4 or above.

The notable changes include update to CLDR v.23, bug fixes to further stabilize TUX and design changes for the Universal Language Selector. An experimental feature to present a restricted translation environment for new translators was developed for TUX. This is not enabled by default. Basic support for the XLIFF file format has also been added to Translate.

Up Next

During the next development cycle, the team will complete the changes to the Universal Language Selector design and test the features. The team is also participating in Google Summer of Code (GSoC) and the Outreach Program for Women (OPW), and will be working on completing the tasks in the next stages of the programs. More information about the other open projects for internationalization can be found in the master list.

The next Language Engineering office hour will be held on 8 May 2013 at 17:00 UTC (10:00 PDT) in #wikimedia-office on Freenode IRC.

Runa Bhattacharjee, Outreach and QA coordinator, Language Engineering

Apply for an internship with the Language engineering team

Quim Gil, the Wikimedia Foundation’s Technical Contributor Coordinator, recently wrote about internship programs that the Wikimedia tech community participates in. These programs provide a valuable platform for a diverse group of contributors and nurture deeper collaboration across open source communities. He also shared details about participating in Google Summer of Code (GSoC) and Outreach Program for Women (OPW) for Wikimedia projects.

The Wikimedia Language Engineering team welcomes students to participate in the projects listed for Google Summer of Code and those listed for the Outreach Program for Women. The projects listed aim to resolve shortcomings or enhance various language tools that the team maintains; they include:

  • improving the jQuery.ime input method library;
  • building browser extensions for stand-alone operation of input methods;
  • creating a dashboard for language coverage information;
  • converting legacy wiki content into translatable entries.

Providing support for nearly 300 languages is no easy feat. There is constant demand for enhancements of tools, and this demand is only expected to grow. The team constantly encourages volunteers including students, language community members and others, to work with them on internationalization challenges. This includes various components like Translate UX (TUX) and Project Milkshake, in which participants can:

  • increase coverage of input methods and font library;
  • improve language rules for the internationalization library;
  • test and prepare validation tools;
  • test and enhance the translation tool;
  • write documents.

They can also contribute by building extensions like SpellingApi and LocalisationUpdate, or even creating usable multi-lingual CAPTCHAs.

Open projects are also added to the master list maintained for all mentorship programs. After ascertaining the availability of mentors, participants can collaborate on a project of interest. If no mentors are listed, students can ask the team on  #mediawiki-i18n (Freenode IRC) or write to me (runa at wikimedia dot org) for more information.

We look forward to all the exciting proposals for our projects for Google Summer of Code and Outreach Program for Women. Student applications close on May 3rd and May 1st respectively.  Time is short — apply now!

Runa Bhattacharjee, Outreach and QA coordinator, Language Engineering

Join the Language Mavens!

Among the Wikimedia projects, Wikipedia has the highest number of individual language projects — 285. The Language Engineering team focuses on building language tools and assets that improve the ability to interact with any article on Wikipedia. Language assets like fonts and input methods are integrated into MediaWiki and its extensions, and our wikis are localized using collaborative translation with translation tools to ensure a decent user experience.

Collaboration in Language Projects and the Language Maven Program

Language Engineering community meetup during GNUnify 2013 at Pune, India

Language tools are constantly evolving to ensure support for our users. It is a slow if not impossible task to scale our small engineering team to support hundreds of languages without close collaboration with our language communities, which have many capable and technically-savvy editors and administrators.

The Wikimedia Language Engineering team has compiled a proposal for the formation of a special interest group named the Language Mavens. With members from various language communities from around the globe, we hope to learn from our users, seek advice, guidance and validation on language features. We hope that the Language Mavens will pull in participation from community members and experts who care about language support features and their adoption in the wikis they read and contribute to.

Getting started with the Maven Program

The Language Maven pilot was rolled out earlier this month on April 13 with a meeting that was well attended. Program scope and activities were discussed. One of the recommendations was to ensure that documents and handy checklists be prepared for easy reference to the language tools available to each language community. Activities that the Mavens can participate in include usability tests, bug triages, testing days and even blogging to share valuable insights about the internationalization tools in their favorite language wiki projects.

The Mavens program is aimed to focus on collecting feedback and providing support for language tools and assets being deployed by the team. This will help develop a long-term user group that will be instrumental in helping other language community members learn more about the latest language features and tools being rolled out. The Maven team expects to meet once every month and communicate through the mediawiki-i18n mailing list. To participate as a Language Maven, please fill up this form to let us know about your interest or ping me (runa at wikimedia dot org) for any questions!

Help us make your language experience better — join the Mavens!

Runa Bhattacharjee, Outreach and QA coordinator, Language Engineering

A new translation home page, the Maven Program and other updates from Language Engineering

The Wikimedia Language Engineering team continued to develop additional features for Translate UX (TUX) and to fix bugs, including a number of critical bugs that affected Microsoft Internet Explorer. TUX has now been deployed on Wikimedia sites and on translatewiki.net. The team also launched the new outreach program, called Language Mavens, and conducted an office hour.

The view after the user logs in to the new home page of translatewiki.net. This is currently in development.

The view after the user logs in to the new home page of translatewiki.net. This is currently in development.

Redesigning the main page of translatewiki.net

As part of the original specifications for TUX, the translatewiki.net main page is being redesigned. The layout will now include a set of tiles for the project groups that will contain the projects’ logo and statistics. The links conveniently placed on the tiles can be used to access the project messages for translation and proofreading. Once logged in, users will be presented with summarized statistics of their recent activities.

Significant changes are also planned for the account sign-up form. Along with features that will allow newly signed-up users to familiarize themselves with the translation workflow, it’ll make the process of getting the translation permissions smoother. The page will also include artwork to depict cultural diversity from around the world. The main page is currently under preparation, but the special page can already be accessed on translatewiki.net.

Language Mavens: The new outreach program

Language Mavens, the new outreach program for the Language Engineering team, was launched to provide better support to the various language communities within Wikimedia projects. The program was announced during the monthly office-hour (read the logs). After a meeting with the initial group of participants, the pilot phase is now underway. The program aims to regularly connect with the language communities across the Wikimedia projects to get feedback about the user experience and any improvements needed in the language support features. To optimize the internationalization tools and workflows, the program will also facilitate participation in activities such as bug triages and testing days. The team will hold regular meetings with the Maven team members. The program is open for participation, and new members can sign up online. To get more details about the program, you can read the program’s description and write to runa at wikimedia dot org.

Up next

For the next development sprints, the team will continue to work on the projects mentioned above, and also devote significant efforts on redesigning the Universal Language Selector (ULS) interface based on the design review document. The Language Engineering team will host a community bug triage session on April 24th, 2013 at 17:00 UTC. The team also invites students participating in Google’s Summer of Code 2013, to work with them on several internationalization projects.

To know more about our projects and ways to participate, please find us on the IRC channel #mediawiki-i18n (Freenode) or write to runa at wikimedia dot org.

Runa Bhattacharjee, Outreach and QA coordinator, Language Engineering

Language Engineering Sprint Update: Translation User Experience improvements, testing and coverage

The Wikimedia Language Engineering team completed its recent development sprints with a focus on feature completeness of ‘Translate User eXperience’ or ‘TUX’ for deployment and also made preparations for its new community outreach project. Below are some of the highlights from the sprint.

Work is underway on the design of the new main page for translatewiki.net

Work is underway on the design of the new main page for translatewiki.net

 

Translate Editor – TUX, has been deployed: The Language Engineering team has been blogging on improvements to the Translate Extension which has now been deployed to WMF sites and been enabled as the default editor on translatewiki.net. Issues encountered while using the new editor can be reported via Bugzilla. More details about TUX’s design features can be found in our previous blog post.

New Language Outreach Program: A new outreach and support program to facilitate feedback from our language wiki projects is also currently in development. The program would facilitate focused feedback about the tools being developed and reaching out with solutions to the larger communities working on Wikimedia projects in various languages. The pilot phase is scheduled for launch in the coming weeks and an announcement with the details of the program will be shortly made.

Updates to the MediaWiki Language Extension Bundle (MLEB): In other news, Amir Aharoni announced the release of the new version of the MediaWiki Language Extension Bundle (MLEB). Besides development updates to TUX, version 2013.3 of MLEB includes bug fixes to the Universal Language Selector (ULS) as well as new fonts for Hebrew, Javanese and Arabic. Last but not least, a Marshallese input method has been contributed to jQuery.ime by Nick Doiron.

Testing plan for language tools: A preliminary outline of a testing plan for all internationalization tools currently maintained by the team was completed. Guided by earlier discussions with the Fedora Localization Testing Group (FLTG), the initial draft includes setup of test environment, preparing test scenarios, collecting feedback and analyzing results.

Preparations for visualization of language coverage information: Work also continued on the Language Coverage Matrix, a collection of data about the availability of language tools for different languages in Wikimedia projects. Currently, the focus is to prepare a technical specification for automated presentation and access to this information.

Event participation and other news: Siebrand Mazeland represented the Wikimedia Language Engineering team at the Internationalization and Localization Conference organized by Lingoport in Santa Clara. (More details: presentation slides, and talk recording)

The ongoing development sprint will focus on creating a new design for the translatewiki.net homepage, launch of the pilot phase for Language support outreach program, and publication of the Language Engineering roadmap for the next fiscal year 2013-2014. The Language Engineering team is also looking to hire JavaScript and PHP engineers with deep experience in i18n and l10n technologies. Additionally, a few of the projects that are open for participation have been listed here.

The Language Engineering team is available to answer any technical questions you may have about the tools it develops. You can join us at our monthly office hour scheduled this month on April 10, 1700 UTC and 1000 PDT or find us on irc.freenode.net at #mediawiki-i18n. Logs from the last office hour held on March 13, 2013 can be found here.

Runa Bhattacharjee, Outreach and QA coordinator, Language Engineering

Redesigning the Translation experience: An overview

The Wikimedia Language Engineering team has been regularly reporting updates about improvements to the Translate editor, as part of the “Translate User eXperience” project, or “TUX”. Pau Giner, the team’s UX expert, has also conducted online sessions to talk about these features. If you have missed these updates, here is a summary of what we are changing about the way the Translate editor is used.

Translate UX main editor screen with Spanish translations in List view

The main editor screen of Translate’s new version, with Spanish translations in List view.

Translate is a MediaWiki extension that is used for translating software and wiki pages. Besides providing translations through the web-based editor and proofreading features, it also supports export and import of gettext files for offline translation. The editor provides various features to assist in translation, such as:

  • Message documentation, also known as “context”;
  • Suggestions from translation memory and machine translation;
  • Checking translations for common syntax mistakes;
  • Translation status of messages.

Originally created by Niklas Laxström, this extension has grown in features through contributions made by other contributors, as well as by the Wikimedia Language Engineering team. The extension uses a continuous development model and, if you use the extension on a wiki you administer, you are encouraged to update it periodically using the MediaWiki Language Extension Bundle (MLEB).

The workflow and features for Translate were recently redesigned to provide users with an improved experience. The development was done based upon the designs in the workflow specification document. This included changes in navigation, editor look and feel, translation area, filters, search, and color & style. Here are some of the notable new features and changes:

Editing Modes: The translation editor will now provide two translation modes and one proofreading mode. For translation, the user will be able to choose between the ‘List’ view, more suitable for smaller messages, or the ‘Page’ view, designed for longer pieces of text like paragraphs of a wiki page. The proofreading mode will allow users to view translations by other users and mark their accuracy. Although users can view the messages translated by themselves in this mode, they cannot mark them as accepted.

Message status-based filtering: Users will have the option to select and only view messages that match a filter, depending on their status. In the editor, users can choose an appropriate filter to quickly access ‘Translated’, ‘Outdated’ or ‘Untranslated’ messages in translation mode, and ‘Translated’, ‘Outdated’ and ‘Unproofread’ in proofreading mode. Translations marked as ‘Outdated’ (equal to the jargon term “fuzzy”) need attention, for example because the source message has changed.

Message editor and translation aids: The messages in focus are shown within an editing area that is divided into two separate sections: one for translation, and the other for translation helpers, like context documentation, suggestions from previous translation and external translation services. The layout aims to make optimal use of available space and also provides users with the additional option to focus better on a message by expanding the size of the editing area to the entire width of the editor. The navigation to the next message, the ability to save drafts and the display of warnings make the translation process more fluent. Development of some exciting features for improving context-related translation aids is also on the cards.

Search and edits: Users can search translatable strings using the search field at the top of the edit section. The search results are displayed within various categories like ‘source’ or ‘translated’ messages. An additional overview displays the languages and message groups where they occur and users can further filter them based on the sub-groups. Users will be able to directly go into ‘Translation mode’ to make changes to the messages in the search results. A navigation arrow can bring them back to the list of results.

Not all of these features are available on Wikimedia wikis yet, but they will be soon. The current development version is available on translatewiki.net. If the new editor is not visible, appending “&tux=1” to the URL will enable the new features. Appending “&tux=0” will disable them.

While redesigning Translate’s User experience has been a significant project, development is continuously carried out to make the extension even better to use. And for this, we are always looking for valuable feedback from our users. Bugs and features requests can be filed through bugzilla; additionally, one can write to me at runa at wikimedia dot org with their feedback and suggestions.

Runa Bhattacharjee, Outreach and QA coordinator, Language Engineering

New release of the MediaWiki Language Extension Bundle, and other updates

Highlights from the latest development sprint of the Language Engineering team include the release of a new version of the MediaWiki Language Extension Bundle, and continued progress on Translation User Experience (UX) and the Language Coverage Matrix.

Screenshot for the redesigned proofread view for the Translate extension showing translations in Georgian.

Screenshot of the redesigned proofread view for the Translate extension showing translations in Georgian.

Design and development improvements continued for Translate UX, also known as TUX. A preliminary implementation of the Proofreading feature (per the specifications in the design document) includes features to view the messages adjacently, adding clickable markers for proofreading and switching between proofreading and translation mode. Pau Giner presented these updates at an open session and also invited users to join the ongoing usability tests.

Amir Aharoni announced the release of MediaWiki Language Extension Bundle (MLEB) 2013.02. Besides localization updates in most of the components within MLEB, more features were added to Translate UX. The Universal Language Selector however had to be rolled back to the 2012.12 version to ensure compatibility with MediaWiki 1.20.

The Language coverage matrix document was updated to include more information about web fonts and input methods that are currently available for use in MediaWiki and Wikimedia projects. The document aims to provide an overview of the internationalization and localization support in languages across Wikimedia projects.

As part of the ongoing effort to use a CLDR-based, data-driven approach for internationalization features, plural rules for many languages were analyzed and custom rules were removed for a few languages.

The Language Engineering team will be hosting an IRC office hour session on Wednesday, March 13 2013 on in #wikimedia-office (FreeNode server) at 17:00 UTC. Topics will include discussion, questions, feedback about current projects, open bugs and projects planned for the next sprint.

Runa Bhattacharjee, Outreach and QA coordinator, Language Engineering

Language engineers improve translation tool and meet with their peers

Quem não arrisca não petisca — a Portuguese proverb

During their latest development sprint, the Wikimedia Language Engineering team conducted extensive review and testing of the Translate extension, and participated and contributed to two major open source events in India: a core developers Language Summit and GNUnify.

add caption here

User experience improvements to the Translate tool will notably make it easier and more pleasant to translate content on Wikimedia sites that use it.

Translate Editor Updates

Progress continued on enhancements to the MediaWiki Translate extension. Further testing on the usability of the translation editor, search feature, and prototype of the advanced editing features were conducted by Pau Giner with five users from four different countries. The prototypes were tested in a great diversity of languages including Nepali, Chinese, Tetum, French, Breton, and Finnish. Based on this feedback, changes to the style and specifications for the prototype were made. Details about the individual tests can be found in the final report for this round of testing.

Community Participation

The Language Engineering team participated in the Open Source Language Summit and GNUnify, both held in Pune, India. The Open Source Language Summit, co-organized by the Wikimedia Foundation and Red Hat, consisted of work-sprints that focused on internationalization (i18n) and localization (l10n) features, font support, input method tools, language search, i18n testing methods and standards. More information about the event is available in the detailed event report.

The team also participated in GNUnify 2013, held at the Symbiosis Institute of Computer Studies and Research, in Pune. Besides presenting about the various projects that the team is currently working on, a translation sprint on translatewiki.net was also organized, as well as a workshop on jQuery.IME and a BoF session to discuss issues related to Wikimedia projects in Indian languages. Details of the accomplishments from the sessions at GNUnify 2013 can be found in the event report.

Other Achievements

Additionally, some changes to MediaWiki core were backported to support the newer version of the Universal Language Selector on MediaWiki versions 1.19 and 1.20. As there is no released maintenance version yet, MediaWiki Language Extension Bundle (MLEB) users are advised to remain on MLEB version 2012.12.

Focus for the next sprint

For the next development sprint, the team will work on more features for the Translate extension, like the proofreading mode and further improving the user experience. In addition to this, focus will be on putting together the language coverage matrix as a reference for the status of language support on MediaWiki, MediaWiki Extensions and Wikimedia projects.

Runa Bhattacharjee, Outreach and QA coordinator, Language Engineering

Language Engineering team participates in GNUnify 2013

Det vatten du hämtar ur bäcken lär dig känna källan – a Swedish Proverb.

GNUnify is an annual gathering consisting of workshops, talks & seminars, held to help increase the awareness of free and open-source software in India.

GNUnify is an annual gathering consisting of workshops, talks & seminars, held to help increase the awareness of free and open-source software in India.

The Wikimedia Language Engineering team participated in GNUnify 2013 held in Pune, India on February 15–17. The team presented their work, conducted a translation sprint, organized workshops and also participated in discussions with local Wikipedians about using MediaWiki and Wikimedia projects in their languages.

Presentations by the team

Runa Bhattacharjee presented about the changing dynamics in the adoption of localized content and the need for developing tools that facilitate new demands. She introduced the projects that the Language Engineering team is working on. Siebrand Mazeland and Niklas Laxström gave a walkthrough of the MediaWiki Translate extension and the translatewiki.net platform, and showcased the new design and features of the updated translation editor.

Santhosh Thottingal presented how the jQuery libraries of Project Milkshake can be used to prepare multilingual web applications for internationalization; he also presented a tutorial on their use. Amir Aharoni demonstrated the easy use of the input methods provided by the jQuery.IME library and how to contribute using phonetic keymaps. He encouraged use in web applications of the currently more than 140 input methods of the library. Yuvaraj Pandian demonstrated how he ported jQuery.IME for use in Android devices.

Alolita Sharma spoke about technologies and tools that help contributing to Wikipedia in various languages. She highlighted the need for features and tools to support non-English Wikipedias and the solutions that the Language Engineering team is developing that would help eliminate fundamental hindrances that contributors face while trying to create content for Indian languages. She also spoke about the other Wikimedia projects that are open for participation.

Workshops

Amir Aharoni conducted a workshop on the jQuery.IME library, in which he demonstrated the procedure to add a new input method and submit it for inclusion on GitHub. A two-hour translation sprint was conducted in which almost 40 participants translated various projects hosted on translatewiki.net. At the end of the session, more than 1000 completed translations were logged and prizes were distributed for the most significant contributions. Yuvaraj Pandian, Sucheta Ghoshal and Harsh Kothari conducted a workshop on building MediaWiki gadgets. Participants were introduced to the process of creating gadgets using JavaScript and CSS, and making them available for other users.

Language Engineering BOF session

The Language Engineering team also organized a session to discuss technical issues related to Wikimedia projects in Indian languages, which was attended by local Wikipedians. Issues related to following up on internationalization and localization bugs and building local technical user groups were discussed.

To conclude, participation in open source conferences such as GNUnify helps get more open source developers as well as language Wikipedians aware of the latest tools that the Language Engineering team is developing which they can use as well as receive direct feedback from the global communities the team serves.

More information can be found in the detailed report.

Runa Bhattacharjee, Outreach and QA coordinator, Language Engineering