Wikipedia Survey First Results




Working Draft
Current Version: 0.3
Date: 9 April 2009

Ruediger Glott (UNU-MERIT), glott@merit.unu.ed
Philipp Schmidt (UNU-MERIT), schmidt@merit.unu.edu
Rishab Ghosh (UNU-MERIT)

This file contains ongoing work in progress. We will periodically add further analysis of the Wikipedia survey data to it. This document is not suitable for citation. Do not distribute it without clearly indicating that it represents preliminary results that might be updated in the future. Please contact us if you have any questions.

QUESTIONS DEALT WITH IN THIS REPORT (numbering refers to questionnaire)

Question A1 - Gender
Question A2 - Age
Question A7 - Tertiary education
Question C14 - If you do not donate, why not?
Question D1A - Why don't you contribute to Wikipedia?
Question D1B - I would be much likelier to contribute if...

QUESTIONS OF HIGH INTEREST

Question A4 - Country
Question A9 - Has partner
Question A10 - Has children
Question B13 - How many hours?
Question B17 - Why do you contribute?

Data Overview

The following sections present first results of the first global Wikipedia Survey, based on a cleaned database (i.e. with erroneous entries removed, such as extremely old or young ages or evidently inconsistent combinations of responses, for example holding a PhD at 14 years of age). For the purpose of this first data analysis we have also removed the responses to the Russian version of the survey, because it is still unclear why this group of Wikipedians is overrepresented. These data will be re-integrated and analysed once we have evidence that the Russian survey has not been manipulated (so far we have found no indication of manipulation, e.g. different records from the same IP address or similar answering patterns, but this examination is still ongoing). Overall, the database used for this first overview contains 130,576 cases.

Wikipedia Language Versions

Table 1 shows the relative shares of the different language versions in the total response. The lion's share of responses was provided by users of the English Wikipedia version, followed by the German and Spanish versions.

+-------+--------+---------+
|     N | Lang   |       % |
+-------+--------+---------+
| 43912 | en     | 33.6295 |
| 22989 | de     | 17.6058 |
| 20144 | es     | 15.4270 |
|  8706 | nl     |  6.6674 |
|  6155 | pt     |  4.7137 |
|  5082 | ja     |  3.8920 |
|  3613 | zhhant |  2.7670 |
|  3569 | fr     |  2.7333 |
|  3516 | it     |  2.6927 |
|  3427 | zhhans |  2.6245 |
|  2907 | pl     |  2.2263 |
|  2319 | cs     |  1.7760 |
|  1567 | ar     |  1.2001 |
|   969 | th     |  0.7421 |
|   598 | vi     |  0.4580 |
|   562 | ca     |  0.4304 |
|   213 | id     |  0.1631 |
|   154 | ta     |  0.1179 |
|   113 | el     |  0.0865 |
|    47 | eo     |  0.0360 |
|    14 | af     |  0.0107 |
+-------+--------+---------+
N = 130576

Table 1: Respondents by Wikipedia language version
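The percentage column of Table 1 can be recomputed from the counts. The short Python sketch below is only an illustration and is not part of the survey tooling; the names `TOTAL`, `counts` and `shares` are assumptions introduced here, and only the three largest language versions are typed in.

```python
# Illustrative sketch: recompute the "%" column of Table 1 from the "N" column.
# Only the three largest language versions are included; TOTAL is the overall
# number of cases reported for the cleaned database.
TOTAL = 130576

counts = {"en": 43912, "de": 22989, "es": 20144}

# Share of each language version in the total response, in percent.
shares = {lang: 100 * n / TOTAL for lang, n in counts.items()}

for lang, share in shares.items():
    print(f"{lang}: {share:.4f}%")
```

Extending `counts` with the remaining rows of the table would allow the same check for every language version.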

Countries

Table 2 provides an overview of the countries where the respondents live, illustrating that Wikipedia is used in at least 228 countries in the world. [1]

+-------+---------+-------+
|     N | Country |     % |
+-------+---------+-------+
| 20084 | DE      | 15.46 |
| 18413 | US      | 14.17 |
|  6842 | NL      |  5.27 |
|  6010 | BR      |  4.63 |
|  5996 | ES      |  4.61 |
|  5431 | GB      |  4.18 |
|  5128 | JP      |  3.95 |
|  4946 | MX      |  3.81 |
|  3693 | IT      |  2.84 |
|  3450 | CA      |  2.66 |
|  3416 | FR      |  2.63 |
|  3175 | AR      |  2.44 |
|  2952 | PL      |  2.27 |
|  2848 | PG      |  2.19 |
|  2531 | AU      |  1.95 |
|  2431 | CZ      |  1.87 |
|  2399 | BE      |  1.85 |
|  2252 | TW      |  1.73 |
|  2009 | AT      |  1.55 |
|  1996 | IN      |  1.54 |
|  1828 | CL      |  1.41 |
|  1690 | CO      |  1.30 |
|  1479 | SE      |  1.14 |
|  1417 | CH      |  1.09 |
|  1048 | VE      |  0.81 |
|   989 | TH      |  0.76 |
|   977 | HK      |  0.75 |
|   875 | PE      |  0.67 |
|   643 | NZ      |  0.49 |
|   627 | CN      |  0.48 |
|   575 | VN      |  0.44 |
|   561 | EG      |  0.43 |
|   530 | IE      |  0.41 |
|   499 | IL      |  0.38 |
|   477 | PT      |  0.37 |
|   440 | SA      |  0.34 |
|   437 | FI      |  0.34 |
|   379 | RO      |  0.29 |
|   368 | NO      |  0.28 |
|   343 | SG      |  0.26 |
|   331 | GR      |  0.25 |
|   312 | UY      |  0.24 |
|   295 | DK      |  0.23 |
|   284 | ID      |  0.22 |
|   275 | CR      |  0.21 |
|   265 | ZA      |  0.20 |
|   260 | PH      |  0.20 |
|   257 | MY      |  0.20 |
|   217 | EC      |  0.17 |
|   211 | DO      |  0.16 |
+-------+---------+-------+

N: 51-200
HU, AE, RU, SV, GT, TR, RS, SK, PK, HR, PR, DZ, BO, BG, PA, JO, PY, EE, MA, KW, SI, HN, LU, UA, LT, KR, BA, LK, IR, NI, MG, BH, IS

N: 11-50
PS, CI, MO, GE, SY, TN, LV, LB, BD, IQ, AD, QA, OM, TT, MT, LY, CY, CU, AL, MK, NG, MU, NP, UM, JM, KE, SD, BY, IM, YE, AW, AQ, AN, LI, MD, AX, ZW, RE, FJ, MC

N: <= 10
AO, BM, AS, BZ, VI, AF, UG, BN, BB, MV, GH, SN, AZ, DM, BS, CD, JE, AM, GP, MN, BF, KH, BW, VG, NC, BT, GI, EH, ME, ZM, PF, ET, DJ, CV, KN, VA, KY, GL, GG, CF, AG, MZ, SR, CM, BJ, BI, KZ, MM, MQ, KP, AI, GU, FK, BL, LC, FO, CX, WS, CC, TM, ML, TZ, GQ, LA, RW, TL, MP, GF, MW, MR, KI, AC, PN, VC, TO, TC, SZ, SO, GS, KG, UZ, CG, MS, CK, SM, NF, TK, GD, BV, SC, LR, NA, SB, SL, TD, NE, IO, GA, NU, WF, ER, MF, LS, PW, TG

N = 130576

Table 2: Countries

[1] Refer to ISO 3166 for country codes: http://www.iso.org/iso/english_country_names_and_code_elements

Activity Types

Since Wikipedia can be used in many different ways, the key feature for differentiating respondents, besides language and demographic characteristics, is the set of activities they perform. The first question of the questionnaire was mandatory and made it possible to distinguish between users and different types of contributors.

Type                          N        %
Reader                    81677    62.55
Occasional Contributor    31981    24.49
Regular Contributor       11992     9.18
  Author                 [8485]   [6.50]
  Editor                 [7791]   [6.97]
  Administrator          [1278]   [0.98]
Ex Contributor             3625     2.78
  No more activity       [1585]   [1.21]
Other                      1301     1.00
Total                    130576   100.00

Table 3: Detailed categories of respondents

Not surprisingly, pure readers make up the majority of the respondents (62.5%). However, the share of almost 34% of respondents who contribute actively to Wikipedia appears considerably large. Since it is impossible to quantify the universe of people using Wikipedia, or how many of them are contributors, it will be hard to evaluate whether the shares found in our survey are representative.

The largest share of contributors (24.5%) consists of respondents who consider themselves readers but occasionally contribute as editors or authors. Among the respondents, occasional contributions are the typical way of creating Wikipedia content. However, this does not imply that most of the content is created this way. This question will be examined in later parts of the analysis.

Regular contributors (authors, editors, and administrators; overlap is possible) represent about 9% of the respondents. Approximately 1.0% of the respondents are administrators.

It is noteworthy that among the respondents there are more than 3,600 persons who contributed to Wikipedia in the past but have since stopped participating actively. Since this group has a unique insight into and relationship with Wikipedia, it is analyzed separately. A remarkable share of this group (44%) checked no other activity, which seems to imply that these ex contributors no longer even consider themselves readers of Wikipedia (though this raises the question of how this group heard of the survey). Consequently, our profile of Wikipedia activity groups includes a group that explicitly pursues no activity at all (share in overall response: 1.2%).

The Other category combines respondents who specified that they were either Other non-contributors or Other contributors but did not check any of the other options. Further analysis of the explanatory text these respondents submitted is required to either group them within existing categories or analyze them separately.
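The shares quoted in this section follow directly from the counts in Table 3. The Python sketch below is an illustration only, not the authors' analysis code; the variable names are assumptions introduced here, and the bracketed sub-categories are left out of the total because they are subsets of their parent categories.

```python
# Illustrative sketch: recompute the shares quoted in the text from Table 3.
counts = {
    "Reader": 81677,
    "Occasional Contributor": 31981,
    "Regular Contributor": 11992,
    "Ex Contributor": 3625,
    "Other": 1301,
}
# The five main categories are mutually exclusive, so they sum to the total N.
total = sum(counts.values())

# Respondents who currently contribute (occasionally or regularly).
active = counts["Occasional Contributor"] + counts["Regular Contributor"]
active_share = 100 * active / total

# "No more activity" is a sub-group of Ex Contributor (bracketed in Table 3).
no_activity = 1585
no_activity_of_ex = 100 * no_activity / counts["Ex Contributor"]
no_activity_of_all = 100 * no_activity / total

print(f"active contributors: {active_share:.2f}% of all respondents")
print(f"no more activity: {no_activity_of_ex:.1f}% of ex contributors")
print(f"no more activity: {no_activity_of_all:.2f}% of all respondents")
```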

It is evident that such a detailed and uneven breakdown of activities is not useful for the further analysis of demographics and other characteristics presented in the following sections. Therefore, for the remainder of the analysis we use a breakdown that subsumes these activities in only two categories, readers (65%) and contributors (35%), excluding Ex Contributors and Others.

Type               N        %
Reader         81677    65.00
Contributor    43973    35.00
Total         125650   100.00

Table 4: Main categories

Demographics

Age

Valid responses were received from respondents between 10 and 85 years of age. Overall, the average age of the Wikipedians who participated in the survey is 25.8 years. Half of the respondents are younger than 22 years. The most frequent age among the respondents is 18 years. Splitting the respondents into four equally large age groups shows that 25% are younger than 18 years, 25% are between 18 and 22, a further 25% are between 23 and 30 (i.e. half of the respondents are between 18 and 30 years old), and the remaining 25% are between 31 and 85 years old.

Although the age difference between readers and contributors appears marginal (readers are, on average, 25.3 years old, while contributors have an average age of 26.8 years), a t-test revealed that this difference is statistically significant. Finally, female Wikipedians are significantly younger (24.2 years) than male community members (26.4 years).

Gender

Though both groups are dominated by men, there are significant differences in the gender composition of readers and contributors of Wikipedia. Contributors show a substantially larger share of males than readers.

Gender   Reader   Row %   Col %   Contributor   Row %   Col %    Total
Male      55648   59.45   68.28         37958   40.55   86.56    93606
Female    25451   81.90   31.23          5626   18.10   12.83    31077
Other       398                           266                      664
Total     81497                         43850                   125347

Row % is the share of a gender's respondents that falls into the activity group; Col % is the share of that gender within the activity group.

N = 125,347

Table 5: Gender Composition of Wikipedia Activity Groups
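The percentages attached to the gender counts can be read as row shares (distribution of one gender across the two activity groups) and column shares (gender composition within one activity group). The Python sketch below recomputes both from the counts in Table 5; it is an illustration under that reading, and the helper names `row_pct` and `col_pct` are assumptions introduced here.

```python
# Illustrative sketch: row and column percentages for the counts in Table 5.
counts = {
    "Male": {"Reader": 55648, "Contributor": 37958},
    "Female": {"Reader": 25451, "Contributor": 5626},
    "Other": {"Reader": 398, "Contributor": 266},
}
groups = ["Reader", "Contributor"]

# Totals per gender (rows) and per activity group (columns).
row_totals = {gender: sum(cells.values()) for gender, cells in counts.items()}
col_totals = {group: sum(counts[g][group] for g in counts) for group in groups}


def row_pct(gender: str, group: str) -> float:
    """Share of this gender's respondents that falls into the activity group."""
    return 100 * counts[gender][group] / row_totals[gender]


def col_pct(gender: str, group: str) -> float:
    """Share of this gender within the activity group."""
    return 100 * counts[gender][group] / col_totals[group]


for gender in ("Male", "Female"):
    for group in groups:
        print(f"{gender:6} {group:11} row {row_pct(gender, group):5.2f}%  "
              f"col {col_pct(gender, group):5.2f}%")
```

The column shares are the figures that support the statement that contributors show a larger share of males than readers.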

Education

Given the young age of Wikipedians, it is only natural that secondary and undergraduate tertiary education are the prevailing highest educational degrees among the respondents. Contributors show a slightly but significantly higher level of education than readers.

                                     Reader (%)   Contributor (%)   Total (%)
Primary education                         14.77             12.36       13.93
Secondary education                       37.69             33.45       36.20
Tertiary education - Undergraduate        29.17             28.38       28.89
Tertiary education - Masters              10.11             15.07       11.85
Tertiary education - PhD                   2.30              4.59        3.10
Other                                      5.96              6.15        6.03
Total                                    100.00            100.00      100.00

N = 124,752

Table 6: Highest Educational Degree by Wikipedia Activity Groups

Relationship Status

The partnership status of the respondents also corresponds to the age structure of the Wikipedia community, as only 30% of the respondents say they have a partner. Correspondingly, the share of Wikipedians with children is only 14%. The difference between readers and contributors in this regard is negligible.

Motivations and Incentives

Motivation to contribute

In this section we examine some incentives and disincentives to contribute to Wikipedia. On average, contributors spend 4.3 hours per week contributing to Wikipedia. There is a wide range of motivations behind people's active engagement in Wikipedia, but two motivations stand out clearly: the wish to share knowledge and the desire to fix errors.

Motivation (max 4 answers, ranked by importance, 1 = most important to 4 = least important)

Mean   Motivation
1.68   I like the idea of sharing knowledge and want to contribute to it
1.94   I saw an error I wanted to fix
2.32   I do it for professional reasons (it belongs to my professional tasks)
2.46   I want to learn new skills / acquire new knowledge
2.71   Because I think information should be freely available to everyone
2.79   Don't know/can't remember/no real reason
2.80   I want to make topics more popular that are not widely known yet
2.84   I think the Internet provides a better medium for encyclopediae than traditional media
2.86   I want to demonstrate my knowledge to a wider public / community
2.88   Because I like Wikipedia's philosophy of openness and collaboration
2.91   I saw a red link so I wrote a new article
2.96   To improve my job / career opportunities
2.99   To make the world a better place
3.03   To earn money
3.12   Because friends of mine are doing it and motivated me to join
3.15   Because I like mass collaboration/cooperation
3.18   Just to see if it is really open for anyone to edit
3.30   To gain a reputation in the Wikipedia community

Total N=43973

Table 7: Reasons why people contribute to Wikipedia

Reasons for not contributing

The attractiveness of Wikipedia is particularly visible among the non-contributors (i.e. readers). Two thirds of the respondents who do not contribute to Wikipedia have at least considered contributing (e.g. by creating a new article, editing an article, or participating in a discussion), only 26% have not considered this, and 8% replied that they don't know.

Among those who never considered contributing, the aggregate data show two main reasons and four less important factors. Not having enough information to contribute and being happy just to read Wikipedia are the two options that were checked by roughly half of the respondents. Four other options were each selected by about a quarter of the respondents.

The most frequently selected options that would make non-contributors contribute are help with identifying content areas that need input, and some form of feedback that would show how contributions would help others.
    %   Answer Option
25.18   I don't know how
12.49   I am not sufficiently comfortable with the technology
 4.72   I would never interact on the internet
18.88   Others are already doing it, there is no need for me
31.48   I don't have time
26.43   I don't feel comfortable editing other peoples' work
51.98   I don't think I have enough information to contribute
24.43   I am afraid of making a mistake and getting in trouble for it
 7.02   It's a waste of time: my edits would be reverted or overwritten
48.53   I am happy just to read it; I don't need to write it
 3.94   Other
 3.99   Don't know
 0.93   Checked none of the options

Total N=21432

Table 8: Reasons of Non-contributors for not Contributing to Wikipedia

    %   Answer Option
21.19   Someone would show me how to do it
 8.83   The technology was easier to use
15.69   I knew that other contributors would be welcoming and encouraging
42.23   I knew there were specific topic areas that needed my help
25.46   I was confident my contributions would be valued and kept
32.95   It was clear to me that other people would benefit from my efforts
31.18   Other / don't know / don't want to say

N=21432

Table 9: Factors that would make contribution more likely

Motivation to donate to the Wikimedia Foundation

In this section we examine some incentives and disincentives to donate to the Wikimedia Foundation. Regarding the reasons why members of the Wikipedia community do not donate to the Wikimedia Foundation, the age structure of the Wikipedia community is a main factor: many community members are still going to school and have no income that would allow them to donate.

% of respondents   Answer Option
           42.03   I don't know how to do that
           19.73   I did not know it is supported by a non-profit organisation
           18.88   I donate my time instead of money
           18.75   I never donate to charities
           16.34   Donations to the Wikimedia Foundation are not tax-deductible where I live
           15.54   I can't afford to make a donation
           13.18   Don't know
           11.98   I was never asked to/ wouldn't know how
            9.70   I disagree with Wikipedia's policies and practices
            3.84   It seems that enough of the other people are making donations to keep the project running
            1.90   I do not trust that my donation will be used wisely

N=130576

Table 10: Reasons not to donate money to Wikimedia

Another barrier to donating money to Wikipedia is a lack of knowledge that this opportunity exists or of how to donate. The least important reason for not donating money to Wikipedia is disagreement with Wikipedia's policies and practices.

Lack of knowledge about the legal status of Wikipedia and about how to make donations is comparatively more pronounced among readers than among contributors. Contributors emphasise only one item much more strongly than the average as a reason not to donate money to Wikipedia: they consider the time they spend contributing to Wikipedia as their donation.

Reason (multiple responses)

Readers   Contributors   All respondents   Reason
  20.66           6.26             15.54   I can't afford to make a donation
  12.39          11.31             11.98   I was never asked to/ wouldn't know how
  41.85          43.04             42.03   I don't know how to do that
   9.85          34.62             18.88   I donate my time instead of money
   3.63           4.10              3.84   It seems that enough of the other people are making donations to keep the project running
  23.75          12.84             19.73   I did not know it is supported by a non-profit organisation
   1.12           2.52              1.90   I do not trust that my donation will be used wisely
  21.37          14.25             18.75   I never donate to charities
  14.36          10.57             13.18   Don't know
  15.49          18.10             16.34   Donations to the Wikimedia Foundation are not tax-deductible where I live
  10.77           7.99              9.70   I disagree with Wikipedia's policies and practices

N=81677 (readers), N=43973 (contributors), N=130576 (all respondents)

Table 11: Reasons not to donate money to Wikimedia (by activity type)
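The statement that contributors stress one reason much more strongly than the average can be checked against Table 11 by subtracting the all-respondents percentage from the contributors' percentage for each reason. The Python sketch below is an illustration only; the names `table_11`, `gaps` and `top_reason` are assumptions introduced here, and a simple difference in percentage points is just one possible way to express "emphasis".

```python
# Illustrative sketch: which reason do contributors select more often than
# respondents overall? Values are (contributors %, all respondents %) from Table 11.
table_11 = {
    "I can't afford to make a donation": (6.26, 15.54),
    "I was never asked to/ wouldn't know how": (11.31, 11.98),
    "I don't know how to do that": (43.04, 42.03),
    "I donate my time instead of money": (34.62, 18.88),
    "Enough other people are making donations": (4.10, 3.84),
    "I did not know it is supported by a non-profit organisation": (12.84, 19.73),
    "I do not trust that my donation will be used wisely": (2.52, 1.90),
    "I never donate to charities": (14.25, 18.75),
    "Don't know": (10.57, 13.18),
    "Donations are not tax-deductible where I live": (18.10, 16.34),
    "I disagree with Wikipedia's policies and practices": (7.99, 9.70),
}

# Gap in percentage points: positive means contributors pick it more often.
gaps = {
    reason: round(contrib - overall, 2)
    for reason, (contrib, overall) in table_11.items()
}

top_reason = max(gaps, key=gaps.get)

for reason, gap in sorted(gaps.items(), key=lambda item: item[1], reverse=True):
    print(f"{gap:+6.2f}  {reason}")
```

Two of the reason labels are shortened in the dictionary keys for readability; the percentages are unchanged.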