Around 90 researchers listed in the Brede Wiki have both a Google Scholar identifier and a Twitter identifier associated with them. These can be extracted, and Google Scholar and Twitter can be queried to get the citation count and followers count so the Kardashian index (K-index) can be computed.
See the plot and the Kardashian index at the bottom.
The list of the included researchers is here: http://neuro.compute.dtu.dk/wiki/Category:Researchers_in_Google_Scholar_and_Twitter
Read more about the Kardashian index here: http://neuro.compute.dtu.dk/wiki/Kardashian_index
New coefficients for the power law for Google Scholar/Twitter are computed. The power law is 50.5 x citations ^ 0.28.
To run the code below you need to set up a python.cfg configuration file with the four Twitter tokens, a user-agent and a 'from' email.
from __future__ import division, print_function
try:
import ConfigParser as configparser
except ImportError:
import configparser
import copy
import json
import matplotlib
import matplotlib.pyplot as plt
import numpy as np
import os
import pandas as pd
import re
import requests
import scipy.spatial
import time
import tweepy
from urllib import quote
# Load the local configuration file holding the HTTP request identity
# ('user_agent' and 'from' under the [requests] section) and the
# Twitter OAuth tokens used further below.
config = configparser.ConfigParser()
config.read(os.path.expanduser('~/etc/python.cfg'))
# Identifying headers sent with every HTTP request to the Brede Wiki.
headers = dict([
    ('User-Agent', config.get('requests', 'user_agent')),
    ('From', config.get('requests', 'from')),
])
# Fetch names of Brede Wiki pages in the category of researchers on
# both Google Scholar and Twitter, following MediaWiki
# 'query-continue' pagination until the category is exhausted.
url_gst = ('http://neuro.compute.dtu.dk/w/api.php?'
           'action=query&format=json&list=categorymembers&'
           'cmtitle=Category:Researchers in Google Scholar and Twitter')
response = requests.get(url_gst, headers=headers).json()
pagetitles = [page['title'] for page in response['query']['categorymembers']]
while 'query-continue' in response:
    cmcontinue = response['query-continue']['categorymembers']['cmcontinue']
    # URL-quote the continuation token (consistent with the page
    # fetching below) since it may contain characters such as '|'.
    url_continue = url_gst + '&cmcontinue=' + quote(cmcontinue.encode('utf-8'))
    # Bug fix: the continuation request previously omitted the
    # identifying headers that the first request sent.
    response = requests.get(url_continue, headers=headers).json()
    pagetitles.extend([page['title']
                       for page in response['query']['categorymembers']])
print(pagetitles)
# Get researcher data from the Brede Wiki: fetch the raw wikitext of
# each page and parse the fields of its {{Researcher ...}} template
# into a dictionary.
url_pages = "http://neuro.compute.dtu.dk/w/index.php?action=raw&title="
# Match the whole {{Researcher ...}} template. Raw string so \s is a
# regex escape rather than an (invalid) Python string escape.
pattern_researcher = re.compile(r'{{Researcher(\s*?\|.*?)}}',
                                re.DOTALL | re.IGNORECASE | re.UNICODE)
# Match '| field = value' pairs inside the template body.
pattern_fields = re.compile(r'\s*\|\s*(\w+)\s*=\s*([^\|]*\w)',
                            re.DOTALL | re.UNICODE)
researchers = []
for pagetitle in pagetitles:
    response = requests.get(url_pages + quote(pagetitle.encode('utf-8')),
                            headers=headers)
    print(pagetitle)
    researcher = pattern_researcher.findall(response.text)
    if researcher:
        researchers.append(dict(pattern_fields.findall(researcher[0])))
    else:
        print("Problem with " + pagetitle)
# Notebook-style inspection of one of the parsed researchers
researchers[14]
# Google Scholar profile URL prefix; the user identifier is appended.
url_gs = 'http://scholar.google.com/citations?user='
# Request headers (same identifying values as defined above).
headers = {
    'User-Agent': config.get('requests', 'user_agent'),
    'From': config.get('requests', 'from')
}
# The citation statistics appear on the profile page in table cells
# such as: <td class="gsc_rsb_std">2537</td>
# Raw string so \d is a regex escape, not an invalid string escape.
pattern_gscount = re.compile(r'<td class="gsc_rsb_std">(\d+)</td>')
def get_google_scholar_counts(google_scholar_id):
    """Scrape citation statistics from a Google Scholar profile page.

    Returns a dict mapping the six statistics names (total and
    five-year citations, h-index and i10-index) to integers, as
    found in the statistics table of the profile page.
    """
    page = requests.get(url_gs + google_scholar_id, headers=headers)
    keys = ['citations', 'citations5', 'h-index', 'h-index5',
            'i10-index', 'i10-index5']
    values = [int(count) for count in pattern_gscount.findall(page.text)]
    return dict(zip(keys, values))

# Sanity check with Yong-Yeol Ahn's profile
get_google_scholar_counts('US7OSNgAAAAJ')
# Get data from Google Scholar for each researcher not yet fetched,
# sleeping between requests to be polite to the server.
for researcher in researchers:
    if 'citations' not in researcher:
        print(researcher['name'])
        researcher.update(
            get_google_scholar_counts(researcher['googlescholar']))
        time.sleep(5)
# Saving just in case. 'with' ensures the file handle is closed;
# the original left the handle to the garbage collector.
with open('researchers.json', 'w') as output_file:
    json.dump(researchers, output_file)
# Twitter authentication with the OAuth tokens read from the
# [twitter] section of the configuration file.
auth = tweepy.OAuthHandler(config.get('twitter', 'consumer_key'),
                           config.get('twitter', 'consumer_secret'))
auth.set_access_token(config.get('twitter', 'access_token'),
                      config.get('twitter', 'access_secret'))
# API handle used below to download data from Twitter profiles
api = tweepy.API(auth)
def get_twitter_count(twitter_id):
    """Fetch follower, friend and status counts for a Twitter user.

    Best-effort: when the profile cannot be retrieved (or lacks the
    expected fields) the problem is printed and an empty dict is
    returned, so callers can merge the result unconditionally.
    """
    try:
        profile = api.get_user(twitter_id)
        return {
            'Followers count': profile.followers_count,
            'Friends count': profile.friends_count,
            'Statuses count': profile.statuses_count,
        }
    except Exception:
        print('Problem with ' + twitter_id)
        return {}

# Testing with Finn Aarup Nielsen (fnielsen)
get_twitter_count('fnielsen')
# Download data from Twitter for every researcher
for researcher in researchers:
    researcher.update(get_twitter_count(researcher['twitter']))
    print(researcher['name'])
# Save just in case. 'with' closes the file handles that the original
# json.dump/json.load calls left dangling.
with open('researchers.json', 'w') as output_file:
    json.dump(researchers, output_file)
with open('researchers.json') as input_file:
    researchers = json.load(input_file)
# Notebook-style inspection of the first researcher
researchers[0]
# Pandas! Convert the list of researcher dicts to a data frame.
df = pd.DataFrame(researchers)
%matplotlib inline
# isnan: Houston, we've had a problem — researchers whose citation
# count could not be fetched give NaN; mask those (and the
# zero-citation rows, which would break the log-log fit) out.
indices = (~np.isnan(df['citations'])) & (df['citations'] != 0)
# Positions within the masked subset mapped back to data-frame
# labels; used later to annotate the convex-hull points.
reverse_index = indices[indices].index.values
# Plot followers against citations on log-log axes; the marker size
# scales with the researcher's number of tweets.
matplotlib.rc('font', family='DejaVu Sans')
fig = df.plot(x='citations', y='Followers count',
              kind='scatter', figsize=(15, 10),
              marker='*', s=df['Statuses count']/10,
              linewidth=2, color=(0.8, 0.8, 0.8))
ax = plt.gca()
ax.set_xscale('log')
ax.set_yscale('log')
plt.xlabel('Google Scholar citations')
plt.ylabel('Twitter followers count')
plt.title('Kardashian index for Brede Wiki researchers on Google Scholar and Twitter')
# Power law fit: a linear fit in log-log space, so
# followers = exp(p[1]) * citations ** p[0].
# .loc replaces the deprecated (later removed) .ix indexer; the two
# are equivalent for boolean-mask indexing.
p = np.polyfit(np.log(df.loc[indices, 'citations']),
               np.log(df.loc[indices, 'Followers count']), 1)
powerlaw = np.frompyfunc(lambda x: np.exp(p[1]) * x ** p[0], 1, 1)
plt.plot([1, 200000], powerlaw([1, 200000]), linewidth=5, color=(0.5, 1, 0.5))
plt.text(10, 5000, '{:.3} x citations^{:0.2}'.format(np.exp(p[1]), p[0]),
         fontsize=20)
# Annotate the outermost points: the vertices of the convex hull of
# the (citations, followers) cloud get the researcher's name.
# .loc replaces the deprecated (later removed) .ix indexer.
hull = scipy.spatial.ConvexHull(df.loc[indices, ['citations', 'Followers count']])
for index in hull.vertices:
    x, y, name = df.loc[reverse_index[index],
                        ['citations', 'Followers count', 'name']].values
    try:
        plt.text(x, y, name, horizontalalignment='center',
                 verticalalignment='center')
    except Exception:
        # Best effort: skip names matplotlib fails to render
        # (e.g. glyphs missing from the font).
        pass
# Myself and Ryoto and et al.
family_names = ['Nielsen', 'Tomioka', 'Willighagen']
for family_name in family_names:
    x, y, name = df.loc[df['family_name'] == family_name,
                        ['citations', 'Followers count', 'name']].values.flatten()
    plt.text(x, y, name, horizontalalignment='center',
             verticalalignment='center')
dummy = plt.axis((1, 200000, 1, 20000))
plt.show()
df.describe()
# K-index: the ratio between the actual number of followers and the
# number the fitted power law predicts from the citation count.
df['K-index'] = df['Followers count'] / powerlaw(df['citations'])
# Identify the 'scientific Kardashians': highest K-index first.
# sort_values replaces the deprecated (later removed) DataFrame.sort.
high_score = df[indices].sort_values(
    by='K-index', ascending=False)[['name', 'K-index', 'Statuses count']]
high_score
# Not all is shown above
# The below code will give the full list:
# https://stackoverflow.com/questions/23388810/ipython-notebook-output-cell-is-truncating-contents-of-my-list
from IPython.display import HTML
HTML(high_score.to_html())