Extract_publications

class extract_publications.Extract_Publications(n_cores, univ_name, params)[source]

Bases: object

Class which will extract all the publication details of all the scholars of a given university

create_univ_publication_data()[source]

Main function which will create the publication data for all the users of a university

Parameters

None

Returns

None

create_user_publication_data(user_id, pub_ids)[source]

Creates the publication data for a single user by scraping university webpage. The function takes in a publication ID and get complete detail of the data from University webpage.

Parameters
  • user_id (str) – User_id for each user whose publications are to be extracted

  • pub_ids (List) – List of all publication IDs for the user

Returns

Dataframe for each user where each row is a publication of a user. Total no of rows = n_publications

Return type

class Pandas.DataFrame

get_publication_ids(user_id, str_)[source]

Returns a dictionary where each user_id is a key and a list of his/her publications as values

Parameters

user_id (str) – User_id for each user whose publications are to be extracted

param str_: WIP :type str_: str

Returns

Dictonary of {User IDs : List of publications}

Return type

class `Dictionary `

save_user_publications(univ_name='TAMU')[source]

Function to save the publication details of each user

Parameters

univ_name (`str1) – University name of the user - which determines the file names for saving.