User_profile_creation

class user_profile_creation.extract_user_profiles(univ_name, output_path)[source]

Bases: object

Class which can extract profiles of all users from a university

extract_info(url, user_id)[source]

Function to extract a particular user’s information from general university URL

Parameters
  • url (str) – The URL from which response is to be retrieved

  • user_id (str) – ID of the particular scholar

Returns

None

extract_profiles()[source]

Function to compile Scholar data of a particular university. The function will first identify the total number of scholars in a university and then get basic summary available for each scholar.

Parameters

None

Returns

None

get_awards()[source]

Function to Research areas of the Scholar from University Page

Parameters

None

Returns

Research areas of the Scholar, Length of research_areas

Return type

Tuple (List, Int)

get_department()[source]

Function to extract Department of the Scholar from University Page

Parameters

None

Returns

Department of the Scholar

Return type

str

get_department_info()[source]

Function to extract Department info (including course area) of the Scholar from University Page

Parameters

None

Returns

Department info of the Scholar

Return type

str

get_email()[source]

Function to extract email of the Scholar from University Page

Parameters

None

Returns

Email of the Scholar

Return type

str

get_keywords()[source]

Function to extract keywords of the Scholar from University Page

Parameters

None

Returns

Keywords of the Scholar

Return type

str

get_name()[source]

Function to extract name of the Scholar from University Page

Parameters

None

Returns

Name of the Scholar

Return type

str

get_netid()[source]

Function to extract NetID (University Unique Identifier) of the Scholar from University Page

Parameters

None

Returns

NetID of the Scholar

Return type

str

get_npublications()[source]

Function to get the no of publciations of the Scholar from University Page

Parameters

None

Returns

Publications of the Scholar

Return type

str

get_organizations()[source]

Function to extract Organizations of the Scholar from University Page

Parameters

None

Returns

Organizations of the Scholar

Return type

str

get_overview()[source]

Function to extract Overview of the Scholar from University Page

Parameters

None

Returns

Overview of the Scholar

Return type

str

get_profile(url, user_id)[source]

Function to extract all details of a scholar from University Page

Parameters
  • url (str) – The base university URL from which Scholars’ data can be extracted by appending their user_ids

  • user_id (str) – The university provided User ID of the scholar

Returns

Scholar Data in the form of Pandas.DataFrame

Return type

Pandas.DataFrame

get_publications()[source]

Function to extract publications of the Scholar from University Page

Parameters

None

Returns

Publications of the Scholar

Return type

str

get_research()[source]

Function to Research areas of the Scholar from University Page

Parameters

None

Returns

Research areas of the Scholar, Length of research_areas

Return type

Tuple (List, Int)

get_title()[source]

Function to extract Prefered title of the Scholar from University Page

Parameters

None

Returns

Preferred title of the Scholar

Return type

str

user_profile_creation.get_userid(user_dict, key)[source]

Function to get User IDs from JSON

Parameters

user_dict (JSON) – The URL from which response is to be retrieved

Returns

User ID

Return type

str