Create_analytical_data

class create_analytical_data.Analytical_Data_Creator(n_cores, univ_name, params)[source]

Bases: object

Class that builds a user (i.e. scholar) database from the details on each scholar's profile page and their relevant publications.

Parameters
  • user_organisation (str) – Text from which keywords are to be extracted

  • i (str) – User ID (dummy_variable)

Returns

Space separated set of keywords

Return type

str

create_publication_data()[source]

Main function to process and compile the final Scholars’ dataset

Parameters

None

Returns

None

create_user_token_data()[source]

Function to create tokens from the profile page (Organization, Overview and Keyword sections) for all users

Parameters

None

Returns

None

create_analytical_data.get_author_pubinfo(scholar_df, i, top_n=5, top_title=True)[source]

Function to extract information on the author's top N publications

Parameters
  • scholar_df (pandas.DataFrame) – DataFrame containing all of the user’s information

  • i (str) – User ID (dummy_variable)

  • top_n (int) – Number of top titles to use, ranked by relevance

  • top_title (bool) – If True, only the top N publications are extracted. Otherwise, all publication data is used.

Returns

Tuple of dictionaries. Each dictionary contains the user ID as key and the keywords from publication titles / user keywords as values

Return type

Tuple
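
The top-N selection described above can be sketched as follows. This is an illustration under stated assumptions, not the function's source: the helper name top_titles_by_user and the record keys (user_id, title, relevance) are hypothetical, and a plain list of dicts stands in for the pandas DataFrame so the example has no third-party dependency.

```python
from collections import defaultdict


def top_titles_by_user(records, top_n=5, top_title=True):
    """Map each user ID to its publication titles, most relevant first.

    records: iterable of dicts with the hypothetical keys
             'user_id', 'title' and 'relevance' (higher = more relevant).
    """
    # Group publication records by user.
    grouped = defaultdict(list)
    for rec in records:
        grouped[rec["user_id"]].append(rec)

    result = {}
    for user_id, pubs in grouped.items():
        # Rank by relevance, then optionally keep only the top N.
        ranked = sorted(pubs, key=lambda r: r["relevance"], reverse=True)
        if top_title:
            ranked = ranked[:top_n]
        result[user_id] = [r["title"] for r in ranked]
    return result


# Made-up sample data, for illustration only.
records = [
    {"user_id": "u1", "title": "Graph kernels", "relevance": 0.9},
    {"user_id": "u1", "title": "Sparse models", "relevance": 0.4},
    {"user_id": "u1", "title": "Topic models", "relevance": 0.7},
    {"user_id": "u2", "title": "Protein folding", "relevance": 0.8},
]
print(top_titles_by_user(records, top_n=2))
# {'u1': ['Graph kernels', 'Topic models'], 'u2': ['Protein folding']}
```

With top_title=False the slice is skipped, so every title for the user is kept, matching the documented meaning of that flag.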

create_analytical_data.user_keywords(user_key, i)[source]

Function to calculate tokens from user’s keywords

Parameters
  • user_key (str) – Text from which keywords are to be extracted

  • i (str) – User ID (dummy_variable)

Returns

Space separated set of keywords

Return type

str
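
To illustrate the documented return format (a space-separated set of keywords), here is a minimal sketch. It is not the module's implementation: the helper name extract_keywords, the stop-word list and the tokenization rule are all assumptions made for the example.

```python
import re

# Hypothetical stop-word list; the real module may use a different one.
STOP_WORDS = {"a", "an", "and", "for", "in", "of", "on", "the", "to", "with"}


def extract_keywords(text, i=None):
    """Return unique lowercase keywords from `text`, joined by spaces.

    `i` mirrors the documented user-ID argument; this sketch does not use it.
    """
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    seen = set()
    keywords = []
    for token in tokens:
        # Skip stop words and tokens that were already emitted.
        if token in STOP_WORDS or token in seen:
            continue
        seen.add(token)
        keywords.append(token)
    return " ".join(keywords)


print(extract_keywords("Machine Learning and the Analysis of Learning Systems", "u001"))
# machine learning analysis systems
```

Keeping first-occurrence order while de-duplicating makes the output deterministic, which a bare Python set would not guarantee.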

create_analytical_data.user_o_keywords(user_overview, i)[source]

Function to calculate tokens from user’s Overview

Parameters
  • user_overview (str) – Text from which keywords are to be extracted

  • i (str) – User ID (dummy_variable)

Returns

Space separated set of keywords

Return type

str

create_analytical_data.user_org_keywords(user_organisation, i)[source]

Function to calculate tokens from user’s organization

Parameters
  • user_organisation (str) – Text from which keywords are to be extracted

  • i (str) – User ID (dummy_variable)

Returns

Space separated set of keywords

Return type

str