Extract_proposals

class extract_proposals.GrantsDataExtractor(xml_url, csv_url, agencies, params)

Bases: object

Extracts data from the Grants.gov website.

By design, the list of all open proposals is first downloaded from Grants.gov. Then, for each proposal, further data is extracted from its dedicated webpage (for example, on the NSF website).

ExtractCSVData()

Extract data from the downloaded CSV file. Once extracted, the data is saved as a dataframe in self.metadata.

Parameters

None

Returns

None
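
As an illustration of the extraction step only, here is a minimal sketch that reads CSV text into one record per row. It is not the class's actual implementation: the helper name csv_to_records, the sample data, and the column names are hypothetical, and it uses the standard library so that it runs without extra dependencies. A list of dicts like this can be passed to pandas.DataFrame to obtain a dataframe such as self.metadata.

```python
import csv
import io

# Hypothetical sample standing in for the downloaded file; real columns may differ.
SAMPLE_CSV = "OpportunityNumber,Title\nNSF-24-001,Example A\nNIH-24-002,Example B\n"


def csv_to_records(csv_text):
    """Return one dict per data row, keyed by the names in the header line."""
    return list(csv.DictReader(io.StringIO(csv_text)))


metadata = csv_to_records(SAMPLE_CSV)
```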

ExtractXMLData()

Extract data from the XML file. Once extracted, the data is saved as a dataframe in self.opps_df.

Parameters

None

Returns

None
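
The following sketch shows one way XML records could be flattened into rows before being stored as a dataframe. The helper xml_to_records, the sample document, and the Opportunity tag are assumptions made for illustration; the real Grants.gov extract may use different tag names and XML namespaces, which would need to be handled when matching tags.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample; the real extract may use other tags and namespaces.
SAMPLE_XML = """<Grants>
  <Opportunity>
    <OpportunityID>1001</OpportunityID>
    <AgencyCode>NSF</AgencyCode>
    <CloseDate>12312030</CloseDate>
  </Opportunity>
  <Opportunity>
    <OpportunityID>1002</OpportunityID>
    <AgencyCode>NIH</AgencyCode>
  </Opportunity>
</Grants>"""


def xml_to_records(xml_text, record_tag="Opportunity"):
    """Return one dict per <record_tag> element, mapping child tag to text."""
    root = ET.fromstring(xml_text)
    records = []
    for node in root.iter(record_tag):
        records.append({child.tag: (child.text or "").strip() for child in node})
    return records


records = xml_to_records(SAMPLE_XML)
```

Fields absent from a record are simply missing from its dict, so a dataframe built from these rows (for example with pandas.DataFrame(records)) would hold empty values in those cells.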

ProcessXMLData()

Process the extracted XML data: reformat the CloseDate, PostDate, and LastUpdateDate columns, and identify open proposals.

Parameters

None

Returns

None
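
A minimal sketch of the processing step, under stated assumptions: dates are taken to be MMDDYYYY strings, and a proposal counts as open when its close date is missing or not earlier than a reference date. Neither rule is confirmed by this documentation, and the names parse_date, process_records, and IsOpen are hypothetical.

```python
from datetime import date, datetime


def parse_date(text, fmt="%m%d%Y"):
    """Parse a date string; return None for missing or malformed values."""
    try:
        return datetime.strptime(text, fmt).date()
    except (TypeError, ValueError):
        return None


def process_records(records, today):
    """Convert the date columns in place and add an 'IsOpen' flag."""
    for rec in records:
        for col in ("PostDate", "CloseDate", "LastUpdateDate"):
            rec[col] = parse_date(rec.get(col))
        close = rec["CloseDate"]
        # Assumption: a missing or unparseable close date is treated as open.
        rec["IsOpen"] = close is None or close >= today
    return records


rows = process_records(
    [
        {"PostDate": "01152024", "CloseDate": "12312030"},
        {"CloseDate": "01012020"},
        {},
    ],
    today=date(2025, 1, 1),
)
```

Passing the reference date as an argument, rather than calling date.today() inside the function, keeps the result reproducible.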

SaveXMLData()

Save all the XML data to CSV files. Specifically, open proposals are saved agency-wise in separate files.

Parameters

None

Returns

None
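
The agency-wise split could look like the sketch below, which groups records by agency and writes one CSV file per group. The function save_by_agency, the AgencyCode key, and the file naming scheme are assumptions for illustration, not the class's actual behavior.

```python
import csv
import os
import tempfile
from collections import defaultdict


def save_by_agency(records, out_dir, agency_key="AgencyCode"):
    """Write one CSV per agency and return a dict of agency to file path."""
    groups = defaultdict(list)
    for rec in records:
        groups[rec.get(agency_key) or "UNKNOWN"].append(rec)

    paths = {}
    for agency, rows in groups.items():
        # Union of keys in first-seen order, so every row fits the header.
        fields = list(dict.fromkeys(key for row in rows for key in row))
        path = os.path.join(out_dir, f"{agency}_open.csv")
        with open(path, "w", newline="", encoding="utf-8") as fh:
            writer = csv.DictWriter(fh, fieldnames=fields)
            writer.writeheader()
            writer.writerows(rows)
        paths[agency] = path
    return paths


open_rows = [
    {"OpportunityID": "1001", "AgencyCode": "NSF"},
    {"OpportunityID": "1002", "AgencyCode": "NIH"},
    {"OpportunityID": "1003", "AgencyCode": "NSF", "Title": "Example"},
]
paths = save_by_agency(open_rows, tempfile.mkdtemp())

# Read one file back to confirm its contents.
with open(paths["NSF"], newline="", encoding="utf-8") as fh:
    nsf_rows = list(csv.DictReader(fh))
```

The sketch uses the agency code directly in the file name, so codes containing path separators would need sanitizing first.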