usaspending_api/common/etl/spark.py
File spark.py
has 639 lines of code (exceeds 250 allowed). Consider refactoring. Open
Open
"""
Spark utility functions that could be used as stages or steps of an ETL job (aka "data pipeline")
NOTE: This is distinguished from the usaspending_api.common.helpers.spark_helpers module, which holds mostly boilerplate
functions for setup and configuration of the spark environment
Function extract_db_data_frame
has 10 arguments (exceeds 6 allowed). Consider refactoring. Wontfix
Wontfix
def extract_db_data_frame(
Function write_csv_file
has 7 arguments (exceeds 6 allowed). Consider refactoring. Open
Open
def write_csv_file(