AyuntamientoMadrid/participacion

View on GitHub
app/models/machine_learning.rb

Summary

Maintainability
F
4 days
Test Coverage

Possible command injection
Open

      output = `cd #{SCRIPTS_FOLDER} && #{command} 2>&1`
Severity: Minor
Found in app/models/machine_learning.rb by brakeman

Injection is #1 on the 2010 OWASP Top Ten web security risks. Command injection occurs when shell commands unsafely include user-manipulatable values.

There are many ways to run commands in Ruby:

`ls #{params[:file]}`

system("ls #{params[:dir]}")

exec("md5sum #{params[:input]}")

Brakeman will warn on any method like these that uses user input or unsafely interpolates variables.

See the Ruby Security Guide for details.

Class MachineLearning has 45 methods (exceeds 20 allowed). Consider refactoring.
Open

class MachineLearning
  attr_reader :user, :script, :previous_modified_date
  attr_accessor :job

  SCRIPTS_FOLDER = Rails.root.join("public", "machine_learning", "scripts").freeze
Severity: Minor
Found in app/models/machine_learning.rb - About 6 hrs to fix

    File machine_learning.rb has 414 lines of code (exceeds 250 allowed). Consider refactoring.
    Open

    class MachineLearning
      attr_reader :user, :script, :previous_modified_date
      attr_accessor :job
    
      SCRIPTS_FOLDER = Rails.root.join("public", "machine_learning", "scripts").freeze
    Severity: Minor
    Found in app/models/machine_learning.rb - About 5 hrs to fix

      Method run has 43 lines of code (exceeds 25 allowed). Consider refactoring.
      Open

        def run
          begin
            export_proposals_to_json
            export_budget_investments_to_json
            export_comments_to_json
      Severity: Minor
      Found in app/models/machine_learning.rb - About 1 hr to fix

        Method description_from has a Cognitive Complexity of 12 (exceeds 5 allowed). Consider refactoring.
        Open

            def description_from(script_filename)
              description = ""
              delimiter = '"""'
              break_line = "<br>"
              comment_found = false
        Severity: Minor
        Found in app/models/machine_learning.rb - About 1 hr to fix

        Cognitive Complexity

        Cognitive Complexity is a measure of how difficult a unit of code is to intuitively understand. Unlike Cyclomatic Complexity, which determines how difficult your code will be to test, Cognitive Complexity tells you how difficult your code will be to read and comprehend.

        A method's cognitive complexity is based on a few simple rules:

        • Code is not considered more complex when it uses shorthand that the language provides for collapsing multiple statements into one
        • Code is considered more complex for each "break in the linear flow of the code"
        • Code is considered more complex when "flow breaking structures are nested"

        Further reading

        Method import_ml_proposals_tags has a Cognitive Complexity of 11 (exceeds 5 allowed). Consider refactoring.
        Open

            def import_ml_proposals_tags
              ids = {}
              json_file = data_folder.join(MachineLearning.proposals_tags_filename)
              json_data = JSON.parse(File.read(json_file)).each(&:deep_symbolize_keys!)
              json_data.each do |attributes|
        Severity: Minor
        Found in app/models/machine_learning.rb - About 1 hr to fix

        Cognitive Complexity

        Cognitive Complexity is a measure of how difficult a unit of code is to intuitively understand. Unlike Cyclomatic Complexity, which determines how difficult your code will be to test, Cognitive Complexity tells you how difficult your code will be to read and comprehend.

        A method's cognitive complexity is based on a few simple rules:

        • Code is not considered more complex when it uses shorthand that the language provides for collapsing multiple statements into one
        • Code is considered more complex for each "break in the linear flow of the code"
        • Code is considered more complex when "flow breaking structures are nested"

        Further reading

        Method import_ml_investments_tags has a Cognitive Complexity of 11 (exceeds 5 allowed). Consider refactoring.
        Open

            def import_ml_investments_tags
              ids = {}
              json_file = data_folder.join(MachineLearning.investments_tags_filename)
              json_data = JSON.parse(File.read(json_file)).each(&:deep_symbolize_keys!)
              json_data.each do |attributes|
        Severity: Minor
        Found in app/models/machine_learning.rb - About 1 hr to fix

        Cognitive Complexity

        Cognitive Complexity is a measure of how difficult a unit of code is to intuitively understand. Unlike Cyclomatic Complexity, which determines how difficult your code will be to test, Cognitive Complexity tells you how difficult your code will be to read and comprehend.

        A method's cognitive complexity is based on a few simple rules:

        • Code is not considered more complex when it uses shorthand that the language provides for collapsing multiple statements into one
        • Code is considered more complex for each "break in the linear flow of the code"
        • Code is considered more complex when "flow breaking structures are nested"

        Further reading

        Method run has a Cognitive Complexity of 11 (exceeds 5 allowed). Consider refactoring.
        Open

          def run
            begin
              export_proposals_to_json
              export_budget_investments_to_json
              export_comments_to_json
        Severity: Minor
        Found in app/models/machine_learning.rb - About 1 hr to fix

        Cognitive Complexity

        Cognitive Complexity is a measure of how difficult a unit of code is to intuitively understand. Unlike Cyclomatic Complexity, which determines how difficult your code will be to test, Cognitive Complexity tells you how difficult your code will be to read and comprehend.

        A method's cognitive complexity is based on a few simple rules:

        • Code is not considered more complex when it uses shorthand that the language provides for collapsing multiple statements into one
        • Code is considered more complex for each "break in the linear flow of the code"
        • Code is considered more complex when "flow breaking structures are nested"

        Further reading

        Method data_output_files has 26 lines of code (exceeds 25 allowed). Consider refactoring.
        Open

            def data_output_files
              files = { tags: [], related_content: [], comments_summary: [] }
        
              if File.exist?(data_folder.join(proposals_tags_filename))
                files[:tags] << proposals_tags_filename
        Severity: Minor
        Found in app/models/machine_learning.rb - About 1 hr to fix

          Method import_ml_proposals_tags has 26 lines of code (exceeds 25 allowed). Consider refactoring.
          Open

              def import_ml_proposals_tags
                ids = {}
                json_file = data_folder.join(MachineLearning.proposals_tags_filename)
                json_data = JSON.parse(File.read(json_file)).each(&:deep_symbolize_keys!)
                json_data.each do |attributes|
          Severity: Minor
          Found in app/models/machine_learning.rb - About 1 hr to fix

            Method import_ml_investments_tags has 26 lines of code (exceeds 25 allowed). Consider refactoring.
            Open

                def import_ml_investments_tags
                  ids = {}
                  json_file = data_folder.join(MachineLearning.investments_tags_filename)
                  json_data = JSON.parse(File.read(json_file)).each(&:deep_symbolize_keys!)
                  json_data.each do |attributes|
            Severity: Minor
            Found in app/models/machine_learning.rb - About 1 hr to fix

              Method data_output_files has a Cognitive Complexity of 8 (exceeds 5 allowed). Consider refactoring.
              Open

                  def data_output_files
                    files = { tags: [], related_content: [], comments_summary: [] }
              
                    if File.exist?(data_folder.join(proposals_tags_filename))
                      files[:tags] << proposals_tags_filename
              Severity: Minor
              Found in app/models/machine_learning.rb - About 45 mins to fix

              Cognitive Complexity

              Cognitive Complexity is a measure of how difficult a unit of code is to intuitively understand. Unlike Cyclomatic Complexity, which determines how difficult your code will be to test, Cognitive Complexity tells you how difficult your code will be to read and comprehend.

              A method's cognitive complexity is based on a few simple rules:

              • Code is not considered more complex when it uses shorthand that the language provides for collapsing multiple statements into one
              • Code is considered more complex for each "break in the linear flow of the code"
              • Code is considered more complex when "flow breaking structures are nested"

              Further reading

              Method import_budget_investments_related_content has a Cognitive Complexity of 8 (exceeds 5 allowed). Consider refactoring.
              Open

                  def import_budget_investments_related_content
                    json_file = data_folder.join(MachineLearning.investments_related_filename)
                    json_data = JSON.parse(File.read(json_file)).each(&:deep_symbolize_keys!)
                    json_data.each do |related|
                      id = related.delete(:id)
              Severity: Minor
              Found in app/models/machine_learning.rb - About 45 mins to fix

              Cognitive Complexity

              Cognitive Complexity is a measure of how difficult a unit of code is to intuitively understand. Unlike Cyclomatic Complexity, which determines how difficult your code will be to test, Cognitive Complexity tells you how difficult your code will be to read and comprehend.

              A method's cognitive complexity is based on a few simple rules:

              • Code is not considered more complex when it uses shorthand that the language provides for collapsing multiple statements into one
              • Code is considered more complex for each "break in the linear flow of the code"
              • Code is considered more complex when "flow breaking structures are nested"

              Further reading

              Method import_proposals_related_content has a Cognitive Complexity of 8 (exceeds 5 allowed). Consider refactoring.
              Open

                  def import_proposals_related_content
                    json_file = data_folder.join(MachineLearning.proposals_related_filename)
                    json_data = JSON.parse(File.read(json_file)).each(&:deep_symbolize_keys!)
                    json_data.each do |related|
                      id = related.delete(:id)
              Severity: Minor
              Found in app/models/machine_learning.rb - About 45 mins to fix

              Cognitive Complexity

              Cognitive Complexity is a measure of how difficult a unit of code is to intuitively understand. Unlike Cyclomatic Complexity, which determines how difficult your code will be to test, Cognitive Complexity tells you how difficult your code will be to read and comprehend.

              A method's cognitive complexity is based on a few simple rules:

              • Code is not considered more complex when it uses shorthand that the language provides for collapsing multiple statements into one
              • Code is considered more complex for each "break in the linear flow of the code"
              • Code is considered more complex when "flow breaking structures are nested"

              Further reading

              Similar blocks of code found in 2 locations. Consider refactoring.
              Open

                  def import_ml_investments_tags
                    ids = {}
                    json_file = data_folder.join(MachineLearning.investments_tags_filename)
                    json_data = JSON.parse(File.read(json_file)).each(&:deep_symbolize_keys!)
                    json_data.each do |attributes|
              Severity: Major
              Found in app/models/machine_learning.rb and 1 other location - About 3 hrs to fix
              app/models/machine_learning.rb on lines 373..398

              Duplicated Code

              Duplicated code can lead to software that is hard to understand and difficult to change. The Don't Repeat Yourself (DRY) principle states:

              Every piece of knowledge must have a single, unambiguous, authoritative representation within a system.

              When you violate DRY, bugs and maintenance problems are sure to follow. Duplicated code has a tendency to both continue to replicate and also to diverge (leaving bugs as two similar implementations differ in subtle ways).

              Tuning

              This issue has a mass of 118.

              We set useful threshold defaults for the languages we support but you may want to adjust these settings based on your project guidelines.

              The threshold configuration represents the minimum mass a code block must have to be analyzed for duplication. The lower the threshold, the more fine-grained the comparison.

              If the engine is too easily reporting duplication, try raising the threshold. If you suspect that the engine isn't catching enough duplication, try lowering the threshold. The best setting tends to differ from language to language.

              See codeclimate-duplication's documentation for more information about tuning the mass threshold in your .codeclimate.yml.

              Refactorings

              Further Reading

              Similar blocks of code found in 2 locations. Consider refactoring.
              Open

                  def import_ml_proposals_tags
                    ids = {}
                    json_file = data_folder.join(MachineLearning.proposals_tags_filename)
                    json_data = JSON.parse(File.read(json_file)).each(&:deep_symbolize_keys!)
                    json_data.each do |attributes|
              Severity: Major
              Found in app/models/machine_learning.rb and 1 other location - About 3 hrs to fix
              app/models/machine_learning.rb on lines 403..428

              Duplicated Code

              Duplicated code can lead to software that is hard to understand and difficult to change. The Don't Repeat Yourself (DRY) principle states:

              Every piece of knowledge must have a single, unambiguous, authoritative representation within a system.

              When you violate DRY, bugs and maintenance problems are sure to follow. Duplicated code has a tendency to both continue to replicate and also to diverge (leaving bugs as two similar implementations differ in subtle ways).

              Tuning

              This issue has a mass of 118.

              We set useful threshold defaults for the languages we support but you may want to adjust these settings based on your project guidelines.

              The threshold configuration represents the minimum mass a code block must have to be analyzed for duplication. The lower the threshold, the more fine-grained the comparison.

              If the engine is too easily reporting duplication, try raising the threshold. If you suspect that the engine isn't catching enough duplication, try lowering the threshold. The best setting tends to differ from language to language.

              See codeclimate-duplication's documentation for more information about tuning the mass threshold in your .codeclimate.yml.

              Refactorings

              Further Reading

              Similar blocks of code found in 2 locations. Consider refactoring.
              Open

                  def import_proposals_related_content
                    json_file = data_folder.join(MachineLearning.proposals_related_filename)
                    json_data = JSON.parse(File.read(json_file)).each(&:deep_symbolize_keys!)
                    json_data.each do |related|
                      id = related.delete(:id)
              Severity: Major
              Found in app/models/machine_learning.rb and 1 other location - About 1 hr to fix
              app/models/machine_learning.rb on lines 345..369

              Duplicated Code

              Duplicated code can lead to software that is hard to understand and difficult to change. The Don't Repeat Yourself (DRY) principle states:

              Every piece of knowledge must have a single, unambiguous, authoritative representation within a system.

              When you violate DRY, bugs and maintenance problems are sure to follow. Duplicated code has a tendency to both continue to replicate and also to diverge (leaving bugs as two similar implementations differ in subtle ways).

              Tuning

              This issue has a mass of 74.

              We set useful threshold defaults for the languages we support but you may want to adjust these settings based on your project guidelines.

              The threshold configuration represents the minimum mass a code block must have to be analyzed for duplication. The lower the threshold, the more fine-grained the comparison.

              If the engine is too easily reporting duplication, try raising the threshold. If you suspect that the engine isn't catching enough duplication, try lowering the threshold. The best setting tends to differ from language to language.

              See codeclimate-duplication's documentation for more information about tuning the mass threshold in your .codeclimate.yml.

              Refactorings

              Further Reading

              Similar blocks of code found in 2 locations. Consider refactoring.
              Open

                  def import_budget_investments_related_content
                    json_file = data_folder.join(MachineLearning.investments_related_filename)
                    json_data = JSON.parse(File.read(json_file)).each(&:deep_symbolize_keys!)
                    json_data.each do |related|
                      id = related.delete(:id)
              Severity: Major
              Found in app/models/machine_learning.rb and 1 other location - About 1 hr to fix
              app/models/machine_learning.rb on lines 317..341

              Duplicated Code

              Duplicated code can lead to software that is hard to understand and difficult to change. The Don't Repeat Yourself (DRY) principle states:

              Every piece of knowledge must have a single, unambiguous, authoritative representation within a system.

              When you violate DRY, bugs and maintenance problems are sure to follow. Duplicated code has a tendency to both continue to replicate and also to diverge (leaving bugs as two similar implementations differ in subtle ways).

              Tuning

              This issue has a mass of 74.

              We set useful threshold defaults for the languages we support but you may want to adjust these settings based on your project guidelines.

              The threshold configuration represents the minimum mass a code block must have to be analyzed for duplication. The lower the threshold, the more fine-grained the comparison.

              If the engine is too easily reporting duplication, try raising the threshold. If you suspect that the engine isn't catching enough duplication, try lowering the threshold. The best setting tends to differ from language to language.

              See codeclimate-duplication's documentation for more information about tuning the mass threshold in your .codeclimate.yml.

              Refactorings

              Further Reading

              Similar blocks of code found in 2 locations. Consider refactoring.
              Open

                  def import_ml_investments_comments_summary
                    json_file = data_folder.join(MachineLearning.investments_comments_summary_filename)
                    json_data = JSON.parse(File.read(json_file)).each(&:deep_symbolize_keys!)
                    json_data.each do |attributes|
                      attributes.delete(:id)
              Severity: Minor
              Found in app/models/machine_learning.rb and 1 other location - About 40 mins to fix
              app/models/machine_learning.rb on lines 293..300

              Duplicated Code

              Duplicated code can lead to software that is hard to understand and difficult to change. The Don't Repeat Yourself (DRY) principle states:

              Every piece of knowledge must have a single, unambiguous, authoritative representation within a system.

              When you violate DRY, bugs and maintenance problems are sure to follow. Duplicated code has a tendency to both continue to replicate and also to diverge (leaving bugs as two similar implementations differ in subtle ways).

              Tuning

              This issue has a mass of 37.

              We set useful threshold defaults for the languages we support but you may want to adjust these settings based on your project guidelines.

              The threshold configuration represents the minimum mass a code block must have to be analyzed for duplication. The lower the threshold, the more fine-grained the comparison.

              If the engine is too easily reporting duplication, try raising the threshold. If you suspect that the engine isn't catching enough duplication, try lowering the threshold. The best setting tends to differ from language to language.

              See codeclimate-duplication's documentation for more information about tuning the mass threshold in your .codeclimate.yml.

              Refactorings

              Further Reading

              Similar blocks of code found in 2 locations. Consider refactoring.
              Open

                  def import_ml_proposals_comments_summary
                    json_file = data_folder.join(MachineLearning.proposals_comments_summary_filename)
                    json_data = JSON.parse(File.read(json_file)).each(&:deep_symbolize_keys!)
                    json_data.each do |attributes|
                      attributes.delete(:id)
              Severity: Minor
              Found in app/models/machine_learning.rb and 1 other location - About 40 mins to fix
              app/models/machine_learning.rb on lines 305..312

              Duplicated Code

              Duplicated code can lead to software that is hard to understand and difficult to change. The Don't Repeat Yourself (DRY) principle states:

              Every piece of knowledge must have a single, unambiguous, authoritative representation within a system.

              When you violate DRY, bugs and maintenance problems are sure to follow. Duplicated code has a tendency to both continue to replicate and also to diverge (leaving bugs as two similar implementations differ in subtle ways).

              Tuning

              This issue has a mass of 37.

              We set useful threshold defaults for the languages we support but you may want to adjust these settings based on your project guidelines.

              The threshold configuration represents the minimum mass a code block must have to be analyzed for duplication. The lower the threshold, the more fine-grained the comparison.

              If the engine is too easily reporting duplication, try raising the threshold. If you suspect that the engine isn't catching enough duplication, try lowering the threshold. The best setting tends to differ from language to language.

              See codeclimate-duplication's documentation for more information about tuning the mass threshold in your .codeclimate.yml.

              Refactorings

              Further Reading

              Similar blocks of code found in 2 locations. Consider refactoring.
              Open

                  def cleanup_investments_tags!
                    Tagging.where(context: "ml_tags", taggable_type: "Budget::Investment").find_each(&:destroy!)
                    Tag.find_each { |tag| tag.destroy! if Tagging.where(tag: tag).empty? }
              Severity: Minor
              Found in app/models/machine_learning.rb and 1 other location - About 15 mins to fix
              app/models/machine_learning.rb on lines 267..269

              Duplicated Code

              Duplicated code can lead to software that is hard to understand and difficult to change. The Don't Repeat Yourself (DRY) principle states:

              Every piece of knowledge must have a single, unambiguous, authoritative representation within a system.

              When you violate DRY, bugs and maintenance problems are sure to follow. Duplicated code has a tendency to both continue to replicate and also to diverge (leaving bugs as two similar implementations differ in subtle ways).

              Tuning

              This issue has a mass of 25.

              We set useful threshold defaults for the languages we support but you may want to adjust these settings based on your project guidelines.

              The threshold configuration represents the minimum mass a code block must have to be analyzed for duplication. The lower the threshold, the more fine-grained the comparison.

              If the engine is too easily reporting duplication, try raising the threshold. If you suspect that the engine isn't catching enough duplication, try lowering the threshold. The best setting tends to differ from language to language.

              See codeclimate-duplication's documentation for more information about tuning the mass threshold in your .codeclimate.yml.

              Refactorings

              Further Reading

              Similar blocks of code found in 2 locations. Consider refactoring.
              Open

                  def cleanup_proposals_tags!
                    Tagging.where(context: "ml_tags", taggable_type: "Proposal").find_each(&:destroy!)
                    Tag.find_each { |tag| tag.destroy! if Tagging.where(tag: tag).empty? }
              Severity: Minor
              Found in app/models/machine_learning.rb and 1 other location - About 15 mins to fix
              app/models/machine_learning.rb on lines 272..274

              Duplicated Code

              Duplicated code can lead to software that is hard to understand and difficult to change. The Don't Repeat Yourself (DRY) principle states:

              Every piece of knowledge must have a single, unambiguous, authoritative representation within a system.

              When you violate DRY, bugs and maintenance problems are sure to follow. Duplicated code has a tendency to both continue to replicate and also to diverge (leaving bugs as two similar implementations differ in subtle ways).

              Tuning

              This issue has a mass of 25.

              We set useful threshold defaults for the languages we support but you may want to adjust these settings based on your project guidelines.

              The threshold configuration represents the minimum mass a code block must have to be analyzed for duplication. The lower the threshold, the more fine-grained the comparison.

              If the engine is too easily reporting duplication, try raising the threshold. If you suspect that the engine isn't catching enough duplication, try lowering the threshold. The best setting tends to differ from language to language.

              See codeclimate-duplication's documentation for more information about tuning the mass threshold in your .codeclimate.yml.

              Refactorings

              Further Reading

              There are no issues that match your filters.

              Category
              Status