whylabs/whylogs-python

Showing 3,918 of 3,918 total issues

Function _compute_ks_score has a Cognitive Complexity of 18 (exceeds 5 allowed). Consider refactoring.
Open

    def _compute_ks_score(self, target_distribution, reference_distribution, with_thresholds=False):
        QUANTILES = self._parameter_config.quantiles
        if reference_distribution.is_empty() or target_distribution.is_empty():
            return None

Severity: Minor
Found in python/whylogs/viz/drift/column_drift_algorithms.py - About 2 hrs to fix

Cognitive Complexity

Cognitive Complexity is a measure of how difficult a unit of code is to intuitively understand. Unlike Cyclomatic Complexity, which determines how difficult your code will be to test, Cognitive Complexity tells you how difficult your code will be to read and comprehend.

A method's cognitive complexity is based on a few simple rules:

  • Code is not considered more complex when it uses shorthand that the language provides for collapsing multiple statements into one
  • Code is considered more complex for each "break in the linear flow of the code"
  • Code is considered more complex when "flow breaking structures are nested"
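
As a rough illustration of these rules, the sketch below contrasts a nested function with a flattened equivalent. Both functions are hypothetical and are not taken from whylogs; the exact score an analyzer assigns depends on its implementation, so no scores are claimed here.

```python
from typing import Optional, Sequence


def mean_of_positives_nested(values: Optional[Sequence[float]]) -> Optional[float]:
    """Nested version: every extra level of nesting adds to the reading burden."""
    if values is not None:
        if len(values) > 0:
            total = 0.0
            count = 0
            for v in values:
                if v > 0:
                    total += v
                    count += 1
            if count > 0:
                return total / count
    return None


def mean_of_positives_flat(values: Optional[Sequence[float]]) -> Optional[float]:
    """Flat version: guard clauses and a comprehension, same behavior."""
    if not values:
        return None
    positives = [v for v in values if v > 0]
    if not positives:
        return None
    return sum(positives) / len(positives)
```

The flat version returns early instead of nesting, and the list comprehension is the kind of language shorthand that the first rule says does not add complexity.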

Function submetrics_from_protobuf has a Cognitive Complexity of 18 (exceeds 5 allowed). Consider refactoring.
Open

    def submetrics_from_protobuf(cls: Type[MULTI_METRIC], msg: MetricMessage) -> Dict[str, Dict[str, Metric]]:
        submetrics: Dict[str, Dict[str, Metric]] = {}
        submetric_msgs: Dict[str, Dict[str, Dict[str, MetricComponentMessage]]] = {}
        for key, comp_msg in msg.metric_components.items():
            sub_name_and_type, comp_name = key.split("/")
Severity: Minor
Found in python/whylogs/core/metrics/multimetric.py - About 2 hrs to fix

Function merge has a Cognitive Complexity of 18 (exceeds 5 allowed). Consider refactoring.
Open

    def merge(self, other) -> "ModelPerformanceMetrics":
        """
        :type other: ModelMetrics
        """
        if other is None or (other.confusion_matrix is None and other.regression_metrics is None):

Function columnar_update has a Cognitive Complexity of 18 (exceeds 5 allowed). Consider refactoring.
Open

    def columnar_update(self, view: PreprocessedColumn) -> OperationResult:
        successes = 0
        if view.numpy.len > 0:
            if view.numpy.ints is not None:
                self.hll.value.update_np(view.numpy.ints)
Severity: Minor
Found in python/whylogs/core/metrics/metrics.py - About 2 hrs to fix

Function _write_pending has a Cognitive Complexity of 17 (exceeds 5 allowed). Consider refactoring.
Open

    def _write_pending(self) -> None:
        new_state: Dict[Writer, List[PendingWritable]] = {}
        for writer, pending in self._writers.items():
            failures: List[PendingWritable] = []
            self._logger.info(f"Writing out result set with {type(writer).__name__}")

Function _compute_ks_test_p_value has a Cognitive Complexity of 17 (exceeds 5 allowed). Consider refactoring.
Open

def _compute_ks_test_p_value(
    target_distribution: kll_doubles_sketch,
    reference_distribution: kll_doubles_sketch,
    quantiles: Optional[List[float]] = None,
) -> Optional[ColumnDriftValue]:
Severity: Minor
Found in python/whylogs/viz/utils/drift_calculations.py - About 2 hrs to fix

Function _deserialize has a Cognitive Complexity of 17 (exceeds 5 allowed). Consider refactoring.
Open

def _deserialize(
    token: List[str], i: int, metric: Optional[Metric] = None, profile: Optional[DatasetProfile] = None
) -> Tuple[Predicate, int]:
    if token[i] == "~":
        component, i = _get_component(token, i + 1)
Severity: Minor
Found in python/whylogs/core/predicate_parser.py - About 2 hrs to fix

ThreadRollingLogger has 21 functions (exceeds 20 allowed). Consider refactoring.
Open

class ThreadRollingLogger(ThreadActor[LoggerMessage], DataLogger[LoggerStatus]):
    """
    A logger that manages profiles and segments for various dataset timestamps.

    This logger manages a map of dataset timestamp to dataset profile/segment and handles proper

DistributionMetric has 21 functions (exceeds 20 allowed). Consider refactoring.
Open

    class DistributionMetric(Metric):
        kll: KllComponent
        mean: FractionalComponent
        m2: FractionalComponent
Severity: Minor
Found in python/whylogs/core/metrics/metrics.py - About 2 hrs to fix

Similar blocks of code found in 2 locations. Consider refactoring.
Open

            tempFeatureValues.referenceNumberSummary.histogram.counts.slice(0, 30).forEach((count, index) => {
              featureDataForTableForAllProfiles[feature[0]].chartData[1].push({
                axisY: count,
                axisX: index,
              });
Severity: Major
Found in python/whylogs/viz/html/js/whylogs-script.js and 1 other location - About 2 hrs to fix
python/whylogs/viz/html/js/whylogs-script.js on lines 1212..1217

Duplicated Code

Duplicated code can lead to software that is hard to understand and difficult to change. The Don't Repeat Yourself (DRY) principle states:

Every piece of knowledge must have a single, unambiguous, authoritative representation within a system.

When you violate DRY, bugs and maintenance problems are sure to follow. Duplicated code has a tendency to both continue to replicate and also to diverge (leaving bugs as two similar implementations differ in subtle ways).
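
One way to give repeated logic a single representation is to move it into a helper and call that helper for each field. The sketch below is a Python analogue of the per-field rendering loops flagged in this report, not the project's JavaScript; `render_cells`, the sample `feature` data, and the use of integer profile indices are all assumptions made for illustration.

```python
from typing import Optional, Sequence, Set


def render_cells(values: Sequence[Optional[object]], selected: Set[int]) -> str:
    """Render one <div> per selected profile, using "-" for falsy values."""
    return "".join(
        f"<div>{value if value else '-'}</div>"
        for index, value in enumerate(values)
        if index in selected
    )


# Hypothetical per-profile statistics for one feature.
feature = {"mean": [1.5, None, 3.0], "stddev": [0.2, 0.4, None]}
selected = {0, 2}

# One helper call per field replaces one hand-written loop per field.
cells = {name: render_cells(vals, selected) for name, vals in feature.items()}
```

With a single helper, a change to the placeholder or the markup is made in one place instead of in every copy.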

Tuning

This issue has a mass of 77.

We set useful threshold defaults for the languages we support but you may want to adjust these settings based on your project guidelines.

The threshold configuration represents the minimum mass a code block must have to be analyzed for duplication. The lower the threshold, the more fine-grained the comparison.

If the engine is too easily reporting duplication, try raising the threshold. If you suspect that the engine isn't catching enough duplication, try lowering the threshold. The best setting tends to differ from language to language.

See codeclimate-duplication's documentation for more information about tuning the mass threshold in your .codeclimate.yml.
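
The report does not define how mass is computed. As a loose analogy only, the sketch below uses the number of nodes in a Python syntax tree as a stand-in for mass and shows how a minimum threshold filters which blocks are considered at all. `node_count` and `blocks_worth_comparing` are hypothetical names, and this is not the engine's actual algorithm.

```python
import ast
from typing import List


def node_count(source: str) -> int:
    """Count syntax-tree nodes; an assumed stand-in for a block's mass."""
    return sum(1 for _ in ast.walk(ast.parse(source)))


def blocks_worth_comparing(blocks: List[str], threshold: int) -> List[str]:
    """Keep only blocks whose stand-in mass meets the minimum threshold."""
    return [block for block in blocks if node_count(block) >= threshold]


small = "x = 1"
large = (
    "def total(values):\n"
    "    result = 0\n"
    "    for v in values:\n"
    "        if v > 0:\n"
    "            result += v\n"
    "    return result\n"
)

# A low threshold keeps both blocks; a high one keeps only the larger block.
low = blocks_worth_comparing([small, large], threshold=1)
high = blocks_worth_comparing([small, large], threshold=node_count(large))
```

Raising the threshold shrinks the set of candidate blocks, which is why it reduces reported duplication; lowering it does the opposite.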

Similar blocks of code found in 2 locations. Consider refactoring.
Open

              tempFeatureValues.numberSummary.histogram.counts.slice(0, 30).forEach((count, index) => {
                featureDataForTableForAllProfiles[feature[0]].chartData[0].push({
                  axisY: count,
                  axisX: index,
                });
Severity: Major
Found in python/whylogs/viz/html/js/whylogs-script.js and 1 other location - About 2 hrs to fix
python/whylogs/viz/html/js/whylogs-script.js on lines 1239..1244

This issue has a mass of 77.

Function log has a Cognitive Complexity of 16 (exceeds 5 allowed). Consider refactoring.
Open

    def log(
        self,
        obj: Any = None,
        *,
        pandas: Optional[pd.DataFrame] = None,
Severity: Minor
Found in python/whylogs/api/logger/logger.py - About 2 hrs to fix

Function pre_install_packages has a Cognitive Complexity of 16 (exceeds 5 allowed). Consider refactoring.
Open

def pre_install_packages(notebook_path, venv_dir):
    # Read the notebook and extract the pip commands
    with open(notebook_path, "r", encoding="utf-8") as f:
        nb = nbformat.read(f, as_version=4)
    pip_commands = []
Severity: Minor
Found in python/test_notebooks/notebook_tests.py - About 2 hrs to fix

Function calculate_drift_scores has a Cognitive Complexity of 16 (exceeds 5 allowed). Consider refactoring.
Open

def calculate_drift_scores(
    target_view: DatasetProfileView,
    reference_view: DatasetProfileView,
    drift_map: Optional[Dict[str, ColumnDriftAlgorithm]] = None,
    with_thresholds=False,
Severity: Minor
Found in python/whylogs/viz/drift/column_drift_algorithms.py - About 2 hrs to fix

Function from_protobuf has a Cognitive Complexity of 16 (exceeds 5 allowed). Consider refactoring.
Open

    def from_protobuf(cls, msg: ColumnMessage) -> "ColumnProfileView":
        # importing to trigger registration of non-standard metrics
        import whylogs.experimental.core.metrics.udf_metric  # noqa

        # These require numpy & PIL, but we assume users will install
Severity: Minor
Found in python/whylogs/core/view/column_profile_view.py - About 2 hrs to fix

Similar blocks of code found in 8 locations. Consider refactoring.
Open

      feature[1].estUniqueVal.map((estUniqueVal, index) => {
        if (selectedProfiles.includes(String(index))) {
          estUniqueVal ? (estUniqueValString += `<div>${estUniqueVal}</div>`) : (estUniqueValString += `<div>$-</div>`);
        }
      });
Severity: Major
Found in python/whylogs/viz/html/js/whylogs-script.js and 7 other locations - About 2 hrs to fix
python/whylogs/viz/html/js/whylogs-script.js on lines 807..811
python/whylogs/viz/html/js/whylogs-script.js on lines 813..817
python/whylogs/viz/html/js/whylogs-script.js on lines 819..823
python/whylogs/viz/html/js/whylogs-script.js on lines 831..835
python/whylogs/viz/html/js/whylogs-script.js on lines 837..841
python/whylogs/viz/html/js/whylogs-script.js on lines 843..847
python/whylogs/viz/html/js/whylogs-script.js on lines 849..855

This issue has a mass of 76.

Similar blocks of code found in 8 locations. Consider refactoring.
Open

      feature[1].mean.forEach((mean, index) => {
        if (selectedProfiles.includes(String(index))) {
          mean ? (meanString += `<div>${mean}</div>`) : (meanString += `<div>-</div>`);
        }
      });
Severity: Major
Found in python/whylogs/viz/html/js/whylogs-script.js and 7 other locations - About 2 hrs to fix
python/whylogs/viz/html/js/whylogs-script.js on lines 807..811
python/whylogs/viz/html/js/whylogs-script.js on lines 813..817
python/whylogs/viz/html/js/whylogs-script.js on lines 819..823
python/whylogs/viz/html/js/whylogs-script.js on lines 825..829
python/whylogs/viz/html/js/whylogs-script.js on lines 837..841
python/whylogs/viz/html/js/whylogs-script.js on lines 843..847
python/whylogs/viz/html/js/whylogs-script.js on lines 849..855

This issue has a mass of 76.

Similar blocks of code found in 8 locations. Consider refactoring.
Open

      feature[1].totalCount.forEach((totalCount, index) => {
        if (selectedProfiles.includes(String(index))) {
          totalCount ? (totalCountString += `<div>${totalCount}</div>`) : (totalCountString += `<div>-</div>`);
        }
      });
Severity: Major
Found in python/whylogs/viz/html/js/whylogs-script.js and 7 other locations - About 2 hrs to fix
python/whylogs/viz/html/js/whylogs-script.js on lines 807..811
python/whylogs/viz/html/js/whylogs-script.js on lines 819..823
python/whylogs/viz/html/js/whylogs-script.js on lines 825..829
python/whylogs/viz/html/js/whylogs-script.js on lines 831..835
python/whylogs/viz/html/js/whylogs-script.js on lines 837..841
python/whylogs/viz/html/js/whylogs-script.js on lines 843..847
python/whylogs/viz/html/js/whylogs-script.js on lines 849..855

This issue has a mass of 76.

Similar blocks of code found in 8 locations. Consider refactoring.
Open

      feature[1].stddev.forEach((stddev, index) => {
        if (selectedProfiles.includes(String(index))) {
          stddev ? (stddevString += `<div>${stddev}</div>`) : (stddevString += `<div>-</div>`);
        }
      });
Severity: Major
Found in python/whylogs/viz/html/js/whylogs-script.js and 7 other locations - About 2 hrs to fix
python/whylogs/viz/html/js/whylogs-script.js on lines 807..811
python/whylogs/viz/html/js/whylogs-script.js on lines 813..817
python/whylogs/viz/html/js/whylogs-script.js on lines 819..823
python/whylogs/viz/html/js/whylogs-script.js on lines 825..829
python/whylogs/viz/html/js/whylogs-script.js on lines 831..835
python/whylogs/viz/html/js/whylogs-script.js on lines 843..847
python/whylogs/viz/html/js/whylogs-script.js on lines 849..855

This issue has a mass of 76.

Similar blocks of code found in 8 locations. Consider refactoring.
Open

      feature[1].inferredType.forEach((inferredType, index) => {
        if (selectedProfiles.includes(String(index))) {
          inferredType ? (inferredTypeString += `<div>${inferredType}</div>`) : (inferredTypeString += `<div>-</div>`);
        }
      });
Severity: Major
Found in python/whylogs/viz/html/js/whylogs-script.js and 7 other locations - About 2 hrs to fix
python/whylogs/viz/html/js/whylogs-script.js on lines 813..817
python/whylogs/viz/html/js/whylogs-script.js on lines 819..823
python/whylogs/viz/html/js/whylogs-script.js on lines 825..829
python/whylogs/viz/html/js/whylogs-script.js on lines 831..835
python/whylogs/viz/html/js/whylogs-script.js on lines 837..841
python/whylogs/viz/html/js/whylogs-script.js on lines 843..847
python/whylogs/viz/html/js/whylogs-script.js on lines 849..855

This issue has a mass of 76.
