chrislit/abydos

abydos/distance/_positional_q_gram_jaccard.py

Summary

Maintainability
A
3 hrs
Test Coverage

Cyclomatic complexity is too high in method sim. (11)
Open

    def sim(self, src: str, tar: str) -> float:
        """Return the Positional Q-Gram Jaccard coefficient of two strings.

        Parameters
        ----------

Cyclomatic Complexity

Cyclomatic Complexity corresponds to the number of decisions a block of code contains plus 1. This number (also called McCabe number) is equal to the number of linearly independent paths through the code. This number can be used as a guide when testing conditional logic in blocks.

Radon analyzes the AST of a Python program to compute Cyclomatic Complexity. Statements have the following effects on Cyclomatic Complexity:

Construct         Effect on CC  Reasoning
if                +1            An if statement is a single decision.
elif              +1            The elif statement adds another decision.
else              +0            The else statement does not cause a new decision. The decision is at the if.
for               +1            There is a decision at the start of the loop.
while             +1            There is a decision at the while statement.
except            +1            Each except branch adds a new conditional path of execution.
finally           +0            The finally block is unconditionally executed.
with              +1            The with statement roughly corresponds to a try/except block (see PEP 343 for details).
assert            +1            The assert statement internally roughly equals a conditional statement.
Comprehension     +1            A list/set/dict comprehension or generator expression is equivalent to a for loop.
Boolean Operator  +1            Every boolean operator (and, or) adds a decision point.

Source: http://radon.readthedocs.org/en/latest/intro.html
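
The table above can be approximated with a few lines of standard-library code. The sketch below is not Radon's implementation: `approx_cc` and `SAMPLE` are hypothetical names, the counter only handles the constructs listed in the table, and it assumes one point per `for` clause inside a comprehension.

```python
import ast

# Node types that add +1 each, following the table above.
# ast.If also covers elif, which the parser represents as a nested If.
DECISION_NODES = (
    ast.If,
    ast.For,
    ast.While,
    ast.ExceptHandler,
    ast.With,
    ast.Assert,
    ast.comprehension,  # one per "for" clause in a comprehension (assumption)
)


def approx_cc(source: str) -> int:
    """Return a rough cyclomatic complexity for a piece of source code."""
    score = 1  # "number of decisions plus 1"
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, DECISION_NODES):
            score += 1
        elif isinstance(node, ast.BoolOp):
            # "a and b and c" is a single BoolOp holding two operators.
            score += len(node.values) - 1
    return score


SAMPLE = '''
def f(xs, limit):
    total = 0
    for x in xs:
        if x > 0 and x < limit:
            total += x
        elif x == 0:
            continue
        else:
            total -= 1
    return total
'''

print(approx_cc(SAMPLE))
```

For `SAMPLE` the count is 1 (base) plus one each for the `for`, the `if`, the `and`, and the `elif`; the `else` adds nothing.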

Function sim has a Cognitive Complexity of 19 (exceeds 5 allowed). Consider refactoring.
Open

    def sim(self, src: str, tar: str) -> float:
        """Return the Positional Q-Gram Jaccard coefficient of two strings.

        Parameters
        ----------
Severity: Minor
Found in abydos/distance/_positional_q_gram_jaccard.py - About 2 hrs to fix

Cognitive Complexity

Cognitive Complexity is a measure of how difficult a unit of code is to intuitively understand. Unlike Cyclomatic Complexity, which determines how difficult your code will be to test, Cognitive Complexity tells you how difficult your code will be to read and comprehend.

A method's cognitive complexity is based on a few simple rules:

  • Code is not considered more complex when it uses shorthand that the language provides for collapsing multiple statements into one
  • Code is considered more complex for each "break in the linear flow of the code"
  • Code is considered more complex when "flow breaking structures are nested"
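
To make the second and third rules concrete, here is a small illustration with two hypothetical functions (neither is taken from abydos) that return the same result. The first stacks conditionals and loops five levels deep; the second uses a guard clause so that the remaining flow breaks are nested less deeply. No exact scores are claimed here, only the direction of the effect.

```python
def first_close_pair_nested(src_positions, tar_positions, max_dist=1):
    """Nested form: every extra level makes the innermost test harder to read."""
    if src_positions:
        if tar_positions:
            for sp in src_positions:
                for tp in tar_positions:
                    if abs(sp - tp) <= max_dist:
                        return (sp, tp)
    return None


def first_close_pair_flat(src_positions, tar_positions, max_dist=1):
    """Flatter form: an early return removes two levels of nesting."""
    if not src_positions or not tar_positions:
        return None
    for sp in src_positions:
        for tp in tar_positions:
            if abs(sp - tp) <= max_dist:
                return (sp, tp)
    return None
```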

Cyclomatic complexity is too high in class PositionalQGramJaccard. (8)
Open

class PositionalQGramJaccard(_Distance):
    r"""Positional Q-Gram Jaccard coefficient.

    Positional Q-Gram Jaccard coefficient :cite:`Gravano:2001,Christen:2006`

Avoid deeply nested control flow statements.
Open

                        if (
                            abs(sp - tp) <= self._max_dist
                            and sp not in src_matched
                            and tp not in tar_matched
                        ):
Severity: Major
Found in abydos/distance/_positional_q_gram_jaccard.py - About 45 mins to fix
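
One common way to address this kind of finding is to move the compound condition into a small predicate so the loop body sits one level shallower. The sketch below is an assumption about how that could look, not the project's code: `can_pair` and `match_positions` are hypothetical names, and the loop body (recording each accepted pair) is guessed for the sake of a runnable example.

```python
from typing import List, Set, Tuple


def can_pair(
    sp: int, tp: int, max_dist: int, src_matched: Set[int], tar_matched: Set[int]
) -> bool:
    """Same three-part test as the flagged condition, extracted into a helper."""
    return (
        abs(sp - tp) <= max_dist
        and sp not in src_matched
        and tp not in tar_matched
    )


def match_positions(
    src_positions: List[int], tar_positions: List[int], max_dist: int = 1
) -> List[Tuple[int, int]]:
    """Pair up positions that lie within max_dist, using each position once."""
    src_matched: Set[int] = set()
    tar_matched: Set[int] = set()
    pairs: List[Tuple[int, int]] = []
    for sp in src_positions:
        for tp in tar_positions:
            if can_pair(sp, tp, max_dist, src_matched, tar_matched):
                src_matched.add(sp)
                tar_matched.add(tp)
                pairs.append((sp, tp))
    return pairs
```

The predicate can be tested on its own, and the boolean operators no longer count against the enclosing function.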

    Refactor this function to reduce its Cognitive Complexity from 19 to the 15 allowed.
    Open

        def sim(self, src: str, tar: str) -> float:

    Cognitive Complexity is a measure of how hard the control flow of a function is to understand. Functions with high Cognitive Complexity will be difficult to maintain.

    Wrong hanging indentation before block (add 4 spaces).
    Open

                                and sp not in src_matched

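This message looks like the output of a continuation-line check that wants hanging indents before a block to be indented four spaces further than the block body, so the two are visually distinct. A minimal sketch of the layout it asks for, using a hypothetical function `within_max_dist` that is not part of abydos:

```python
def within_max_dist(
        src_position: int,
        tar_position: int,
        max_dist: int = 1) -> bool:
    """Continuation lines use 8 spaces; the body below uses 4."""
    if (
            abs(src_position - tar_position) <= max_dist
            and src_position >= 0
            and tar_position >= 0
    ):
        return True
    return False
```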

    Consider using enumerate instead of iterating with range and len
    Open

            for pos in range(len(src_list)):

    Emitted when code that iterates with range and len is encountered. Such code can be simplified by using the enumerate builtin.
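
A minimal sketch of the suggested rewrite, assuming the flagged loop builds a map from each token to the positions where it occurs. `index_positions` is a hypothetical helper and the token list is made-up sample data.

```python
from collections import defaultdict
from typing import Dict, List


def index_positions(tokens: List[str]) -> Dict[str, List[int]]:
    """Map each token to the list of positions at which it appears."""
    positions: Dict[str, List[int]] = defaultdict(list)
    # enumerate yields (index, item) pairs, replacing range(len(tokens))
    # and the tokens[pos] lookup that goes with it.
    for pos, tok in enumerate(tokens):
        positions[tok].append(pos)
    return dict(positions)


print(index_positions(["ab", "bc", "ab"]))
```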

    Wrong hanging indentation before block (add 4 spaces).
    Open

            max_dist: int = 1,

    Wrong hanging indentation before block (add 4 spaces).
    Open

                                and tp not in tar_matched

    Variable name sp doesn't conform to snake_case naming style
    Open

                    for sp in src_pos[tok]:

    Used when the name doesn't conform to naming rules associated to its type (constant, variable, class...).

    Consider using enumerate instead of iterating with range and len
    Open

            for pos in range(len(tar_list)):

    Emitted when code that iterates with range and len is encountered. Such code can be simplified by using the enumerate builtin.

    Wrong hanging indentation before block (add 4 spaces).
    Open

                                abs(sp - tp) <= self._max_dist

    Variable name tp doesn't conform to snake_case naming style
    Open

                        for tp in tar_pos[tok]:

    Used when the name doesn't conform to naming rules associated to its type (constant, variable, class...).

    Wrong hanging indentation before block (add 4 spaces).
    Open

            **kwargs: Any

    Wrong hanging indentation before block (add 4 spaces).
    Open

            self,

    Wrong hanging indentation before block (add 4 spaces).
    Open

            tokenizer: Optional[_Tokenizer] = None,
