File `tokens.py` has 603 lines of code (exceeds 250 allowed). Consider refactoring.
Open

```python
# SPDX-FileCopyrightText: 2020 2020 Hlib Babii <hlibbabii@gmail.com>
#
# SPDX-License-Identifier: Apache-2.0

from abc import ABC, abstractmethod
```

Found in codeprep/preprocess/tokens.py - About 1 day to fix

Function `encode` has a Cognitive Complexity of 44 (exceeds 5 allowed). Consider refactoring.
Open

```python
def encode(words: Dict[str, int], merges: MergeList) -> Dict[str, int]:
    letters_list = {" ".join(to_char_list(k)): v for k, v in words.items()}

    new_letters_list = {}
    for letters, freq in letters_list.items():
```

Found in codeprep/bpepkg/bpe_encode.py - About 6 hrs to fix

Cognitive Complexity

Cognitive Complexity is a measure of how difficult a unit of code is to intuitively understand. Unlike Cyclomatic Complexity, which determines how difficult your code will be to test, Cognitive Complexity tells you how difficult your code will be to read and comprehend.

A method's cognitive complexity is based on a few simple rules:

- Code is not considered more complex when it uses shorthand that the language provides for collapsing multiple statements into one
- Code is considered more complex for each "break in the linear flow of the code"
- Code is considered more complex when "flow breaking structures are nested"
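
The three rules above can be illustrated with a rough sketch. It assumes a deliberately simplified scoring: +1 for each `if`, `for`, `while`, or `except`, plus +1 for every flow-breaking structure that encloses it. The function name `approx_cognitive_complexity` and the two sample snippets are hypothetical, and this is not the scorer Code Climate uses; among other gaps, it treats `elif` as a nested `if` and ignores boolean operators.

```python
import ast

# Node types treated as "breaks in the linear flow" (an assumption of this sketch).
FLOW_BREAKS = (ast.If, ast.For, ast.While, ast.ExceptHandler)


def approx_cognitive_complexity(source: str) -> int:
    """Return a rough score: +1 per flow break, plus +1 per enclosing flow break."""
    tree = ast.parse(source)

    def visit(node: ast.AST, depth: int) -> int:
        score = 0
        for child in ast.iter_child_nodes(node):
            if isinstance(child, FLOW_BREAKS):
                # One point for the break itself, one more per level of nesting.
                score += 1 + depth
                score += visit(child, depth + 1)
            else:
                score += visit(child, depth)
        return score

    return visit(tree, 0)


# Hypothetical example: three nested flow-breaking structures.
NESTED = """
def find(items, target):
    for item in items:
        if item is not None:
            if item == target:
                return True
    return False
"""

# Hypothetical example: the same search written with language shorthand.
SHORTHAND = """
def find(items, target):
    return any(item == target for item in items)
"""

print(approx_cognitive_complexity(NESTED))     # 6  (1 + 2 + 3)
print(approx_cognitive_complexity(SHORTHAND))  # 0
```

In this sketch the nested version scores 1 for the loop, 2 for the first `if`, and 3 for the innermost `if`, while the shorthand version adds nothing, which mirrors why deeply nested functions tend to dominate reports like this one.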

giganticode/codeprep

Showing 82 of 82 total issues

File `tokens.py` has 603 lines of code (exceeds 250 allowed). Consider refactoring.
Open

Function `encode` has a Cognitive Complexity of 44 (exceeds 5 allowed). Consider refactoring.
Open

Function `update_neighbour_index` has a Cognitive Complexity of 42 (exceeds 5 allowed). Consider refactoring.
Open

File `wild_bpe.py` has 395 lines of code (exceeds 250 allowed). Consider refactoring.
Open

File `text.py` has 368 lines of code (exceeds 250 allowed). Consider refactoring.
Open

Function `walk_and_save` has a Cognitive Complexity of 32 (exceeds 5 allowed). Consider refactoring.
Open

Function `get_dir_last_modification` has a Cognitive Complexity of 29 (exceeds 5 allowed). Consider refactoring.
Open

Function `update_location_index` has a Cognitive Complexity of 28 (exceeds 5 allowed). Consider refactoring.
Open

`Dataset` has 29 functions (exceeds 20 allowed). Consider refactoring.
Open

File `vocab.py` has 302 lines of code (exceeds 250 allowed). Consider refactoring.
Open

Function `run` has a Cognitive Complexity of 20 (exceeds 5 allowed). Consider refactoring.
Open

File `codestructure.py` has 274 lines of code (exceeds 250 allowed). Consider refactoring.
Open

Function `create_split_value` has a Cognitive Complexity of 17 (exceeds 5 allowed). Consider refactoring.
Open

Function `merge_vocab` has a Cognitive Complexity of 17 (exceeds 5 allowed). Consider refactoring.
Open

File `dataset.py` has 260 lines of code (exceeds 250 allowed). Consider refactoring.
Open

Function `run` has a Cognitive Complexity of 16 (exceeds 5 allowed). Consider refactoring.
Open

Consider simplifying this complex logical expression.
Open

Function `getsize` has a Cognitive Complexity of 15 (exceeds 5 allowed). Consider refactoring.
Open

Function `init_bpe_data` has a Cognitive Complexity of 15 (exceeds 5 allowed). Consider refactoring.
Open

Function `basic` has 14 arguments (exceeds 4 allowed). Consider refactoring.
Open