tensorflow/models

official/legacy/transformer/utils/tokenizer.py

Summary

Maintainability: D (about 2 days of remediation effort)
Test Coverage: n/a

File tokenizer.py has 474 lines of code (exceeds 250 allowed). Consider refactoring.
Open

# Copyright 2024 The TensorFlow Authors. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
Severity: Minor
Found in official/legacy/transformer/utils/tokenizer.py - About 7 hrs to fix

Function _count_tokens has a Cognitive Complexity of 20 (exceeds 5 allowed). Consider refactoring.
Open

def _count_tokens(files,
                  file_byte_limit=1e6,
                  correct_strip=True,
                  master_char_set=None):
  """Return token counts of words in the files.
Severity: Minor
Found in official/legacy/transformer/utils/tokenizer.py - About 2 hrs to fix

Cognitive Complexity

Cognitive Complexity is a measure of how difficult a unit of code is to intuitively understand. Unlike Cyclomatic Complexity, which determines how difficult your code will be to test, Cognitive Complexity tells you how difficult your code will be to read and comprehend.

A method's cognitive complexity is based on a few simple rules:

• Code is not considered more complex when it uses shorthand that the language provides for collapsing multiple statements into one
• Code is considered more complex for each "break in the linear flow of the code"
• Code is considered more complex when "flow breaking structures are nested"
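
To make these rules concrete, here is an illustrative pair of functions. The scores are approximations of the published rules, not output from this analyzer: each flow-breaking statement adds a point, each level of nesting it sits under adds another, and a run of identical boolean operators adds only a single point.

def nested_sum(xs):
  total = 0
  for x in xs:                  # +1
    if x > 0:                   # +1, plus +1 for nesting
      if x % 2 == 0:            # +1, plus +2 for deeper nesting
        total += x
  return total                  # cognitive complexity of roughly 6

def flat_sum(xs):
  total = 0
  for x in xs:                  # +1
    if x > 0 and x % 2 == 0:    # +1, plus +1 nesting, plus +1 for "and"
      total += x
  return total                  # cognitive complexity of roughly 4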

Function _gen_new_subtoken_list has a Cognitive Complexity of 14 (exceeds 5 allowed). Consider refactoring.
Open

def _gen_new_subtoken_list(subtoken_counts,
                           min_count,
                           alphabet,
                           reserved_tokens=None):
  """Generate candidate subtokens ordered by count, and new max subtoken length.
Severity: Minor
Found in official/legacy/transformer/utils/tokenizer.py - About 1 hr to fix
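
One way to lower the score is to separate the filtering, ordering, and length-tracking concerns. The sketch below is not the repository's implementation; it only mirrors the docstring's contract (a count-ordered candidate list plus the new maximum subtoken length), and the tie-breaking and alphabet handling are assumptions.

def gen_new_subtoken_list_sketch(subtoken_counts, min_count, alphabet,
                                 reserved_tokens=None):
  """Sketch: count-ordered candidates and the new max subtoken length."""
  reserved_tokens = reserved_tokens or []
  # Keep only candidates that clear the frequency threshold, most common first.
  candidates = sorted(
      ((count, subtoken) for subtoken, count in subtoken_counts.items()
       if count >= min_count),
      reverse=True)
  subtoken_list = reserved_tokens + [s for _, s in candidates]
  # Every alphabet character must stay encodable, regardless of its count.
  seen = set(subtoken_list)
  subtoken_list += [c for c in alphabet if c not in seen]
  max_subtoken_length = max((len(s) for s in subtoken_list), default=0)
  return subtoken_list, max_subtoken_length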

Function _generate_subtokens_with_target_vocab_size has a Cognitive Complexity of 10 (exceeds 5 allowed). Consider refactoring.
Open

def _generate_subtokens_with_target_vocab_size(token_counts,
                                               alphabet,
                                               target_size,
                                               threshold,
                                               min_count=None,
Severity: Minor
Found in official/legacy/transformer/utils/tokenizer.py - About 1 hr to fix
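
The function name and the threshold parameter suggest a search over min_count until the vocabulary size lands within threshold of target_size. A hedged sketch of that strategy, using a hypothetical generate_subtokens(token_counts, alphabet, min_count) helper in place of the real generator:

def search_min_count_sketch(token_counts, alphabet, target_size, threshold,
                            lo=1, hi=1000):
  """Binary-search min_count: a higher cutoff yields a smaller vocabulary."""
  best = generate_subtokens(token_counts, alphabet, lo)  # hypothetical helper
  while lo <= hi:
    mid = (lo + hi) // 2
    subtokens = generate_subtokens(token_counts, alphabet, mid)
    if abs(len(subtokens) - target_size) < abs(len(best) - target_size):
      best = subtokens
    if abs(len(subtokens) - target_size) <= threshold:
      return subtokens
    if len(subtokens) > target_size:
      lo = mid + 1  # vocabulary too large: raise the frequency cutoff
    else:
      hi = mid - 1  # vocabulary too small: lower the cutoff
  return best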

Function init_from_files has 8 arguments (exceeds 4 allowed). Consider refactoring.
Open

  def init_from_files(vocab_file,
Severity: Major
Found in official/legacy/transformer/utils/tokenizer.py - About 1 hr to fix
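
A common remedy for wide signatures like this one (and for the 6- and 5-argument findings below) is to group the knobs into a single value object. In the sketch, only vocab_file is confirmed by the excerpt; the other field names are hypothetical stand-ins for the remaining seven arguments.

import dataclasses
from typing import Optional, Sequence

@dataclasses.dataclass
class SubtokenizerConfig:
  vocab_file: str                 # confirmed by the excerpt
  files: Sequence[str] = ()       # every field below is a guessed name
  target_vocab_size: Optional[int] = None
  threshold: Optional[int] = None
  min_count: Optional[int] = None
  file_byte_limit: float = 1e6
  reserved_tokens: Optional[Sequence[str]] = None
  correct_strip: bool = True

def init_from_config(config: SubtokenizerConfig):
  # Unpack the fields and forward them to the existing implementation.
  ...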

Avoid deeply nested control flow statements.
Open

          if file_byte_budget < 0:
            break
          if correct_strip:
Severity: Major
Found in official/legacy/transformer/utils/tokenizer.py - About 45 mins to fix

Avoid deeply nested control flow statements.
Open

          if correct_strip:
            line = native_to_unicode(line)
          line = line.strip()
Severity: Major
Found in official/legacy/transformer/utils/tokenizer.py - About 45 mins to fix

Avoid deeply nested control flow statements.
Open

          for token in _split_string_to_tokens(
              native_to_unicode(line), master_char_set):
            token_counts[token] += 1
  return token_counts
Severity: Major
Found in official/legacy/transformer/utils/tokenizer.py - About 45 mins to fix
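
All three nested-flow findings point into the same loop body of _count_tokens. A sketch of one way to flatten it: extract the per-line work into a helper so break and guard clauses replace nesting levels. Only the excerpted lines are confirmed; the file-reading and sampling logic around them is omitted here, and native_to_unicode and _split_string_to_tokens are the module's own helpers.

def _count_line_tokens_sketch(line, token_counts, correct_strip,
                              master_char_set):
  """Count tokens on one line; return the stripped line's length."""
  if correct_strip:
    line = native_to_unicode(line)
  line = line.strip()
  for token in _split_string_to_tokens(
      native_to_unicode(line), master_char_set):
    token_counts[token] += 1
  return len(line)

def _count_file_tokens_sketch(reader, token_counts, file_byte_limit,
                              correct_strip, master_char_set):
  file_byte_budget = file_byte_limit
  for line in reader:
    if file_byte_budget < 0:
      break  # the early exit replaces one level of nesting
    file_byte_budget -= _count_line_tokens_sketch(
        line, token_counts, correct_strip, master_char_set)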

Function _split_string_to_tokens has a Cognitive Complexity of 8 (exceeds 5 allowed). Consider refactoring.
Open

def _split_string_to_tokens(text, master_char_set):
  """Splits text to a list of string tokens."""
  if not text:
    return []
  ret = []
Severity: Minor
Found in official/legacy/transformer/utils/tokenizer.py - About 45 mins to fix
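
The core of this function is grouping consecutive characters by whether they belong to master_char_set. A simplified sketch with itertools.groupby shows the idea; the production code likely adds special handling (for example, of spaces between tokens) that is omitted here.

import itertools

def split_string_to_tokens_sketch(text, master_char_set):
  """Split text into runs of in-set and out-of-set characters."""
  if not text:
    return []
  return ["".join(run) for _, run in
          itertools.groupby(text, key=lambda ch: ch in master_char_set)]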

Function _generate_subtokens_with_target_vocab_size has 6 arguments (exceeds 4 allowed). Consider refactoring.
Open

def _generate_subtokens_with_target_vocab_size(token_counts,
Severity: Minor
Found in official/legacy/transformer/utils/tokenizer.py - About 45 mins to fix

Function _generate_subtokens has 5 arguments (exceeds 4 allowed). Consider refactoring.
Open

def _generate_subtokens(token_counts,
Severity: Minor
Found in official/legacy/transformer/utils/tokenizer.py - About 35 mins to fix

Function _split_token_to_subtokens has a Cognitive Complexity of 7 (exceeds 5 allowed). Consider refactoring.
Open

def _split_token_to_subtokens(token, subtoken_dict, max_subtoken_length):
  """Splits a token into subtokens defined in the subtoken dict."""
  ret = []
  start = 0
  token_len = len(token)
Severity: Minor
Found in official/legacy/transformer/utils/tokenizer.py - About 35 mins to fix
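
The ret/start/token_len bookkeeping in the excerpt is the usual setup for greedy longest-match ("maximal munch") splitting. A sketch of that algorithm follows; the failure handling is an assumption rather than a quote from the file.

def split_token_to_subtokens_sketch(token, subtoken_dict, max_subtoken_length):
  ret = []
  start = 0
  token_len = len(token)
  while start < token_len:
    # Try the longest window first and shrink until a vocabulary hit.
    for end in range(min(token_len, start + max_subtoken_length), start, -1):
      candidate = token[start:end]
      if candidate in subtoken_dict:
        ret.append(candidate)
        start = end
        break
    else:
      raise ValueError("Unable to split token %r into subtokens" % token)
  return ret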

Function _unescape_token has a Cognitive Complexity of 7 (exceeds 5 allowed). Consider refactoring.
Open

def _unescape_token(token):
  r"""Replaces escaped characters in the token with their unescaped versions.

  Applies inverse transformations as _escape_token():
    1. Replace "\u" with "_", and "\\" with "\".
Severity: Minor
Found in official/legacy/transformer/utils/tokenizer.py - About 35 mins to fix
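
Rule 1 is quoted in the excerpt; the docstring is cut off before any further rules. A sketch of the inverse transformation, where the numeric "\ddd;" escape for out-of-alphabet characters is an assumption about the matching _escape_token() format:

import re

_UNESCAPE_RE = re.compile(r"\\u|\\\\|\\([0-9]+);")

def unescape_token_sketch(token):
  def sub(match):
    if match.group(1) is None:
      # Rule 1 from the docstring: "\u" -> "_" and "\\" -> "\".
      return u"_" if match.group(0) == u"\\u" else u"\\"
    try:
      return chr(int(match.group(1)))  # assumed "\ddd;" numeric escape
    except (ValueError, OverflowError):
      return u"\u3013"  # placeholder on malformed escapes; an assumption
  return _UNESCAPE_RE.sub(sub, token)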

Function _count_and_gen_subtokens has a Cognitive Complexity of 6 (exceeds 5 allowed). Consider refactoring.
Open

def _count_and_gen_subtokens(token_counts, alphabet, subtoken_dict,
                             max_subtoken_length):
  """Count number of times subtokens appear, and generate new subtokens.

  Args:
Severity: Minor
Found in official/legacy/transformer/utils/tokenizer.py - About 25 mins to fix
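
Given the other excerpts, the counting step plausibly splits each token with the current vocabulary and then counts every substring that starts at a subtoken boundary, so frequent longer strings can become candidates in the next round. A sketch under that assumption, reusing split_token_to_subtokens_sketch from above and the module's _escape_token (referenced by _unescape_token's docstring):

import collections

def count_and_gen_subtokens_sketch(token_counts, alphabet, subtoken_dict,
                                   max_subtoken_length):
  subtoken_counts = collections.defaultdict(int)
  for token, count in token_counts.items():
    token = _escape_token(token, alphabet)
    subtokens = split_token_to_subtokens_sketch(
        token, subtoken_dict, max_subtoken_length)
    start = 0
    for subtoken in subtokens:
      # Count every substring beginning at this subtoken boundary.
      for end in range(start + 1, len(token) + 1):
        subtoken_counts[token[start:end]] += count
      start += len(subtoken)
  return subtoken_counts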
