Unbabel/OpenKiwi

View on GitHub

Showing 161 of 264 total issues

Identical blocks of code found in 2 locations. Consider refactoring.
Open

        if old_mira:
            predicted_outputs = self.decode(instance, parts, scores)
        else:
            scores_cost = scores + p
            predicted_outputs = self.decode(instance, parts, scores_cost)
Severity: Major
Found in kiwi/models/linear/linear_word_qe_decoder.py and 1 other location - About 1 hr to fix
kiwi/models/linear/structured_decoder.py on lines 39..43

Duplicated Code

Duplicated code can lead to software that is hard to understand and difficult to change. The Don't Repeat Yourself (DRY) principle states:

Every piece of knowledge must have a single, unambiguous, authoritative representation within a system.

When you violate DRY, bugs and maintenance problems are sure to follow. Duplicated code has a tendency to both continue to replicate and also to diverge (leaving bugs as two similar implementations differ in subtle ways).

Tuning

This issue has a mass of 42.

We set useful threshold defaults for the languages we support but you may want to adjust these settings based on your project guidelines.

The threshold configuration represents the minimum mass a code block must have to be analyzed for duplication. The lower the threshold, the more fine-grained the comparison.

If the engine is too easily reporting duplication, try raising the threshold. If you suspect that the engine isn't catching enough duplication, try lowering the threshold. The best setting tends to differ from language to language.

See codeclimate-duplication's documentation for more information about tuning the mass threshold in your .codeclimate.yml.

Refactorings

Further Reading

Identical blocks of code found in 2 locations. Consider refactoring.
Open

        if index < 0 or index >= len(sentence_word_features):
            info = None
        else:
            info = sentence_word_features[index]
Severity: Major
Found in kiwi/models/linear/linear_word_qe_features.py and 1 other location - About 1 hr to fix
kiwi/models/linear/linear_word_qe_features.py on lines 120..123

Duplicated Code

Duplicated code can lead to software that is hard to understand and difficult to change. The Don't Repeat Yourself (DRY) principle states:

Every piece of knowledge must have a single, unambiguous, authoritative representation within a system.

When you violate DRY, bugs and maintenance problems are sure to follow. Duplicated code has a tendency to both continue to replicate and also to diverge (leaving bugs as two similar implementations differ in subtle ways).

Tuning

This issue has a mass of 42.

We set useful threshold defaults for the languages we support but you may want to adjust these settings based on your project guidelines.

The threshold configuration represents the minimum mass a code block must have to be analyzed for duplication. The lower the threshold, the more fine-grained the comparison.

If the engine is too easily reporting duplication, try raising the threshold. If you suspect that the engine isn't catching enough duplication, try lowering the threshold. The best setting tends to differ from language to language.

See codeclimate-duplication's documentation for more information about tuning the mass threshold in your .codeclimate.yml.

Refactorings

Further Reading

Similar blocks of code found in 3 locations. Consider refactoring.
Open

            if use_language_model and not only_basic_features:
                self.add_binary_feature(
                    'F13=%d_%s' % (info.highest_order_ngram_left, label)
                )
                self.add_binary_feature(
Severity: Major
Found in kiwi/models/linear/linear_word_qe_features.py and 2 other locations - About 1 hr to fix
kiwi/models/linear/linear_word_qe_features.py on lines 269..272
kiwi/models/linear/linear_word_qe_features.py on lines 275..280

Duplicated Code

Duplicated code can lead to software that is hard to understand and difficult to change. The Don't Repeat Yourself (DRY) principle states:

Every piece of knowledge must have a single, unambiguous, authoritative representation within a system.

When you violate DRY, bugs and maintenance problems are sure to follow. Duplicated code has a tendency to both continue to replicate and also to diverge (leaving bugs as two similar implementations differ in subtle ways).

Tuning

This issue has a mass of 41.

We set useful threshold defaults for the languages we support but you may want to adjust these settings based on your project guidelines.

The threshold configuration represents the minimum mass a code block must have to be analyzed for duplication. The lower the threshold, the more fine-grained the comparison.

If the engine is too easily reporting duplication, try raising the threshold. If you suspect that the engine isn't catching enough duplication, try lowering the threshold. The best setting tends to differ from language to language.

See codeclimate-duplication's documentation for more information about tuning the mass threshold in your .codeclimate.yml.

Refactorings

Further Reading

Similar blocks of code found in 2 locations. Consider refactoring.
Open

            target_input = F.pad(
                target_input,
                pad=(0, 1),
                value=self.vocabs[const.TARGET].stoi[const.UNALIGNED],
Severity: Major
Found in kiwi/models/quetch.py and 1 other location - About 1 hr to fix
kiwi/models/quetch.py on lines 246..249

Duplicated Code

Duplicated code can lead to software that is hard to understand and difficult to change. The Don't Repeat Yourself (DRY) principle states:

Every piece of knowledge must have a single, unambiguous, authoritative representation within a system.

When you violate DRY, bugs and maintenance problems are sure to follow. Duplicated code has a tendency to both continue to replicate and also to diverge (leaving bugs as two similar implementations differ in subtle ways).

Tuning

This issue has a mass of 41.

We set useful threshold defaults for the languages we support but you may want to adjust these settings based on your project guidelines.

The threshold configuration represents the minimum mass a code block must have to be analyzed for duplication. The lower the threshold, the more fine-grained the comparison.

If the engine is too easily reporting duplication, try raising the threshold. If you suspect that the engine isn't catching enough duplication, try lowering the threshold. The best setting tends to differ from language to language.

See codeclimate-duplication's documentation for more information about tuning the mass threshold in your .codeclimate.yml.

Refactorings

Further Reading

Similar blocks of code found in 3 locations. Consider refactoring.
Open

            if not only_basic_features:
                self.add_binary_feature('F21=%s_%s' % (info.target_pos, label))
                self.add_binary_feature(
                    'F22=%s_%s' % (info.aligned_source_pos_list, label)
Severity: Major
Found in kiwi/models/linear/linear_word_qe_features.py and 2 other locations - About 1 hr to fix
kiwi/models/linear/linear_word_qe_features.py on lines 242..247
kiwi/models/linear/linear_word_qe_features.py on lines 275..280

Duplicated Code

Duplicated code can lead to software that is hard to understand and difficult to change. The Don't Repeat Yourself (DRY) principle states:

Every piece of knowledge must have a single, unambiguous, authoritative representation within a system.

When you violate DRY, bugs and maintenance problems are sure to follow. Duplicated code has a tendency to both continue to replicate and also to diverge (leaving bugs as two similar implementations differ in subtle ways).

Tuning

This issue has a mass of 41.

We set useful threshold defaults for the languages we support but you may want to adjust these settings based on your project guidelines.

The threshold configuration represents the minimum mass a code block must have to be analyzed for duplication. The lower the threshold, the more fine-grained the comparison.

If the engine is too easily reporting duplication, try raising the threshold. If you suspect that the engine isn't catching enough duplication, try lowering the threshold. The best setting tends to differ from language to language.

See codeclimate-duplication's documentation for more information about tuning the mass threshold in your .codeclimate.yml.

Refactorings

Further Reading

Similar blocks of code found in 2 locations. Consider refactoring.
Open

            source_input = F.pad(
                source_input,
                pad=(0, 1),
                value=self.vocabs[const.SOURCE].stoi[const.UNALIGNED],
Severity: Major
Found in kiwi/models/quetch.py and 1 other location - About 1 hr to fix
kiwi/models/quetch.py on lines 241..244

Duplicated Code

Duplicated code can lead to software that is hard to understand and difficult to change. The Don't Repeat Yourself (DRY) principle states:

Every piece of knowledge must have a single, unambiguous, authoritative representation within a system.

When you violate DRY, bugs and maintenance problems are sure to follow. Duplicated code has a tendency to both continue to replicate and also to diverge (leaving bugs as two similar implementations differ in subtle ways).

Tuning

This issue has a mass of 41.

We set useful threshold defaults for the languages we support but you may want to adjust these settings based on your project guidelines.

The threshold configuration represents the minimum mass a code block must have to be analyzed for duplication. The lower the threshold, the more fine-grained the comparison.

If the engine is too easily reporting duplication, try raising the threshold. If you suspect that the engine isn't catching enough duplication, try lowering the threshold. The best setting tends to differ from language to language.

See codeclimate-duplication's documentation for more information about tuning the mass threshold in your .codeclimate.yml.

Refactorings

Further Reading

Similar blocks of code found in 3 locations. Consider refactoring.
Open

            if use_unuseful_shared_task_features:
                self.add_binary_feature(
                    'F23=%d_%s' % (info.polysemy_count_source, label)
                )
                self.add_binary_feature(
Severity: Major
Found in kiwi/models/linear/linear_word_qe_features.py and 2 other locations - About 1 hr to fix
kiwi/models/linear/linear_word_qe_features.py on lines 242..247
kiwi/models/linear/linear_word_qe_features.py on lines 269..272

Duplicated Code

Duplicated code can lead to software that is hard to understand and difficult to change. The Don't Repeat Yourself (DRY) principle states:

Every piece of knowledge must have a single, unambiguous, authoritative representation within a system.

When you violate DRY, bugs and maintenance problems are sure to follow. Duplicated code has a tendency to both continue to replicate and also to diverge (leaving bugs as two similar implementations differ in subtle ways).

Tuning

This issue has a mass of 41.

We set useful threshold defaults for the languages we support but you may want to adjust these settings based on your project guidelines.

The threshold configuration represents the minimum mass a code block must have to be analyzed for duplication. The lower the threshold, the more fine-grained the comparison.

If the engine is too easily reporting duplication, try raising the threshold. If you suspect that the engine isn't catching enough duplication, try lowering the threshold. The best setting tends to differ from language to language.

See codeclimate-duplication's documentation for more information about tuning the mass threshold in your .codeclimate.yml.

Refactorings

Further Reading

Similar blocks of code found in 3 locations. Consider refactoring.
Open

        self.S = nn.Parameter(
            torch.zeros(
                2 * self.config.hidden_pred, 2 * self.config.out_embeddings_size
Severity: Major
Found in kiwi/models/predictor.py and 2 other locations - About 1 hr to fix
kiwi/models/predictor.py on lines 160..163
kiwi/models/predictor.py on lines 166..168

Duplicated Code

Duplicated code can lead to software that is hard to understand and difficult to change. The Don't Repeat Yourself (DRY) principle states:

Every piece of knowledge must have a single, unambiguous, authoritative representation within a system.

When you violate DRY, bugs and maintenance problems are sure to follow. Duplicated code has a tendency to both continue to replicate and also to diverge (leaving bugs as two similar implementations differ in subtle ways).

Tuning

This issue has a mass of 40.

We set useful threshold defaults for the languages we support but you may want to adjust these settings based on your project guidelines.

The threshold configuration represents the minimum mass a code block must have to be analyzed for duplication. The lower the threshold, the more fine-grained the comparison.

If the engine is too easily reporting duplication, try raising the threshold. If you suspect that the engine isn't catching enough duplication, try lowering the threshold. The best setting tends to differ from language to language.

See codeclimate-duplication's documentation for more information about tuning the mass threshold in your .codeclimate.yml.

Refactorings

Further Reading

Similar blocks of code found in 2 locations. Consider refactoring.
Open

            target_input, nb_alignments = align_tensor(
                target_input,
                alignments,
                self.config.max_aligned,
                self.config.unaligned_idx,
Severity: Major
Found in kiwi/models/quetch.py and 1 other location - About 1 hr to fix
kiwi/models/quetch.py on lines 264..284

Duplicated Code

Duplicated code can lead to software that is hard to understand and difficult to change. The Don't Repeat Yourself (DRY) principle states:

Every piece of knowledge must have a single, unambiguous, authoritative representation within a system.

When you violate DRY, bugs and maintenance problems are sure to follow. Duplicated code has a tendency to both continue to replicate and also to diverge (leaving bugs as two similar implementations differ in subtle ways).

Tuning

This issue has a mass of 40.

We set useful threshold defaults for the languages we support but you may want to adjust these settings based on your project guidelines.

The threshold configuration represents the minimum mass a code block must have to be analyzed for duplication. The lower the threshold, the more fine-grained the comparison.

If the engine is too easily reporting duplication, try raising the threshold. If you suspect that the engine isn't catching enough duplication, try lowering the threshold. The best setting tends to differ from language to language.

See codeclimate-duplication's documentation for more information about tuning the mass threshold in your .codeclimate.yml.

Refactorings

Further Reading

Similar blocks of code found in 3 locations. Consider refactoring.
Open

        self.V = nn.Parameter(
            torch.zeros(
                2 * self.config.target_embeddings_size,
                2 * self.config.out_embeddings_size,
Severity: Major
Found in kiwi/models/predictor.py and 2 other locations - About 1 hr to fix
kiwi/models/predictor.py on lines 166..168
kiwi/models/predictor.py on lines 171..173

Duplicated Code

Duplicated code can lead to software that is hard to understand and difficult to change. The Don't Repeat Yourself (DRY) principle states:

Every piece of knowledge must have a single, unambiguous, authoritative representation within a system.

When you violate DRY, bugs and maintenance problems are sure to follow. Duplicated code has a tendency to both continue to replicate and also to diverge (leaving bugs as two similar implementations differ in subtle ways).

Tuning

This issue has a mass of 40.

We set useful threshold defaults for the languages we support but you may want to adjust these settings based on your project guidelines.

The threshold configuration represents the minimum mass a code block must have to be analyzed for duplication. The lower the threshold, the more fine-grained the comparison.

If the engine is too easily reporting duplication, try raising the threshold. If you suspect that the engine isn't catching enough duplication, try lowering the threshold. The best setting tends to differ from language to language.

See codeclimate-duplication's documentation for more information about tuning the mass threshold in your .codeclimate.yml.

Refactorings

Further Reading

Similar blocks of code found in 3 locations. Consider refactoring.
Open

        self.C = nn.Parameter(
            torch.zeros(
                2 * self.config.hidden_pred, 2 * self.config.out_embeddings_size
Severity: Major
Found in kiwi/models/predictor.py and 2 other locations - About 1 hr to fix
kiwi/models/predictor.py on lines 160..163
kiwi/models/predictor.py on lines 171..173

Duplicated Code

Duplicated code can lead to software that is hard to understand and difficult to change. The Don't Repeat Yourself (DRY) principle states:

Every piece of knowledge must have a single, unambiguous, authoritative representation within a system.

When you violate DRY, bugs and maintenance problems are sure to follow. Duplicated code has a tendency to both continue to replicate and also to diverge (leaving bugs as two similar implementations differ in subtle ways).

Tuning

This issue has a mass of 40.

We set useful threshold defaults for the languages we support but you may want to adjust these settings based on your project guidelines.

The threshold configuration represents the minimum mass a code block must have to be analyzed for duplication. The lower the threshold, the more fine-grained the comparison.

If the engine is too easily reporting duplication, try raising the threshold. If you suspect that the engine isn't catching enough duplication, try lowering the threshold. The best setting tends to differ from language to language.

See codeclimate-duplication's documentation for more information about tuning the mass threshold in your .codeclimate.yml.

Refactorings

Further Reading

Similar blocks of code found in 2 locations. Consider refactoring.
Open

        if side == const.SOURCE_TAGS:
            alignments = [
                [alignment[::-1] for alignment in example_alignment]
                for example_alignment in alignments
            ]
Severity: Major
Found in kiwi/models/quetch.py and 1 other location - About 1 hr to fix
kiwi/models/quetch.py on lines 269..275

Duplicated Code

Duplicated code can lead to software that is hard to understand and difficult to change. The Don't Repeat Yourself (DRY) principle states:

Every piece of knowledge must have a single, unambiguous, authoritative representation within a system.

When you violate DRY, bugs and maintenance problems are sure to follow. Duplicated code has a tendency to both continue to replicate and also to diverge (leaving bugs as two similar implementations differ in subtle ways).

Tuning

This issue has a mass of 40.

We set useful threshold defaults for the languages we support but you may want to adjust these settings based on your project guidelines.

The threshold configuration represents the minimum mass a code block must have to be analyzed for duplication. The lower the threshold, the more fine-grained the comparison.

If the engine is too easily reporting duplication, try raising the threshold. If you suspect that the engine isn't catching enough duplication, try lowering the threshold. The best setting tends to differ from language to language.

See codeclimate-duplication's documentation for more information about tuning the mass threshold in your .codeclimate.yml.

Refactorings

Further Reading

Similar blocks of code found in 30 locations. Consider refactoring.
Open

    group.add_argument(
        '--predict-gaps',
        type=lambda x: bool(strtobool(x)),
Severity: Major
Found in kiwi/cli/models/predictor_estimator.py and 29 other locations - About 1 hr to fix
kiwi/cli/models/nuqe.py on lines 59..61
kiwi/cli/models/predictor_estimator.py on lines 144..146
kiwi/cli/models/predictor_estimator.py on lines 276..278
kiwi/cli/models/predictor_estimator.py on lines 285..287
kiwi/cli/models/predictor_estimator.py on lines 322..324
kiwi/cli/models/predictor_estimator.py on lines 339..341
kiwi/cli/models/predictor_estimator.py on lines 348..350
kiwi/cli/models/predictor_estimator.py on lines 382..384
kiwi/cli/models/predictor_estimator.py on lines 391..393
kiwi/cli/models/predictor_estimator.py on lines 400..402
kiwi/cli/models/predictor_estimator.py on lines 415..417
kiwi/cli/models/predictor_estimator.py on lines 428..430
kiwi/cli/models/predictor_estimator.py on lines 437..439
kiwi/cli/models/predictor_estimator.py on lines 446..448
kiwi/cli/models/predictor_estimator.py on lines 484..486
kiwi/cli/models/predictor_estimator.py on lines 493..495
kiwi/cli/models/predictor_estimator.py on lines 501..503
kiwi/cli/models/predictor_estimator.py on lines 512..514
kiwi/cli/models/quetch.py on lines 129..131
kiwi/cli/models/quetch.py on lines 137..139
kiwi/cli/models/quetch.py on lines 145..147
kiwi/cli/models/quetch.py on lines 153..155
kiwi/cli/models/quetch.py on lines 217..219
kiwi/cli/models/quetch.py on lines 226..228
kiwi/cli/models/quetch.py on lines 244..246
kiwi/cli/models/quetch.py on lines 295..297
kiwi/cli/opts.py on lines 126..128
kiwi/cli/pipelines/train.py on lines 86..88
kiwi/cli/pipelines/train.py on lines 109..111

Duplicated Code

Duplicated code can lead to software that is hard to understand and difficult to change. The Don't Repeat Yourself (DRY) principle states:

Every piece of knowledge must have a single, unambiguous, authoritative representation within a system.

When you violate DRY, bugs and maintenance problems are sure to follow. Duplicated code has a tendency to both continue to replicate and also to diverge (leaving bugs as two similar implementations differ in subtle ways).

Tuning

This issue has a mass of 39.

We set useful threshold defaults for the languages we support but you may want to adjust these settings based on your project guidelines.

The threshold configuration represents the minimum mass a code block must have to be analyzed for duplication. The lower the threshold, the more fine-grained the comparison.

If the engine is too easily reporting duplication, try raising the threshold. If you suspect that the engine isn't catching enough duplication, try lowering the threshold. The best setting tends to differ from language to language.

See codeclimate-duplication's documentation for more information about tuning the mass threshold in your .codeclimate.yml.

Refactorings

Further Reading

Similar blocks of code found in 30 locations. Consider refactoring.
Open

    group.add_argument(
        '--binary-level',
        type=lambda x: bool(strtobool(x)),
Severity: Major
Found in kiwi/cli/models/predictor_estimator.py and 29 other locations - About 1 hr to fix
kiwi/cli/models/nuqe.py on lines 59..61
kiwi/cli/models/predictor_estimator.py on lines 144..146
kiwi/cli/models/predictor_estimator.py on lines 276..278
kiwi/cli/models/predictor_estimator.py on lines 285..287
kiwi/cli/models/predictor_estimator.py on lines 322..324
kiwi/cli/models/predictor_estimator.py on lines 330..332
kiwi/cli/models/predictor_estimator.py on lines 339..341
kiwi/cli/models/predictor_estimator.py on lines 348..350
kiwi/cli/models/predictor_estimator.py on lines 382..384
kiwi/cli/models/predictor_estimator.py on lines 391..393
kiwi/cli/models/predictor_estimator.py on lines 400..402
kiwi/cli/models/predictor_estimator.py on lines 415..417
kiwi/cli/models/predictor_estimator.py on lines 428..430
kiwi/cli/models/predictor_estimator.py on lines 437..439
kiwi/cli/models/predictor_estimator.py on lines 446..448
kiwi/cli/models/predictor_estimator.py on lines 484..486
kiwi/cli/models/predictor_estimator.py on lines 493..495
kiwi/cli/models/predictor_estimator.py on lines 512..514
kiwi/cli/models/quetch.py on lines 129..131
kiwi/cli/models/quetch.py on lines 137..139
kiwi/cli/models/quetch.py on lines 145..147
kiwi/cli/models/quetch.py on lines 153..155
kiwi/cli/models/quetch.py on lines 217..219
kiwi/cli/models/quetch.py on lines 226..228
kiwi/cli/models/quetch.py on lines 244..246
kiwi/cli/models/quetch.py on lines 295..297
kiwi/cli/opts.py on lines 126..128
kiwi/cli/pipelines/train.py on lines 86..88
kiwi/cli/pipelines/train.py on lines 109..111

Duplicated Code

Duplicated code can lead to software that is hard to understand and difficult to change. The Don't Repeat Yourself (DRY) principle states:

Every piece of knowledge must have a single, unambiguous, authoritative representation within a system.

When you violate DRY, bugs and maintenance problems are sure to follow. Duplicated code has a tendency to both continue to replicate and also to diverge (leaving bugs as two similar implementations differ in subtle ways).

Tuning

This issue has a mass of 39.

We set useful threshold defaults for the languages we support but you may want to adjust these settings based on your project guidelines.

The threshold configuration represents the minimum mass a code block must have to be analyzed for duplication. The lower the threshold, the more fine-grained the comparison.

If the engine is too easily reporting duplication, try raising the threshold. If you suspect that the engine isn't catching enough duplication, try lowering the threshold. The best setting tends to differ from language to language.

See codeclimate-duplication's documentation for more information about tuning the mass threshold in your .codeclimate.yml.

Refactorings

Further Reading

Similar blocks of code found in 30 locations. Consider refactoring.
Open

    group.add_argument(
        '--predict-target',
        type=lambda x: bool(strtobool(x)),
Severity: Major
Found in kiwi/cli/models/quetch.py and 29 other locations - About 1 hr to fix
kiwi/cli/models/nuqe.py on lines 59..61
kiwi/cli/models/predictor_estimator.py on lines 144..146
kiwi/cli/models/predictor_estimator.py on lines 276..278
kiwi/cli/models/predictor_estimator.py on lines 285..287
kiwi/cli/models/predictor_estimator.py on lines 322..324
kiwi/cli/models/predictor_estimator.py on lines 330..332
kiwi/cli/models/predictor_estimator.py on lines 339..341
kiwi/cli/models/predictor_estimator.py on lines 348..350
kiwi/cli/models/predictor_estimator.py on lines 382..384
kiwi/cli/models/predictor_estimator.py on lines 391..393
kiwi/cli/models/predictor_estimator.py on lines 400..402
kiwi/cli/models/predictor_estimator.py on lines 415..417
kiwi/cli/models/predictor_estimator.py on lines 428..430
kiwi/cli/models/predictor_estimator.py on lines 437..439
kiwi/cli/models/predictor_estimator.py on lines 446..448
kiwi/cli/models/predictor_estimator.py on lines 484..486
kiwi/cli/models/predictor_estimator.py on lines 493..495
kiwi/cli/models/predictor_estimator.py on lines 501..503
kiwi/cli/models/predictor_estimator.py on lines 512..514
kiwi/cli/models/quetch.py on lines 137..139
kiwi/cli/models/quetch.py on lines 145..147
kiwi/cli/models/quetch.py on lines 153..155
kiwi/cli/models/quetch.py on lines 217..219
kiwi/cli/models/quetch.py on lines 226..228
kiwi/cli/models/quetch.py on lines 244..246
kiwi/cli/models/quetch.py on lines 295..297
kiwi/cli/opts.py on lines 126..128
kiwi/cli/pipelines/train.py on lines 86..88
kiwi/cli/pipelines/train.py on lines 109..111

Duplicated Code

Duplicated code can lead to software that is hard to understand and difficult to change. The Don't Repeat Yourself (DRY) principle states:

Every piece of knowledge must have a single, unambiguous, authoritative representation within a system.

When you violate DRY, bugs and maintenance problems are sure to follow. Duplicated code has a tendency to both continue to replicate and also to diverge (leaving bugs as two similar implementations differ in subtle ways).

Tuning

This issue has a mass of 39.

We set useful threshold defaults for the languages we support but you may want to adjust these settings based on your project guidelines.

The threshold configuration represents the minimum mass a code block must have to be analyzed for duplication. The lower the threshold, the more fine-grained the comparison.

If the engine is too easily reporting duplication, try raising the threshold. If you suspect that the engine isn't catching enough duplication, try lowering the threshold. The best setting tends to differ from language to language.

See codeclimate-duplication's documentation for more information about tuning the mass threshold in your .codeclimate.yml.

Refactorings

Further Reading

Similar blocks of code found in 30 locations. Consider refactoring.
Open

    group.add_argument(
        '--binary-level',
        type=lambda x: bool(strtobool(x)),
Severity: Major
Found in kiwi/cli/models/predictor_estimator.py and 29 other locations - About 1 hr to fix
kiwi/cli/models/nuqe.py on lines 59..61
kiwi/cli/models/predictor_estimator.py on lines 144..146
kiwi/cli/models/predictor_estimator.py on lines 276..278
kiwi/cli/models/predictor_estimator.py on lines 285..287
kiwi/cli/models/predictor_estimator.py on lines 322..324
kiwi/cli/models/predictor_estimator.py on lines 330..332
kiwi/cli/models/predictor_estimator.py on lines 339..341
kiwi/cli/models/predictor_estimator.py on lines 348..350
kiwi/cli/models/predictor_estimator.py on lines 382..384
kiwi/cli/models/predictor_estimator.py on lines 391..393
kiwi/cli/models/predictor_estimator.py on lines 400..402
kiwi/cli/models/predictor_estimator.py on lines 415..417
kiwi/cli/models/predictor_estimator.py on lines 428..430
kiwi/cli/models/predictor_estimator.py on lines 446..448
kiwi/cli/models/predictor_estimator.py on lines 484..486
kiwi/cli/models/predictor_estimator.py on lines 493..495
kiwi/cli/models/predictor_estimator.py on lines 501..503
kiwi/cli/models/predictor_estimator.py on lines 512..514
kiwi/cli/models/quetch.py on lines 129..131
kiwi/cli/models/quetch.py on lines 137..139
kiwi/cli/models/quetch.py on lines 145..147
kiwi/cli/models/quetch.py on lines 153..155
kiwi/cli/models/quetch.py on lines 217..219
kiwi/cli/models/quetch.py on lines 226..228
kiwi/cli/models/quetch.py on lines 244..246
kiwi/cli/models/quetch.py on lines 295..297
kiwi/cli/opts.py on lines 126..128
kiwi/cli/pipelines/train.py on lines 86..88
kiwi/cli/pipelines/train.py on lines 109..111

Duplicated Code

Duplicated code can lead to software that is hard to understand and difficult to change. The Don't Repeat Yourself (DRY) principle states:

Every piece of knowledge must have a single, unambiguous, authoritative representation within a system.

When you violate DRY, bugs and maintenance problems are sure to follow. Duplicated code has a tendency to both continue to replicate and also to diverge (leaving bugs as two similar implementations differ in subtle ways).

Tuning

This issue has a mass of 39.

We set useful threshold defaults for the languages we support but you may want to adjust these settings based on your project guidelines.

The threshold configuration represents the minimum mass a code block must have to be analyzed for duplication. The lower the threshold, the more fine-grained the comparison.

If the engine is too easily reporting duplication, try raising the threshold. If you suspect that the engine isn't catching enough duplication, try lowering the threshold. The best setting tends to differ from language to language.

See codeclimate-duplication's documentation for more information about tuning the mass threshold in your .codeclimate.yml.

Refactorings

Further Reading

Similar blocks of code found in 30 locations. Consider refactoring.
Open

    group.add_argument(
        '--mlflow-always-log-artifacts',
        type=lambda x: bool(strtobool(x)),
Severity: Major
Found in kiwi/cli/opts.py and 29 other locations - About 1 hr to fix
kiwi/cli/models/nuqe.py on lines 59..61
kiwi/cli/models/predictor_estimator.py on lines 144..146
kiwi/cli/models/predictor_estimator.py on lines 276..278
kiwi/cli/models/predictor_estimator.py on lines 285..287
kiwi/cli/models/predictor_estimator.py on lines 322..324
kiwi/cli/models/predictor_estimator.py on lines 330..332
kiwi/cli/models/predictor_estimator.py on lines 339..341
kiwi/cli/models/predictor_estimator.py on lines 348..350
kiwi/cli/models/predictor_estimator.py on lines 382..384
kiwi/cli/models/predictor_estimator.py on lines 391..393
kiwi/cli/models/predictor_estimator.py on lines 400..402
kiwi/cli/models/predictor_estimator.py on lines 415..417
kiwi/cli/models/predictor_estimator.py on lines 428..430
kiwi/cli/models/predictor_estimator.py on lines 437..439
kiwi/cli/models/predictor_estimator.py on lines 446..448
kiwi/cli/models/predictor_estimator.py on lines 484..486
kiwi/cli/models/predictor_estimator.py on lines 493..495
kiwi/cli/models/predictor_estimator.py on lines 501..503
kiwi/cli/models/predictor_estimator.py on lines 512..514
kiwi/cli/models/quetch.py on lines 129..131
kiwi/cli/models/quetch.py on lines 137..139
kiwi/cli/models/quetch.py on lines 145..147
kiwi/cli/models/quetch.py on lines 153..155
kiwi/cli/models/quetch.py on lines 217..219
kiwi/cli/models/quetch.py on lines 226..228
kiwi/cli/models/quetch.py on lines 244..246
kiwi/cli/models/quetch.py on lines 295..297
kiwi/cli/pipelines/train.py on lines 86..88
kiwi/cli/pipelines/train.py on lines 109..111

Duplicated Code

Duplicated code can lead to software that is hard to understand and difficult to change. The Don't Repeat Yourself (DRY) principle states:

Every piece of knowledge must have a single, unambiguous, authoritative representation within a system.

When you violate DRY, bugs and maintenance problems are sure to follow. Duplicated code has a tendency to both continue to replicate and also to diverge (leaving bugs as two similar implementations differ in subtle ways).

Tuning

This issue has a mass of 39.

We set useful threshold defaults for the languages we support but you may want to adjust these settings based on your project guidelines.

The threshold configuration represents the minimum mass a code block must have to be analyzed for duplication. The lower the threshold, the more fine-grained the comparison.

If the engine is too easily reporting duplication, try raising the threshold. If you suspect that the engine isn't catching enough duplication, try lowering the threshold. The best setting tends to differ from language to language.

See codeclimate-duplication's documentation for more information about tuning the mass threshold in your .codeclimate.yml.

Refactorings

Further Reading

Similar blocks of code found in 30 locations. Consider refactoring.
Open

    group.add_argument(
        '--freeze-embeddings',
        type=lambda x: bool(strtobool(x)),
Severity: Major
Found in kiwi/cli/models/quetch.py and 29 other locations - About 1 hr to fix
kiwi/cli/models/nuqe.py on lines 59..61
kiwi/cli/models/predictor_estimator.py on lines 144..146
kiwi/cli/models/predictor_estimator.py on lines 276..278
kiwi/cli/models/predictor_estimator.py on lines 285..287
kiwi/cli/models/predictor_estimator.py on lines 322..324
kiwi/cli/models/predictor_estimator.py on lines 330..332
kiwi/cli/models/predictor_estimator.py on lines 339..341
kiwi/cli/models/predictor_estimator.py on lines 348..350
kiwi/cli/models/predictor_estimator.py on lines 382..384
kiwi/cli/models/predictor_estimator.py on lines 391..393
kiwi/cli/models/predictor_estimator.py on lines 400..402
kiwi/cli/models/predictor_estimator.py on lines 415..417
kiwi/cli/models/predictor_estimator.py on lines 428..430
kiwi/cli/models/predictor_estimator.py on lines 437..439
kiwi/cli/models/predictor_estimator.py on lines 446..448
kiwi/cli/models/predictor_estimator.py on lines 484..486
kiwi/cli/models/predictor_estimator.py on lines 493..495
kiwi/cli/models/predictor_estimator.py on lines 501..503
kiwi/cli/models/predictor_estimator.py on lines 512..514
kiwi/cli/models/quetch.py on lines 129..131
kiwi/cli/models/quetch.py on lines 137..139
kiwi/cli/models/quetch.py on lines 145..147
kiwi/cli/models/quetch.py on lines 153..155
kiwi/cli/models/quetch.py on lines 217..219
kiwi/cli/models/quetch.py on lines 226..228
kiwi/cli/models/quetch.py on lines 244..246
kiwi/cli/opts.py on lines 126..128
kiwi/cli/pipelines/train.py on lines 86..88
kiwi/cli/pipelines/train.py on lines 109..111

Duplicated Code

Duplicated code can lead to software that is hard to understand and difficult to change. The Don't Repeat Yourself (DRY) principle states:

Every piece of knowledge must have a single, unambiguous, authoritative representation within a system.

When you violate DRY, bugs and maintenance problems are sure to follow. Duplicated code has a tendency to both continue to replicate and also to diverge (leaving bugs as two similar implementations differ in subtle ways).

Tuning

This issue has a mass of 39.

We set useful threshold defaults for the languages we support but you may want to adjust these settings based on your project guidelines.

The threshold configuration represents the minimum mass a code block must have to be analyzed for duplication. The lower the threshold, the more fine-grained the comparison.

If the engine is too easily reporting duplication, try raising the threshold. If you suspect that the engine isn't catching enough duplication, try lowering the threshold. The best setting tends to differ from language to language.

See codeclimate-duplication's documentation for more information about tuning the mass threshold in your .codeclimate.yml.

Refactorings

Further Reading

Similar blocks of code found in 30 locations. Consider refactoring.
Open

    group.add_argument(
        '--checkpoint-save',
        type=lambda x: bool(strtobool(x)),
Severity: Major
Found in kiwi/cli/pipelines/train.py and 29 other locations - About 1 hr to fix
kiwi/cli/models/nuqe.py on lines 59..61
kiwi/cli/models/predictor_estimator.py on lines 144..146
kiwi/cli/models/predictor_estimator.py on lines 276..278
kiwi/cli/models/predictor_estimator.py on lines 285..287
kiwi/cli/models/predictor_estimator.py on lines 322..324
kiwi/cli/models/predictor_estimator.py on lines 330..332
kiwi/cli/models/predictor_estimator.py on lines 339..341
kiwi/cli/models/predictor_estimator.py on lines 348..350
kiwi/cli/models/predictor_estimator.py on lines 382..384
kiwi/cli/models/predictor_estimator.py on lines 391..393
kiwi/cli/models/predictor_estimator.py on lines 400..402
kiwi/cli/models/predictor_estimator.py on lines 415..417
kiwi/cli/models/predictor_estimator.py on lines 428..430
kiwi/cli/models/predictor_estimator.py on lines 437..439
kiwi/cli/models/predictor_estimator.py on lines 446..448
kiwi/cli/models/predictor_estimator.py on lines 484..486
kiwi/cli/models/predictor_estimator.py on lines 493..495
kiwi/cli/models/predictor_estimator.py on lines 501..503
kiwi/cli/models/predictor_estimator.py on lines 512..514
kiwi/cli/models/quetch.py on lines 129..131
kiwi/cli/models/quetch.py on lines 137..139
kiwi/cli/models/quetch.py on lines 145..147
kiwi/cli/models/quetch.py on lines 153..155
kiwi/cli/models/quetch.py on lines 217..219
kiwi/cli/models/quetch.py on lines 226..228
kiwi/cli/models/quetch.py on lines 244..246
kiwi/cli/models/quetch.py on lines 295..297
kiwi/cli/opts.py on lines 126..128
kiwi/cli/pipelines/train.py on lines 109..111

Duplicated Code

Duplicated code can lead to software that is hard to understand and difficult to change. The Don't Repeat Yourself (DRY) principle states:

Every piece of knowledge must have a single, unambiguous, authoritative representation within a system.

When you violate DRY, bugs and maintenance problems are sure to follow. Duplicated code has a tendency to both continue to replicate and also to diverge (leaving bugs as two similar implementations differ in subtle ways).

Tuning

This issue has a mass of 39.

We set useful threshold defaults for the languages we support but you may want to adjust these settings based on your project guidelines.

The threshold configuration represents the minimum mass a code block must have to be analyzed for duplication. The lower the threshold, the more fine-grained the comparison.

If the engine is too easily reporting duplication, try raising the threshold. If you suspect that the engine isn't catching enough duplication, try lowering the threshold. The best setting tends to differ from language to language.

See codeclimate-duplication's documentation for more information about tuning the mass threshold in your .codeclimate.yml.

Refactorings

Further Reading

Similar blocks of code found in 30 locations. Consider refactoring.
Open

    group.add_argument(
        '--predict-source',
        type=lambda x: bool(strtobool(x)),
Severity: Major
Found in kiwi/cli/models/predictor_estimator.py and 29 other locations - About 1 hr to fix
kiwi/cli/models/nuqe.py on lines 59..61
kiwi/cli/models/predictor_estimator.py on lines 144..146
kiwi/cli/models/predictor_estimator.py on lines 276..278
kiwi/cli/models/predictor_estimator.py on lines 285..287
kiwi/cli/models/predictor_estimator.py on lines 322..324
kiwi/cli/models/predictor_estimator.py on lines 330..332
kiwi/cli/models/predictor_estimator.py on lines 339..341
kiwi/cli/models/predictor_estimator.py on lines 382..384
kiwi/cli/models/predictor_estimator.py on lines 391..393
kiwi/cli/models/predictor_estimator.py on lines 400..402
kiwi/cli/models/predictor_estimator.py on lines 415..417
kiwi/cli/models/predictor_estimator.py on lines 428..430
kiwi/cli/models/predictor_estimator.py on lines 437..439
kiwi/cli/models/predictor_estimator.py on lines 446..448
kiwi/cli/models/predictor_estimator.py on lines 484..486
kiwi/cli/models/predictor_estimator.py on lines 493..495
kiwi/cli/models/predictor_estimator.py on lines 501..503
kiwi/cli/models/predictor_estimator.py on lines 512..514
kiwi/cli/models/quetch.py on lines 129..131
kiwi/cli/models/quetch.py on lines 137..139
kiwi/cli/models/quetch.py on lines 145..147
kiwi/cli/models/quetch.py on lines 153..155
kiwi/cli/models/quetch.py on lines 217..219
kiwi/cli/models/quetch.py on lines 226..228
kiwi/cli/models/quetch.py on lines 244..246
kiwi/cli/models/quetch.py on lines 295..297
kiwi/cli/opts.py on lines 126..128
kiwi/cli/pipelines/train.py on lines 86..88
kiwi/cli/pipelines/train.py on lines 109..111

Duplicated Code

Duplicated code can lead to software that is hard to understand and difficult to change. The Don't Repeat Yourself (DRY) principle states:

Every piece of knowledge must have a single, unambiguous, authoritative representation within a system.

When you violate DRY, bugs and maintenance problems are sure to follow. Duplicated code has a tendency to both continue to replicate and also to diverge (leaving bugs as two similar implementations differ in subtle ways).

Tuning

This issue has a mass of 39.

We set useful threshold defaults for the languages we support but you may want to adjust these settings based on your project guidelines.

The threshold configuration represents the minimum mass a code block must have to be analyzed for duplication. The lower the threshold, the more fine-grained the comparison.

If the engine is too easily reporting duplication, try raising the threshold. If you suspect that the engine isn't catching enough duplication, try lowering the threshold. The best setting tends to differ from language to language.

See codeclimate-duplication's documentation for more information about tuning the mass threshold in your .codeclimate.yml.

Refactorings

Further Reading

Severity
Category
Status
Source
Language