Repository: ilevkivskyi/com2ann
Branch: master
Commit: 24a47720d48f
Files: 14
Total size: 71.4 KB

Directory structure:
gitextract_g20eikyr/

├── .flake8
├── .flake8-tests
├── .github/
│   └── workflows/
│       └── lint_python.yml
├── .gitignore
├── .pre-commit-hooks.yaml
├── LICENSE
├── MANIFEST.in
├── README.md
├── mypy.ini
├── setup.py
├── src/
│   ├── com2ann.py
│   ├── test_cli.py
│   └── test_com2ann.py
└── test-requirements.txt

================================================
FILE CONTENTS
================================================

================================================
FILE: .flake8
================================================
[flake8]
max-line-length = 90
ignore =
  B3,
  DW12,
  W504,
  W503,
  E203
exclude =
  src/test_com2ann.py


================================================
FILE: .flake8-tests
================================================
[flake8]
max-line-length = 110
ignore =
  E306,
  E701,
  E704,
  F811,
  B3,
  DW12
exclude =
  src/com2ann.py


================================================
FILE: .github/workflows/lint_python.yml
================================================
name: lint_python
on: [pull_request, push]
jobs:
  lint_python:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v3
      - uses: actions/setup-python@v4
        with:
          python-version: 3.x
      - run: pip install --upgrade pip wheel
      - run: pip install -r test-requirements.txt
      - run: bandit --recursive --skip B101 .  # B101 is assert statements
      - run: black --check .
      - run: codespell --ignore-words-list="fo"
      - run: flake8 . --count --max-complexity=12 --max-line-length=89 --show-source --statistics
      - run: isort --check-only --profile black .
      - run: |
          pip install --editable .
          mkdir --parents --verbose .mypy_cache
      - run: mypy --ignore-missing-imports --install-types --non-interactive .
      - run: pytest .


================================================
FILE: .gitignore
================================================
# Byte-compiled / optimized / DLL files
__pycache__/
*.py[cod]
*$py.class

# C extensions
*.so

# Distribution / packaging
.Python
env/
build/
develop-eggs/
dist/
downloads/
eggs/
.eggs/
lib/
lib64/
parts/
sdist/
var/
*.egg-info/
.installed.cfg
*.egg

# PyInstaller
#  Usually these files are written by a python script from a template
#  before PyInstaller builds the exe, so as to inject date/other infos into it.
*.manifest
*.spec

# Installer logs
pip-log.txt
pip-delete-this-directory.txt

# Unit test / coverage reports
htmlcov/
.tox/
.coverage
.coverage.*
.cache
nosetests.xml
coverage.xml
*,cover
.hypothesis/

# Translations
*.mo
*.pot

# Django stuff:
*.log
local_settings.py

# Flask stuff:
instance/
.webassets-cache

# Scrapy stuff:
.scrapy

# Sphinx documentation
docs/_build/

# PyBuilder
target/

# IPython Notebook
.ipynb_checkpoints

# pyenv
.python-version

# celery beat schedule file
celerybeat-schedule

# dotenv
.env

# virtualenv
venv/
ENV/

# Spyder project settings
.spyderproject

# Rope project settings
.ropeproject
.idea


================================================
FILE: .pre-commit-hooks.yaml
================================================
- id: com2ann
  name: com2ann
  entry: com2ann
  language: python
  types: [ file ]
  files: .*\.(py|pyi|md|rst)

================================================
FILE: LICENSE
================================================
The MIT License (MIT)

Copyright (c) 2017-2019 Ivan Levkivskyi

Permission is hereby granted, free of charge, to any person obtaining a copy of
this software and associated documentation files (the "Software"), to deal in
the Software without restriction, including without limitation the rights to
use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of
the Software, and to permit persons to whom the Software is furnished to do so,
subject to the following conditions:

The above copyright notice and this permission notice shall be included in all
copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS
FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR
COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER
IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN
CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.


================================================
FILE: MANIFEST.in
================================================
include LICENSE
include README.md
include setup.py
include src/com2ann.py
include src/test_com2ann.py


================================================
FILE: README.md
================================================
com2ann
=======

[![Build Status](https://travis-ci.org/ilevkivskyi/com2ann.svg)](https://travis-ci.org/ilevkivskyi/com2ann)
[![Checked with mypy](http://www.mypy-lang.org/static/mypy_badge.svg)](http://mypy-lang.org/)

Tool for translation of type comments to type annotations in Python.

The tool requires Python 3.8 to run. But the supported target code version
is Python 3.4+ (can be specified with `--python-minor-version`).

Currently, the tool translates function and assignment type comments to
type annotations. For example this code:
```python
from typing import Optional, Final

MAX_LEVEL = 80  # type: Final

class Template:
    default = None  # type: Optional[str]

    def apply(self, value, **opts):
        # type: (str, **bool) -> str
        ...
```
will be translated to:
```python
from typing import Optional, Final

MAX_LEVEL: Final = 80

class Template:
    default: Optional[str] = None

    def apply(self, value: str, **opts: bool) -> str:
        ...
```

The philosophy of the tool is to be minimally invasive, and preserve original
formatting as much as possible. This is why the tool doesn't use un-parse.

The only (optional) formatting code modification is wrapping long function
signatures. To specify the maximal length, use `--wrap-signatures MAX_LENGTH`.
The signatures are wrapped one argument per line (after each comma), for example:
```python
    def apply(self,
              value: str,
              **opts: bool) -> str:
        ...
```

For working with stubs, there are two additional options for assignments:
`--drop-ellipsis` and `--drop-none`. They will result in omitting the redundant
right hand sides. For example, this:
```python
var = ...  # type: List[int]
class Test:
    attr = None  # type: str
```
will be translated with such options to:
```python
var: List[int]
class Test:
    attr: str
```
### Usage
$ `com2ann --help`
```
usage: com2ann [-h] [-o OUTFILE] [-s] [-n] [-e] [-i] [-w WRAP_SIGNATURES]
               [-v PYTHON_MINOR_VERSION]
               infile

Helper module to translate type comments to type annotations. The key idea of
this module is to perform the translation while preserving the original
formatting as much as possible. We try to be not opinionated about code
formatting and therefore work at the source code and tokenizer level instead
of modifying AST and using un-parse. We are especially careful about
assignment statements, and keep the placement of additional (non-type)
comments. For function definitions, we might introduce some formatting
modifications, if the original formatting was too tricky.

positional arguments:
  infile                input file or directory for translation, must contain
                        no syntax errors; if --outfile is not given,
                        translation is made *in place*

optional arguments:
  -h, --help            show this help message and exit
  -o OUTFILE, --outfile OUTFILE
                        output file or directory, will be overwritten if
                        exists, defaults to input file or directory
  -s, --silent          do not print summary for line numbers of translated
                        and rejected comments
  -n, --drop-none       drop any None as assignment value during translation
                        if it is annotated by a type comment
  -e, --drop-ellipsis   drop any Ellipsis (...) as assignment value during
                        translation if it is annotated by a type comment
  -i, --add-future-imports
                        add 'from __future__ import annotations' to any file
                        where type comments were successfully translated
  -w WRAP_SIGNATURES, --wrap-signatures WRAP_SIGNATURES
                        wrap function headers that are longer than given
                        length
  -v PYTHON_MINOR_VERSION, --python-minor-version PYTHON_MINOR_VERSION
                        Python 3 minor version to use to parse the files
```


================================================
FILE: mypy.ini
================================================
[mypy]
disallow_untyped_calls = True
disallow_untyped_defs = True
disallow_incomplete_defs = True
check_untyped_defs = True
disallow_subclassing_any = True
warn_no_return = True
strict_optional = True
no_implicit_optional = True
disallow_any_generics = True
disallow_any_unimported = True
warn_redundant_casts = True
warn_unused_configs = True
strict_equality = True


================================================
FILE: setup.py
================================================
#!/usr/bin/env python

import sys

from setuptools import setup

if sys.version_info < (3, 8, 0):
    sys.stderr.write("ERROR: You need Python 3.8 or later to use com2ann.\n")
    exit(1)

version = "0.3.0"
description = "Tool to translate type comments to annotations."
long_description = """
com2ann
=======

Tool for translation of type comments to type annotations in Python.

This tool requires Python 3.8 to run. But the supported target code version
is Python 3.4+ (can be specified with ``--python-minor-version``).

Currently, the tool translates function and assignment type comments to
type annotations.

The philosophy of of the tool is too minimally invasive, and preserve original
formatting as much as possible. This is why the tool doesn't use un-parse.
""".lstrip()

classifiers = [
    "Development Status :: 3 - Alpha",
    "Environment :: Console",
    "Intended Audience :: Developers",
    "License :: OSI Approved :: MIT License",
    "Operating System :: OS Independent",
    "Programming Language :: Python :: 3.8",
    "Topic :: Software Development",
]

setup(
    name="com2ann",
    version=version,
    description=description,
    long_description=long_description,
    author="Ivan Levkivskyi",
    author_email="levkivskyi@gmail.com",
    url="https://github.com/ilevkivskyi/com2ann",
    license="MIT",
    keywords="typing function annotations type hints "
    "type comments variable annotations",
    python_requires=">=3.8",
    package_dir={"": "src"},
    py_modules=["com2ann"],
    entry_points={"console_scripts": ["com2ann=com2ann:main"]},
    classifiers=classifiers,
)


================================================
FILE: src/com2ann.py
================================================
"""Helper module to translate type comments to type annotations.

The key idea of this module is to perform the translation while preserving
the original formatting as much as possible. We try to be not opinionated
about code formatting and therefore work at the source code and tokenizer level
instead of modifying AST and using un-parse.

We are especially careful about assignment statements, and keep the placement
of additional (non-type) comments. For function definitions, we might introduce
some formatting modifications, if the original formatting was too tricky.
"""

from __future__ import annotations

import argparse
import ast
import pathlib
import re
import sys
import tokenize
from collections import defaultdict
from dataclasses import dataclass
from enum import Enum, auto
from io import BytesIO
from tokenize import TokenInfo
from typing import Any, DefaultDict, Sequence, Union

__all__ = ["com2ann", "TYPE_COM"]

TYPE_COM = re.compile(r"\s*#\s*type\s*:(.*)$", flags=re.DOTALL)

# For internal use only.
_TRAILER = re.compile(r"\s*$", flags=re.DOTALL)
_NICE_IGNORE = re.compile(r"\s*# type: ignore(\[\S+\])?\s*$", flags=re.DOTALL)

FUTURE_IMPORT_WHITELIST = {"str", "int", "bool", "None"}

Unsupported = Union[ast.For, ast.AsyncFor, ast.With, ast.AsyncWith]
Function = Union[ast.FunctionDef, ast.AsyncFunctionDef]


@dataclass
class Options:
    """Config options, see details in main()."""

    drop_none: bool
    drop_ellipsis: bool
    silent: bool
    add_future_imports: bool = False
    wrap_signatures: int = 0
    python_minor_version: int = -1


class RvalueKind(Enum):
    """Special cases for assignment r.h.s."""

    OTHER = auto()
    TUPLE = auto()
    NONE = auto()
    ELLIPSIS = auto()


@dataclass
class AssignData:
    """Location data for translating assignment type comment."""

    type_comment: str

    # Position where l.h.s. ends (may not include closing paren).
    lvalue_end_line: int
    lvalue_end_offset: int

    # Position range for the r.h.s. (may also not include parentheses
    # if they are redundant).
    rvalue_start_line: int
    rvalue_start_offset: int
    rvalue_end_line: int
    rvalue_end_offset: int

    # Is there any r.h.s. that requires special treatment.
    rvalue_kind: RvalueKind = RvalueKind.OTHER


@dataclass
class ArgComment:
    """Location data for insertion of an argument annotation."""

    type_comment: str

    # Place where a given argument ends, insert an annotation here.
    arg_line: int
    arg_end_offset: int

    has_default: bool = False


@dataclass
class FunctionData:
    """Location data for translating function comment."""

    arg_types: list[ArgComment]
    ret_type: str | None

    # The line where 'def' appears.
    header_start_line: int
    # This doesn't include any comments or whitespace-only lines.
    body_first_line: int


class FileData:
    """Internal class describing global data on file."""

    def __init__(
        self, lines: list[str], tokens: list[TokenInfo], tree: ast.AST
    ) -> None:
        # Source code lines.
        self.lines = lines
        # Tokens for the source code.
        self.tokens = tokens
        # Parsed tree (with type_comments = True).
        self.tree = tree

        # Map line number to token numbers. For example {1: [0, 1, 2, 3], 2: [4, 5]}
        # means that first to fourth tokens are on the first line.
        token_tab: DefaultDict[int, list[int]] = defaultdict(list)
        for i, tok in enumerate(tokens):
            token_tab[tok.start[0]].append(i)
        self.token_tab = token_tab

        # Basic translation logging.
        self.success: list[int] = (
            []
        )  # list of lines where type comments where processed
        self.fail: list[int] = []  # list of lines where type comments where rejected

        # Types we have inserted during translation.
        self.seen: set[str] = set()


class TypeCommentCollector(ast.NodeVisitor):
    """Visitor to collect type comments from an AST.

    This also records other necessary information such as location data for
    various nodes and their kinds.
    """

    def __init__(self, silent: bool) -> None:
        super().__init__()
        self.silent = silent
        # Type comments we can translate.
        self.found: list[AssignData | FunctionData] = []
        # Type comments that are not supported yet (for reporting).
        self.found_unsupported: list[int] = []

    def visit_Assign(self, s: ast.Assign) -> None:
        if s.type_comment:
            if not check_target(s):
                self.found_unsupported.append(s.lineno)
                return
            target = s.targets[0]
            value = s.value

            # These may require special treatment.
            if isinstance(value, ast.Tuple):
                rvalue_kind = RvalueKind.TUPLE
            elif isinstance(value, ast.Constant) and value.value is None:
                rvalue_kind = RvalueKind.NONE
            elif isinstance(value, ast.Constant) and value.value is Ellipsis:
                rvalue_kind = RvalueKind.ELLIPSIS
            else:
                rvalue_kind = RvalueKind.OTHER

            assert (
                target.end_lineno
                and target.end_col_offset
                and value.end_lineno
                and value.end_col_offset
            )
            found = AssignData(
                s.type_comment,
                target.end_lineno,
                target.end_col_offset,
                value.lineno,
                value.col_offset,
                value.end_lineno,
                value.end_col_offset,
                rvalue_kind,
            )
            self.found.append(found)

    def visit_For(self, o: ast.For) -> None:
        self.visit_unsupported(o)

    def visit_AsyncFor(self, o: ast.AsyncFor) -> None:
        self.visit_unsupported(o)

    def visit_With(self, o: ast.With) -> None:
        self.visit_unsupported(o)

    def visit_AsyncWith(self, o: ast.AsyncWith) -> None:
        self.visit_unsupported(o)

    def visit_unsupported(self, o: Unsupported) -> None:
        if o.type_comment:
            self.found_unsupported.append(o.lineno)
        self.generic_visit(o)

    def visit_FunctionDef(self, fdef: ast.FunctionDef) -> None:
        self.visit_function_impl(fdef)

    def visit_AsyncFunctionDef(self, fdef: ast.AsyncFunctionDef) -> None:
        self.visit_function_impl(fdef)

    def visit_function_impl(self, fdef: Function) -> None:
        if (
            fdef.type_comment
            or any(a.type_comment for a in fdef.args.args)
            or any(a.type_comment for a in fdef.args.kwonlyargs)
            or fdef.args.vararg
            and fdef.args.vararg.type_comment
            or fdef.args.kwarg
            and fdef.args.kwarg.type_comment
        ):

            # Number of non-default positional arguments.
            num_non_defs = len(fdef.args.args) - len(fdef.args.defaults)

            # Positions of non-default keyword-only arguments.
            kw_non_defs = {i for i, d in enumerate(fdef.args.kw_defaults) if d is None}

            args = self.process_per_arg_comments(fdef, num_non_defs, kw_non_defs)

            ret: str | None
            if fdef.type_comment:
                res = split_function_comment(fdef.type_comment, self.silent)
                if not res:
                    self.found_unsupported.append(fdef.lineno)
                    return
                f_args, ret = res
            else:
                f_args, ret = [], None

            if args and f_args:
                if not self.silent:
                    print(
                        f'Both per-argument and function comments for "{fdef.name}"',
                        file=sys.stderr,
                    )
                self.found_unsupported.append(fdef.lineno)
                return

            body_start = fdef.body[0].lineno
            if isinstance(
                fdef.body[0], (ast.AsyncFunctionDef, ast.FunctionDef, ast.ClassDef)
            ):
                # We need to compensate for decorators, because the first line of a
                # class/function is the line where 'class' or 'def' appears.
                if fdef.body[0].decorator_list:
                    body_start = min(it.lineno for it in fdef.body[0].decorator_list)
            if args:
                self.found.append(FunctionData(args, ret, fdef.lineno, body_start))
            elif not f_args:
                self.found.append(FunctionData([], ret, fdef.lineno, body_start))
            else:
                c_args = self.process_function_comment(fdef, f_args, num_non_defs)
                if c_args is None:
                    # There was an error processing comment.
                    return
                self.found.append(FunctionData(c_args, ret, fdef.lineno, body_start))
        self.generic_visit(fdef)

    def process_per_arg_comments(
        self, fdef: Function, num_non_defs: int, kw_non_defs: set[int]
    ) -> list[ArgComment]:
        """Collect information about per-argument function comments.

        These comments look like:

            def func(
                arg1,  # type: Type1
                arg2,  # type: Type2
            ):
                ...
        """
        args: list[ArgComment] = []

        for i, a in enumerate(fdef.args.args):
            if a.type_comment:
                assert a.end_col_offset
                args.append(
                    ArgComment(
                        a.type_comment, a.lineno, a.end_col_offset, i >= num_non_defs
                    )
                )
        if fdef.args.vararg and fdef.args.vararg.type_comment:
            vararg = fdef.args.vararg
            assert vararg.end_col_offset
            args.append(
                ArgComment(
                    fdef.args.vararg.type_comment,
                    vararg.lineno,
                    vararg.end_col_offset,
                    False,
                )
            )

        for i, a in enumerate(fdef.args.kwonlyargs):
            if a.type_comment:
                assert a.end_col_offset
                args.append(
                    ArgComment(
                        a.type_comment, a.lineno, a.end_col_offset, i not in kw_non_defs
                    )
                )
        if fdef.args.kwarg and fdef.args.kwarg.type_comment:
            kwarg = fdef.args.kwarg
            assert kwarg.end_col_offset
            args.append(
                ArgComment(
                    fdef.args.kwarg.type_comment,
                    kwarg.lineno,
                    kwarg.end_col_offset,
                    False,
                )
            )
        return args

    def process_function_comment(
        self, fdef: Function, f_args: list[str], num_non_defs: int
    ) -> list[ArgComment] | None:
        """Combine location data for function arguments with types from a comment.

        f_args contains already split argument strings from the function type comment,
        for example if the comment is # type: (int, str) -> None, the f_args should be
        ['int', 'str'].
        """
        args: list[ArgComment] = []

        tot_args = len(fdef.args.args) + len(fdef.args.kwonlyargs)
        if fdef.args.vararg:
            tot_args += 1
        if fdef.args.kwarg:
            tot_args += 1

        # One is only allowed to skip annotation for self or cls.
        if len(f_args) not in (tot_args, tot_args - 1):
            if not self.silent:
                print(
                    f'Invalid number of arguments in comment for "{fdef.name}"',
                    file=sys.stderr,
                )
            self.found_unsupported.append(fdef.lineno)
            return None

        # The list of arguments we need to annotate.
        if len(f_args) == tot_args - 1:
            iter_args = fdef.args.args[1:]
        else:
            iter_args = fdef.args.args.copy()

        # Extend the list with other possible arguments.
        if fdef.args.vararg:
            iter_args.append(fdef.args.vararg)
        iter_args.extend(fdef.args.kwonlyargs)
        if fdef.args.kwarg:
            iter_args.append(fdef.args.kwarg)

        # Combine arguments locations with corresponding comments.
        for typ, a in zip(f_args, iter_args):
            has_default = False
            if a in fdef.args.args and fdef.args.args.index(a) >= num_non_defs:
                has_default = True

            kwonlyargs = fdef.args.kwonlyargs
            if a in kwonlyargs and fdef.args.kw_defaults[kwonlyargs.index(a)]:
                has_default = True

            assert a.end_col_offset
            args.append(ArgComment(typ, a.lineno, a.end_col_offset, has_default))
        return args


def split_sub_comment(comment: str) -> tuple[str, str | None]:
    """Split extra comment from a type comment.

    The only non-trivial thing here is to take care of literal types,
    that can contain arbitrary chars, including '#'.
    """
    rl = BytesIO(comment.encode("utf-8")).readline
    tokens = list(tokenize.tokenize(rl))

    i_sub = None
    for i, tok in enumerate(tokens):
        if tok.exact_type == tokenize.COMMENT:
            _, i_sub = tokens[i - 1].end

    if i_sub is not None:
        return comment[:i_sub], comment[i_sub:]
    return comment, None


def split_function_comment(
    comment: str, silent: bool = False
) -> tuple[list[str], str] | None:
    """Split function type comment into argument types and return types.

    This also removes any additional sub-comment. For example:

        # type: (int, str) -> None  # some explanation

    is transformed into: ['int', 'str'], 'None'.
    """
    typ, _ = split_sub_comment(comment)
    if typ.count("->") != 1:
        if not silent:
            print("Invalid function type comment:", comment, file=sys.stderr)
        return None

    # TODO: ()->int vs () -> int -- keep spacing (also # type:int vs # type: int).
    arg_list, ret = typ.split("->")

    arg_list = arg_list.strip()
    ret = ret.strip()

    if not (arg_list[0] == "(" and arg_list[-1] == ")"):
        if not silent:
            print("Invalid function type comment:", comment, file=sys.stderr)
        return None

    arg_list = arg_list[1:-1]
    args: list[str] = []

    # TODO: use tokenizer to guard against Literal[','].
    next_arg = ""
    nested = 0
    for c in arg_list:
        if c in "([{":
            nested += 1
        if c in ")]}":
            nested -= 1
        if c == "," and not nested:
            args.append(next_arg.strip())
            next_arg = ""
        else:
            next_arg += c

    if next_arg:
        args.append(next_arg.strip())

    # Currently mypy just ignores * and ** and just gets the argument kind from the
    # function header, so we don't need any additional checks.
    return [a.lstrip("*") for a in args if a != "..."], ret


def strip_type_comment(line: str) -> str:
    """Remove any type comments from this line.

    We however keep # type: ignore comments, and any sub-comments.
    This raises if there is no type comment found.
    """
    match = re.search(TYPE_COM, line)
    assert match, line
    if match.group(1).lstrip().startswith("ignore"):
        # Keep # type: ignore[=code] comments.
        return line
    rest = line[: match.start()]

    typ = match.group(1)
    _, sub_comment = split_sub_comment(typ)
    if sub_comment is None:
        # Just keep exactly the same kind of endline.
        trailer = re.search(_TRAILER, typ)
        assert trailer
        sub_comment = typ[trailer.start() :]

    if rest:
        new_line = rest + sub_comment
    else:
        # A type comment on line of its own.
        new_line = line[: line.index("#")] + sub_comment.lstrip(" \t")
    return new_line


def string_insert(line: str, extra: str, pos: int) -> str:
    return line[:pos] + extra + line[pos:]


def process_assign(
    comment: AssignData, data: FileData, drop_none: bool, drop_ellipsis: bool
) -> None:
    """Process type comment in an assignment statement.

    Remove the matching r.h.s. if drop_none or drop_ellipsis is True.
    For example:

        x = ...  # type: int

    will be translated to

        x: int
    """
    lines = data.lines

    # In ast module line numbers start from 1, not 0.
    rv_end = comment.rvalue_end_line - 1
    rv_start = comment.rvalue_start_line - 1

    # We perform the tasks in order from larger line/columns to smaller ones
    # to avoid shuffling the line column numbers in following code.
    # First remove the type comment.
    match = re.search(TYPE_COM, lines[rv_end])
    if match and not match.group(1).lstrip().startswith("ignore"):
        lines[rv_end] = strip_type_comment(lines[rv_end])
    else:
        # Special case: type comment moved to a separate continuation line.
        # There two ways to have continuation...
        assert lines[rv_end].rstrip().endswith("\\") or lines[  # ... a slash
            rv_end + 1
        ].lstrip().startswith(
            ")"
        )  # ... inside parentheses

        lines[rv_end + 1] = strip_type_comment(lines[rv_end + 1])
        if not lines[rv_end + 1].strip():
            del lines[rv_end + 1]
            # Also remove the \ symbol from the previous line, but keep
            # the original line ending.
            trailer = re.search(_TRAILER, lines[rv_end])
            assert trailer
            lines[rv_end] = lines[rv_end].rstrip()[:-1].rstrip() + trailer.group()

    # Second we take care of r.h.s. special cases.
    if comment.rvalue_kind == RvalueKind.TUPLE:
        # TODO: take care of (1, 2), (3, 4) with matching pars.
        if not (
            lines[rv_start][comment.rvalue_start_offset] == "("
            and lines[rv_end][comment.rvalue_end_offset - 1] == ")"
        ):
            # We need to wrap rvalue in parentheses before Python 3.8,
            # because x: Tuple[int, ...] = 1, 2, 3 used to be a syntax error.
            end_line = lines[rv_end]
            lines[rv_end] = string_insert(end_line, ")", comment.rvalue_end_offset)

            start_line = lines[rv_start]
            lines[rv_start] = string_insert(
                start_line, "(", comment.rvalue_start_offset
            )

            if comment.rvalue_end_line > comment.rvalue_start_line:
                # Add a space to fix indentation after inserting paren.
                for i in range(comment.rvalue_end_line, comment.rvalue_start_line, -1):
                    if lines[i - 1].strip():
                        lines[i - 1] = " " + lines[i - 1]

    elif (
        comment.rvalue_kind == RvalueKind.NONE
        and drop_none
        or comment.rvalue_kind == RvalueKind.ELLIPSIS
        and drop_ellipsis
    ):
        # TODO: more tricky (multi-line) cases.
        if comment.lvalue_end_line == comment.rvalue_end_line:
            line = lines[comment.lvalue_end_line - 1]
            lines[comment.lvalue_end_line - 1] = (
                line[: comment.lvalue_end_offset] + line[comment.rvalue_end_offset :]
            )

    # Finally we insert the annotation.
    lvalue_line = lines[comment.lvalue_end_line - 1]
    typ, _ = split_sub_comment(comment.type_comment)
    data.seen.add(typ)

    # Take care of '(foo) = bar  # type: baz'.
    # TODO: this is pretty ad hoc.
    while (
        comment.lvalue_end_offset < len(lvalue_line)
        and lvalue_line[comment.lvalue_end_offset] == ")"
    ):
        comment.lvalue_end_offset += 1

    lines[comment.lvalue_end_line - 1] = (
        lvalue_line[: comment.lvalue_end_offset]
        + ": "
        + typ
        + lvalue_line[comment.lvalue_end_offset :]
    )


def insert_arg_type(line: str, arg: ArgComment, seen: set[str]) -> str:
    """Insert the argument type at a given location.

    Also record the type we translated.
    """
    typ, _ = split_sub_comment(arg.type_comment)
    seen.add(typ)

    new_line = line[: arg.arg_end_offset] + ": " + typ

    rest = line[arg.arg_end_offset :]
    if not arg.has_default:
        return new_line + rest

    # Here we are a bit opinionated about spacing (see PEP 8).
    rest = rest.lstrip()
    assert rest[0] == "="
    rest = rest[1:].lstrip()

    return new_line + " = " + rest


def wrap_function_header(header: str) -> list[str]:
    """Wrap long function signature (header) one argument per line.

    Currently only headers that are initially one-line are supported.
    For example:

        def foo(arg1: LongType1, arg2: LongType2) -> None:
            ...

    becomes

        def foo(arg1: LongType1,
                arg2: LongType2) -> None:
            ...
    """
    # TODO: use tokenizer to guard against Literal[','].
    parts: list[str] = []
    next_part = ""
    nested = 0
    complete = False  # Did we split all the arguments inside (...)?
    indent: int | None = None

    for i, c in enumerate(header):
        if c in "([{":
            nested += 1
            if c == "(" and indent is None:
                indent = i + 1
        if c in ")]}":
            nested -= 1
            if not nested:
                # To avoid splitting return types that also have commas.
                complete = True
        if c == "," and nested == 1 and not complete:
            next_part += c
            parts.append(next_part)
            next_part = ""
        else:
            next_part += c

    parts.append(next_part)

    if len(parts) == 1:
        return parts

    # Indent all the wrapped lines.
    assert indent is not None
    parts = [parts[0]] + [" " * indent + p.lstrip(" \t") for p in parts[1:]]

    # Add line endings like in the original header.
    trailer = re.search(_TRAILER, header)
    assert trailer
    end_line = header[trailer.start() :].lstrip(" \t")
    parts = [p + end_line for p in parts[:-1]] + [parts[-1]]

    # TODO: handle type ignores better.
    ignore = re.search(_NICE_IGNORE, parts[-1])
    if ignore:
        # We should keep # type: ignore on the first line of the wrapped header.
        last = parts[-1]
        first = parts[0]
        first_trailer = re.search(_TRAILER, first)
        assert first_trailer
        parts[0] = first[: first_trailer.start()] + ignore.group()
        parts[-1] = last[: ignore.start()] + first_trailer.group()

    return parts


def process_func_def(func_type: FunctionData, data: FileData, wrap_sig: int) -> None:
    """Perform translation for an (async) function definition.

    This supports two main ways of adding type comments for argument:

        def one(
            arg,  # type: Type
        ):
            ...

        def another(arg):
            # type: (Type) -> AnotherType
    """
    lines = data.lines

    # Find line and column where _actual_ colon is located.
    ret_line = func_type.body_first_line - 1
    ret_line -= 1
    while not lines[ret_line].split("#")[0].strip():
        ret_line -= 1

    colon = None
    for i in reversed(data.token_tab[ret_line + 1]):
        if data.tokens[i].exact_type == tokenize.COLON:
            _, colon = data.tokens[i].start
            break
    assert colon is not None

    # Note that -1 offset is because line numbers starts from 1 in ast module.
    for i in range(func_type.body_first_line - 2, func_type.header_start_line - 2, -1):
        if re.search(TYPE_COM, lines[i]):
            lines[i] = strip_type_comment(lines[i])
            if not lines[i].strip():
                if i > ret_line:
                    del lines[i]
                else:
                    # Removing an empty line in argument list is unsafe, since it
                    # can cause shuffling of following line numbers.
                    # TODO: find a cleaner fix.
                    lines[i] = ""

    # Inserting return type is a bit dirty...
    if func_type.ret_type:
        data.seen.add(func_type.ret_type)
        right_par = lines[ret_line][:colon].rindex(")")
        lines[ret_line] = (
            lines[ret_line][: right_par + 1]
            + " -> "
            + func_type.ret_type
            + lines[ret_line][colon:]
        )

    # Inserting argument types is pretty straightforward.
    for arg in reversed(func_type.arg_types):
        lines[arg.arg_line - 1] = insert_arg_type(
            lines[arg.arg_line - 1], arg, data.seen
        )

    # Finally wrap the translated function header if needed.
    if ret_line == func_type.header_start_line - 1:
        header = data.lines[ret_line]
        if wrap_sig and len(header) > wrap_sig:
            data.lines[ret_line : ret_line + 1] = wrap_function_header(header)


def com2ann_impl(
    data: FileData,
    drop_none: bool,
    drop_ellipsis: bool,
    wrap_sig: int = 0,
    silent: bool = True,
    add_future_imports: bool = False,
) -> str:
    """Collect type annotations in AST and perform code translation.

    Add the future import if necessary. Currently only type comments in
    functions and simple assignments are supported.
    """
    finder = TypeCommentCollector(silent)
    finder.visit(data.tree)

    data.fail.extend(finder.found_unsupported)
    found = list(reversed(finder.found))

    # Perform translations in reverse order to avoid shuffling line numbers.
    for item in found:
        if isinstance(item, AssignData):
            process_assign(item, data, drop_none, drop_ellipsis)
            data.success.append(item.lvalue_end_line)
        elif isinstance(item, FunctionData):
            process_func_def(item, data, wrap_sig)
            data.success.append(item.header_start_line)

    if add_future_imports and data.success and not data.seen <= FUTURE_IMPORT_WHITELIST:
        # Find first non-trivial line of code.
        i = 0
        while not data.lines[i].split("#")[0].strip():
            i += 1
        trailer = re.search(_TRAILER, data.lines[i])
        assert trailer
        data.lines.insert(i, "from __future__ import annotations" + trailer.group())

    return "".join(data.lines)


def check_target(assign: ast.Assign) -> bool:
    """Check if the statement is suitable for annotation.

    Type comments can placed on with and for statements, but
    annotation can be placed only on an simple assignment with a single target.
    """
    if len(assign.targets) == 1:
        target = assign.targets[0]
    else:
        return False
    if (
        isinstance(target, ast.Name)
        or isinstance(target, ast.Attribute)
        or isinstance(target, ast.Subscript)
    ):
        return True
    return False


def com2ann(
    code: str,
    *,
    drop_none: bool = False,
    drop_ellipsis: bool = False,
    silent: bool = False,
    add_future_imports: bool = False,
    wrap_sig: int = 0,
    python_minor_version: int = -1,
) -> tuple[str, FileData] | None:
    """Translate type comments to type annotations in code.

    Take code as string and return this string where::

      variable = value  # type: annotation  # real comment

    is translated to::

      variable: annotation = value  # real comment

    For unsupported syntax cases, the type comments are
    left intact. If drop_None is True or if drop_Ellipsis
    is True translate correspondingly::

      variable = None  # type: annotation
      variable = ...  # type: annotation

    into::

      variable: annotation

    The tool tries to preserve code formatting as much as
    possible, but an exact translation is not guaranteed.
    A summary of translated comments id printed by default.
    """
    try:
        # We want to work only with file without syntax errors
        tree = ast.parse(
            code, type_comments=True, feature_version=(3, python_minor_version)
        )
    except SyntaxError as exc:
        print("SyntaxError:", exc, file=sys.stderr)
        return None
    lines = code.splitlines(keepends=True)
    rl = BytesIO(code.encode("utf-8")).readline
    tokens = list(tokenize.tokenize(rl))

    data = FileData(lines, tokens, tree)
    new_code = com2ann_impl(
        data, drop_none, drop_ellipsis, wrap_sig, silent, add_future_imports
    )

    if not silent:
        if data.success:
            print(
                "Comments translated for statements on lines:",
                ", ".join(str(lno) for lno in data.success),
            )
        if data.fail:
            print(
                "Comments skipped for statements on lines:",
                ", ".join(str(lno) for lno in data.fail),
            )
        if not data.success and not data.fail:
            print("No type comments found")

    return new_code, data


def translate_file(infile: str, outfile: str, options: Options) -> None:
    try:
        opened = tokenize.open(infile)
    except SyntaxError:
        print("Cannot open", infile, file=sys.stderr)
        return
    with opened as f:
        code = f.read()
        enc = f.encoding
    if not options.silent:
        print("File:", infile)

    future_imports = options.add_future_imports
    if outfile.endswith(".pyi"):
        future_imports = False

    try:
        result = com2ann(
            code,
            drop_none=options.drop_none,
            drop_ellipsis=options.drop_ellipsis,
            silent=options.silent,
            add_future_imports=future_imports,
            wrap_sig=options.wrap_signatures,
            python_minor_version=options.python_minor_version,
        )
    except Exception:
        print(f"INTERNAL ERROR while processing {infile}", file=sys.stderr)
        print(
            "Please report bug at https://github.com/ilevkivskyi/com2ann/issues",
            file=sys.stderr,
        )
        raise

    if result is None:
        print("SyntaxError in", infile, file=sys.stderr)
        return
    new_code, _ = result
    with open(outfile, "wb") as fo:
        fo.write(new_code.encode(enc))


def parse_cli_args(args: Sequence[str] | None = None) -> dict[str, Any]:
    parser = argparse.ArgumentParser(description=__doc__)
    parser.add_argument(
        "infiles",
        nargs="*",
        type=pathlib.Path,
        help="input files or directories for translation, must\n"
        "contain no syntax errors;\n"
        "if --outfile is not given or multiple input files are\n"
        "given, translation is made *in place*",
    )

    parser.add_argument(
        "-o",
        "--outfile",
        type=pathlib.Path,
        help="output file or directory, will be overwritten if exists,\n"
        "defaults to input file or directory. Cannot be used\n"
        "when multiple input files are defined. infiles and\n"
        "outfile must be of the same type (file vs. directory)",
    )

    parser.add_argument(
        "-s",
        "--silent",
        help="do not print summary for line numbers of\n"
        "translated and rejected comments",
        action="store_true",
    )

    parser.add_argument(
        "-n",
        "--drop-none",
        help="drop any None as assignment value during\n"
        "translation if it is annotated by a type comment",
        action="store_true",
    )

    parser.add_argument(
        "-e",
        "--drop-ellipsis",
        help="drop any Ellipsis (...) as assignment value during\n"
        "translation if it is annotated by a type comment",
        action="store_true",
    )

    parser.add_argument(
        "-i",
        "--add-future-imports",
        help="add 'from __future__ import annotations' to any file\n"
        "where type comments were successfully translated",
        action="store_true",
    )

    parser.add_argument(
        "-w",
        "--wrap-signatures",
        help="wrap function headers that are longer than given length",
        type=int,
        default=0,
    )

    parser.add_argument(
        "-v",
        "--python-minor-version",
        help="Python 3 minor version to use to parse the files",
        type=int,
        default=-1,
    )

    options = parser.parse_args(args)

    infiles: list[pathlib.Path] = options.infiles
    outfile: pathlib.Path | None = options.outfile

    if not infiles:
        parser.exit(status=0, message="No input file, exiting")

    missing_files = [file for file in infiles if not file.exists()]
    if missing_files:
        parser.error(f"File(s) not found: {', '.join(str(e) for e in missing_files)}")

    if len(infiles) == 1:
        if outfile and outfile.exists() and infiles[0].is_dir() != outfile.is_dir():
            parser.error("Infile must be the same type as outfile (file vs. directory)")
    else:
        if outfile is not None:
            parser.error("Cannot use --outfile if multiple infiles are given")

    return vars(options)


def rebase_path(
    path: pathlib.Path, root: pathlib.Path, new_root: pathlib.Path
) -> pathlib.Path:
    """
    Generate a path that is at the same location relative to `new_root` as `path`
    was relative to `root`
    """
    return new_root / path.relative_to(root)


def process_single_entry(
    in_path: pathlib.Path, out_path: pathlib.Path, options: Options
) -> None:

    if in_path.is_file():
        translate_file(infile=str(in_path), outfile=str(out_path), options=options)
    else:
        for in_file in sorted(in_path.glob("**/*.py*")):
            if in_file.suffix not in [".py", ".pyi"] or in_file.is_dir():
                continue

            out_file = rebase_path(path=in_file, root=in_path, new_root=out_path)
            if out_file != in_file:
                out_file.parent.mkdir(parents=True, exist_ok=True)

            translate_file(infile=str(in_file), outfile=str(out_file), options=options)


def main() -> None:
    args = parse_cli_args()

    infiles = args.pop("infiles")
    outfile = args.pop("outfile")
    options = Options(**args)

    for infile in infiles:
        cur_outfile = outfile or infile
        process_single_entry(in_path=infile, out_path=cur_outfile, options=options)


if __name__ == "__main__":
    main()


================================================
FILE: src/test_cli.py
================================================
from __future__ import annotations

import os
import pathlib
from dataclasses import asdict
from typing import Any, Callable, Iterable, TypedDict

import pytest

import com2ann


@pytest.fixture
def test_path(tmp_path: pathlib.Path) -> Iterable[pathlib.Path]:
    old_path = pathlib.Path.cwd()
    os.chdir(tmp_path)
    yield tmp_path
    os.chdir(old_path)


class Exited(Exception):
    pass


class ParseResult(TypedDict, total=False):
    status: int | str | None
    args: dict[str, Any]
    error: bool
    out: str
    err: str


ParseCallable = Callable[..., ParseResult]


@pytest.fixture
def parse(
    capsys: pytest.CaptureFixture[str],
) -> ParseCallable:
    # parse is a pytext fixture returning a function
    # This function will:
    # - Ensure that SystemExit exceptions raised by ArgParse are
    #   caught before they exit pytest
    # - Save the result of the arg parsing if successful
    # - Save the status code otherwise
    # - Save and print whatever argparse printed to stdout and stderr

    def _(*args: str) -> ParseResult:
        result: ParseResult = {"status": 0}
        try:
            result.update({"args": com2ann.parse_cli_args(args), "error": False})
        except SystemExit as exc:
            result.update({"status": exc.code})

        out, err = capsys.readouterr()
        # Will only display if the test fails
        if out:
            print("stdout:", out)
            result["out"] = out
        if err:
            print("stderr:", err)
            result["err"] = err

        return result

    return _


def test_parse_cli_args__minimal(parse: ParseCallable, test_path: pathlib.Path) -> None:
    (test_path / "a.py").touch()

    assert parse("a.py")["args"] == {
        "add_future_imports": False,
        "drop_ellipsis": False,
        "drop_none": False,
        "infiles": [pathlib.Path("a.py")],
        "outfile": None,
        "python_minor_version": -1,
        "silent": False,
        "wrap_signatures": 0,
    }


def test_parse_cli_args__maximal(parse: ParseCallable, test_path: pathlib.Path) -> None:
    (test_path / "a.py").touch()

    assert parse(
        "a.py",
        "--outfile=b.py",
        "--add-future-imports",
        "--drop-ellipsis",
        "--drop-none",
        "--python-minor-version=7",
        "--silent",
        "--wrap-signatures=88",
    )["args"] == {
        "add_future_imports": True,
        "drop_ellipsis": True,
        "drop_none": True,
        "infiles": [pathlib.Path("a.py")],
        "outfile": pathlib.Path("b.py"),
        "python_minor_version": 7,
        "silent": True,
        "wrap_signatures": 88,
    }


def test_parse_cli_args__no_infile(parse: ParseCallable) -> None:
    result = parse()

    assert result == {"err": "No input file, exiting", "status": 0}


def test_parse_cli_args__missing_file(parse: ParseCallable) -> None:

    result = parse("a.py")
    assert result["status"] == 2
    assert "File(s) not found: a.py" in result["err"]


@pytest.mark.parametrize(
    "infile, outfile",
    [
        ("file.py", "dir"),
        ("dir", "file.py"),
    ],
)
def test_parse_cli_args__in_out_type_mismatch(
    parse: ParseCallable, test_path: pathlib.Path, infile: str, outfile: str
) -> None:
    (test_path / "file.py").touch()
    (test_path / "dir").mkdir()

    result = parse(infile, "--outfile", outfile)
    assert result["status"] == 2
    assert "Infile must be the same type" in result["err"]


def test_parse_cli_args__outfile_doesnt_exist(
    parse: ParseCallable, test_path: pathlib.Path
) -> None:
    (test_path / "dir").mkdir()

    result = parse("dir", "--outfile", "other_dir")
    assert result["status"] == 0


def test_parse_cli_args__multiple_inputs_and_output(
    parse: ParseCallable, test_path: pathlib.Path
) -> None:
    (test_path / "a.py").touch()
    (test_path / "b.py").touch()

    result = parse("a.py", "b.py", "--outfile", "c.py")
    assert result["status"] == 2
    assert "Cannot use --outfile if multiple infiles are given" in result["err"]


def test_rebase_path() -> None:
    assert com2ann.rebase_path(
        path=pathlib.Path("a/b/c/d"),
        root=pathlib.Path("a/b/"),
        new_root=pathlib.Path("e/f"),
    ) == pathlib.Path("e/f/c/d")


@pytest.fixture
def options() -> com2ann.Options:
    return com2ann.Options(drop_none=True, drop_ellipsis=True, silent=False)


@pytest.fixture
def translate_file(mocker: Any) -> Any:
    return mocker.patch("com2ann.translate_file", autospec=True)


def test_process_single_entry__file(
    test_path: pathlib.Path, translate_file: Any, options: com2ann.Options
) -> None:
    in_path = test_path / "a.py"
    in_path.touch()

    out_path = test_path / "b.py"

    com2ann.process_single_entry(in_path=in_path, out_path=out_path, options=options)

    translate_file.assert_called_with(
        infile=str(in_path),
        outfile=str(out_path),
        options=options,
    )


def test_process_single_entry__dir(
    test_path: pathlib.Path, translate_file: Any, options: com2ann.Options, mocker: Any
) -> None:
    in_path = test_path / "a"
    (in_path / "c/d/e").mkdir(parents=True)
    (in_path / "c/d/e/f.txt").touch()
    (in_path / "c/d/e/f.py").touch()
    (in_path / "c/d/e/f.pyi").touch()
    (in_path / "c/d/e/f.pyx").touch()

    out_path = test_path / "b"

    com2ann.process_single_entry(in_path=in_path, out_path=out_path, options=options)

    assert translate_file.mock_calls == [
        mocker.call(
            infile=str(in_path / "c/d/e/f.py"),
            outfile=str(out_path / "c/d/e/f.py"),
            options=options,
        ),
        mocker.call(
            infile=str(in_path / "c/d/e/f.pyi"),
            outfile=str(out_path / "c/d/e/f.pyi"),
            options=options,
        ),
    ]


@pytest.fixture
def parse_cli_args(
    test_path: pathlib.Path,
    mocker: Any,
    options: com2ann.Options,
) -> Any:
    f1 = test_path / "f1.py"
    f2 = test_path / "f2.py"

    f1.write_text("f1 = True")
    f2.write_text("f2 = False")

    args = {"infiles": [f1, f2], "outfile": None, **asdict(options)}
    return mocker.patch("com2ann.parse_cli_args", return_value=args)


def test_process_multiple_input_files(
    test_path: pathlib.Path,
    translate_file: Any,
    mocker: Any,
    parse_cli_args: Any,
    options: com2ann.Options,
) -> None:
    com2ann.main()

    assert translate_file.mock_calls == [
        mocker.call(
            infile=str(test_path / "f1.py"),
            outfile=str(test_path / "f1.py"),
            options=options,
        ),
        mocker.call(
            infile=str(test_path / "f2.py"),
            outfile=str(test_path / "f2.py"),
            options=options,
        ),
    ]
    assert (test_path / "f1.py").read_text() == "f1 = True"
    assert (test_path / "f2.py").read_text() == "f2 = False"


================================================
FILE: src/test_com2ann.py
================================================
"""Tests for the com2ann.py script in the Tools/parser directory."""

import re
import unittest
from textwrap import dedent
from typing import List, Optional

from com2ann import TYPE_COM, com2ann


class BaseTestCase(unittest.TestCase):

    def check(
        self,
        code: str,
        expected: Optional[str],
        n: bool = False,
        e: bool = False,
        w: int = 0,
        i: bool = False,
    ) -> None:
        result = com2ann(
            dedent(code),
            drop_none=n,
            drop_ellipsis=e,
            silent=True,
            wrap_sig=w,
            add_future_imports=i,
        )
        if expected is None:
            self.assertIs(result, None)
        else:
            assert result is not None
            new_code, _ = result
            self.assertEqual(new_code, dedent(expected))


class AssignTestCase(BaseTestCase):
    def test_basics(self) -> None:
        self.check("z = 5", "z = 5")
        self.check("z: int = 5", "z: int = 5")
        self.check("z = 5 # type: int", "z: int = 5")
        self.check("z = 5 # type: int # comment", "z: int = 5 # comment")

    def test_type_ignore(self) -> None:
        self.check(
            "foobar = foo_baz() # type: ignore", "foobar = foo_baz() # type: ignore"
        )
        self.check("a = 42 #type: ignore #comment", "a = 42 #type: ignore #comment")
        self.check(
            "foobar = None  # type: int  # type: ignore",
            "foobar: int  # type: ignore",
            True,
            False,
        )

    def test_complete_tuple(self) -> None:
        self.check(
            "t = 1, 2, 3 # type: Tuple[int, ...]", "t: Tuple[int, ...] = (1, 2, 3)"
        )
        self.check("t = 1, # type: Tuple[int]", "t: Tuple[int] = (1,)")
        self.check(
            "t = (1, 2, 3) # type: Tuple[int, ...]", "t: Tuple[int, ...] = (1, 2, 3)"
        )

    def test_drop_None(self) -> None:
        self.check("x = None # type: int", "x: int", True)
        self.check("x = None # type: int # another", "x: int # another", True)
        self.check("x = None # type: int # None", "x: int # None", True)

    def test_drop_Ellipsis(self) -> None:
        self.check("x = ... # type: int", "x: int", False, True)
        self.check("x = ... # type: int # another", "x: int # another", False, True)
        self.check("x = ... # type: int # ...", "x: int # ...", False, True)

    def test_newline(self) -> None:
        self.check("z = 5 # type: int\r\n", "z: int = 5\r\n")
        self.check("z = 5 # type: int # comment\x85", "z: int = 5 # comment\x85")

    def test_wrong(self) -> None:
        self.check("#type : str", "#type : str")
        self.check("x==y #type: bool", None)  # this is syntax error
        self.check("x==y ##type: bool", "x==y ##type: bool")  # this is OK

    def test_pattern(self) -> None:
        for line in ["#type: int", "  # type:  str[:] # com"]:
            self.assertTrue(re.search(TYPE_COM, line))
        for line in ["", "#", "# comment", "#type", "type int:"]:
            self.assertFalse(re.search(TYPE_COM, line))

    def test_uneven_spacing(self) -> None:
        self.check("x = 5   #type: int # this one is OK", "x: int = 5 # this one is OK")

    def test_coding_kept(self) -> None:
        self.check(
            """
            # -*- coding: utf-8 -*- # this should not be spoiled
            '''
            Docstring here
            '''

            import testmod
            from typing import Optional

            coding = None  # type: Optional[str]
            """,
            """
            # -*- coding: utf-8 -*- # this should not be spoiled
            '''
            Docstring here
            '''

            import testmod
            from typing import Optional

            coding: Optional[str] = None
            """,
        )

    def test_multi_line_tuple_value(self) -> None:
        self.check(
            """
            ttt \\
                 = \\
                   1.0, \\
                   2.0, \\
                   3.0, #type: Tuple[float, float, float]
            """,
            """
            ttt: Tuple[float, float, float] \\
                 = \\
                   (1.0, \\
                    2.0, \\
                    3.0,)
            """,
        )

    def test_complex_targets(self) -> None:
        self.check("x = y = z = 1 # type: int", "x = y = z = 1 # type: int")
        self.check(
            "x, y, z = [], [], []  # type: (List[int], List[int], List[str])",
            "x, y, z = [], [], []  # type: (List[int], List[int], List[str])",
        )
        self.check(
            "self.x = None  # type: int  # type: ignore",
            "self.x: int  # type: ignore",
            True,
            False,
        )
        self.check(
            "self.x[0] = []  # type: int  # type: ignore",
            "self.x[0]: int = []  # type: ignore",
        )

    def test_multi_line_assign(self) -> None:
        self.check(
            """
            class C:

                l[f(x
                    =1)] = [

                     g(y), # type: ignore
                     2,
                     ]  # type: List[int]
            """,
            """
            class C:

                l[f(x
                    =1)]: List[int] = [

                     g(y), # type: ignore
                     2,
                     ]
            """,
        )

    def test_parenthesized_lhs(self) -> None:
        self.check(
            """
            (C.x[1]) = \\
                42 == 5# type: bool
            """,
            """
            (C.x[1]): bool = \\
                42 == 5
            """,
        )

    def test_literal_types(self) -> None:
        self.check(
            "x = None  # type: Optional[Literal['#']]",
            "x: Optional[Literal['#']] = None",
        )

    def test_comment_on_separate_line(self) -> None:
        self.check(
            """
            bar = {} \\
                # type: SuperLongType[WithArgs]
            """,
            """
            bar: SuperLongType[WithArgs] = {}
            """,
        )
        self.check(
            """
            bar = {} \\
                # type: SuperLongType[WithArgs]  # noqa
            """,
            """
            bar: SuperLongType[WithArgs] = {} \\
                # noqa
            """,
        )
        self.check(
            """
            bar = None \\
                # type: SuperLongType[WithArgs]
            """,
            """
            bar: SuperLongType[WithArgs]
            """,
            n=True,
        )

    def test_continuation_using_parens(self) -> None:
        self.check(
            """
            X = (
                {one}
                | {other}
            )  # type: Final  # another option
            """,
            """
            X: Final = (
                {one}
                | {other}
            )  # another option
            """,
        )
        self.check(
            """
            X = (  # type: ignore
                {one}
                | {other}
            )  # type: Final
            """,
            """
            X: Final = (  # type: ignore
                {one}
                | {other}
            )
            """,
        )
        self.check(
            """
            foo = object()

            bar = (
                # Comment which explains why this ignored
                foo.quox   # type: ignore[attribute]
            )  # type: Mapping[str, Distribution]
            """,
            """
            foo = object()

            bar: Mapping[str, Distribution] = (
                # Comment which explains why this ignored
                foo.quox   # type: ignore[attribute]
            )
            """,
        )

    def test_with_for(self) -> None:
        self.check(
            """
            for i in range(test):  # type: float
                with open('/some/file'):
                    def f():
                        # type: () -> None
                        x = []  # type: List[int]  # unused
            """,
            """
            for i in range(test):  # type: float
                with open('/some/file'):
                    def f() -> None:
                        x: List[int] = []  # unused
            """,
        )


class FunctionTestCase(BaseTestCase):
    def test_single(self) -> None:
        self.check(
            """
            def add(a, b):  # type: (int, int) -> int
                '''# type: yes'''
            """,
            """
            def add(a: int, b: int) -> int:
                '''# type: yes'''
            """,
        )
        self.check(
            """
            def add(a, b):  # type: (int, int) -> int  # extra comment
                pass
            """,
            """
            def add(a: int, b: int) -> int:  # extra comment
                pass
            """,
        )

    def test_async_single(self) -> None:
        self.check(
            """
            async def add(a, b):  # type: (int, int) -> int
                '''# type: yes'''
            """,
            """
            async def add(a: int, b: int) -> int:
                '''# type: yes'''
            """,
        )
        self.check(
            """
            async def add(a, b):  # type: (int, int) -> int  # extra comment
                pass
            """,
            """
            async def add(a: int, b: int) -> int:  # extra comment
                pass
            """,
        )

    def test_complex_kinds(self) -> None:
        self.check(
            """
            def embezzle(account, funds=MANY, *fake_receipts, stuff, other=None, **kwarg):
                # type: (str, int, *str, Any, Optional[Any], Any) -> None  # note: vararg and kwarg
                pass
            """,
            """
            def embezzle(account: str, funds: int = MANY, *fake_receipts: str, stuff: Any, other: Optional[Any] = None, **kwarg: Any) -> None:
                # note: vararg and kwarg
                pass
            """,
        )  # noqa
        self.check(
            """
            def embezzle(account, funds=MANY, *fake_receipts, stuff, other=None, **kwarg):  # type: ignore
                # type: (str, int, *str, Any, Optional[Any], Any) -> None
                pass
            """,
            """
            def embezzle(account: str, funds: int = MANY, *fake_receipts: str, stuff: Any, other: Optional[Any] = None, **kwarg: Any) -> None:  # type: ignore
                pass
            """,
        )  # noqa

    def test_self_argument(self) -> None:
        self.check(
            """
            def load_cache(self):
                # type: () -> bool
                pass
            """,
            """
            def load_cache(self) -> bool:
                pass
            """,
        )

    def test_invalid_return_type(self) -> None:
        self.check(
            """
            def load_cache(x):
                # type: (str) -> bool -> invalid
                pass
            """,
            """
            def load_cache(x):
                # type: (str) -> bool -> invalid
                pass
            """,
        )

    def test_combined_annotations_single(self) -> None:
        self.check(
            """
            def send_email(address, sender, cc, bcc, subject, body):
                # type: (...) -> bool
                pass
            """,
            """
            def send_email(address, sender, cc, bcc, subject, body) -> bool:
                pass
            """,
        )
        # TODO: should we move an ignore on its own line somewhere else?
        self.check(
            """
            def send_email(address, sender, cc, bcc, subject, body):
                # type: (...) -> BadType  # type: ignore
                pass
            """,
            """
            def send_email(address, sender, cc, bcc, subject, body) -> BadType:
                # type: ignore
                pass
            """,
        )
        self.check(
            """
            def send_email(address, sender, cc, bcc, subject, body):  # type: ignore
                # type: (...) -> bool
                pass
            """,
            """
            def send_email(address, sender, cc, bcc, subject, body) -> bool:  # type: ignore
                pass
            """,
        )

    def test_combined_annotations_multi(self) -> None:
        self.check(
            """
            def send_email(address,     # type: Union[str, List[str]]
               sender,      # type: str
               cc,          # type: Optional[List[str]]  # this is OK
               bcc,         # type: Optional[List[Bad]]  # type: ignore
               subject='',
               body=None,   # type: List[str]
               *args        # type: ignore
               ):
               # type: (...) -> bool
               pass
            """,
            """
            def send_email(address: Union[str, List[str]],
               sender: str,
               cc: Optional[List[str]],  # this is OK
               bcc: Optional[List[Bad]],  # type: ignore
               subject='',
               body: List[str] = None,
               *args        # type: ignore
               ) -> bool:
               pass
            """,
        )

    def test_literal_type(self) -> None:
        self.check(
            """
            def force_hash(
                arg,  # type: Literal['#']
            ):
                # type: (...) -> Literal['#']
                pass
            """,
            """
            def force_hash(
                arg: Literal['#'],
            ) -> Literal['#']:
                pass
            """,
        )

    def test_wrap_lines(self) -> None:
        self.check(
            """
            def embezzle(self, account, funds=MANY, *fake_receipts):
                # type: (str, int, *str) -> None  # some comment
                pass
            """,
            """
            def embezzle(self,
                         account: str,
                         funds: int = MANY,
                         *fake_receipts: str) -> None:
                # some comment
                pass
            """,
            False,
            False,
            10,
        )
        self.check(
            """
            def embezzle(self, account, funds=MANY, *fake_receipts):  # type: ignore
                # type: (str, int, *str) -> None
                pass
            """,
            """
            def embezzle(self,  # type: ignore
                         account: str,
                         funds: int = MANY,
                         *fake_receipts: str) -> None:
                pass
            """,
            False,
            False,
            10,
        )
        self.check(
            """
            def embezzle(self, account, funds=MANY, *fake_receipts):
                # type: (str, int, *str) -> Dict[str, Dict[str, int]]
                pass
            """,
            """
            def embezzle(self,
                         account: str,
                         funds: int = MANY,
                         *fake_receipts: str) -> Dict[str, Dict[str, int]]:
                pass
            """,
            False,
            False,
            10,
        )

    def test_wrap_lines_error_code(self) -> None:
        self.check(
            """
            def embezzle(self, account, funds=MANY, *fake_receipts):  # type: ignore[override]
                # type: (str, int, *str) -> None
                pass
            """,
            """
            def embezzle(self,  # type: ignore[override]
                         account: str,
                         funds: int = MANY,
                         *fake_receipts: str) -> None:
                pass
            """,
            False,
            False,
            10,
        )

    def test_decorator_body(self) -> None:
        self.check(
            """
            def outer(self):  # a method
                # type: () -> None
                @deco()
                def inner():
                    # type: () -> None
                    pass
            """,
            """
            def outer(self) -> None:  # a method
                @deco()
                def inner() -> None:
                    pass
            """,
        )
        self.check(
            """
            def func(
                x,  # type: int
                *other,  # type: Any
            ):
                # type: () -> None
                @dataclass
                class C:
                    x = None  # type: int
            """,
            """
            def func(
                x: int,
                *other: Any,
            ) -> None:
                @dataclass
                class C:
                    x: int
            """,
            n=True,
        )

    def test_keyword_only_args(self) -> None:
        self.check(
            """
            def func(self,
                *,
                account,
                callback,  # type: Callable[[], None]
                start=0,  # type: int
                order,  # type: bool
                ):
                # type: (...) -> None
                ...
            """,
            """
            def func(self,
                *,
                account,
                callback: Callable[[], None],
                start: int = 0,
                order: bool,
                ) -> None:
                ...
            """,
        )

    def test_next_line_comment(self) -> None:
        self.check(
            """
            def __init__(
                self,
                short,                # type: Short
                long_argument,
                # type: LongType[int, str]
                other,                # type: Other
            ):
                # type: (...) -> None
                '''
                Some function.
                '''
            """,
            """
            def __init__(
                self,
                short: Short,
                long_argument: LongType[int, str],
                other: Other,
            ) -> None:
                '''
                Some function.
                '''
            """,
        )


class LineReportingTestCase(BaseTestCase):
    def compare(self, code: str, success: List[int], fail: List[int]) -> None:
        result = com2ann(dedent(code), silent=True)
        assert result is not None
        _, data = result
        self.assertEqual(data.success, success)
        self.assertEqual(data.fail, fail)

    def test_simple_assign(self) -> None:
        self.compare(
            """
            x = None  # type: Optional[str]
            """,
            [2],
            [],
        )

    def test_simple_function(self) -> None:
        self.compare(
            """
            def func(arg):
                # type: (int) -> int
                pass
            """,
            [2],
            [],
        )

    def test_unsupported_assigns(self) -> None:
        self.compare(
            """
            x, y = None, None  # type: (int, int)
            x = None  # type: Optional[str]
            x = y = []  # type: List[int]
            """,
            [3],
            [2, 4],
        )

    def test_invalid_function_comments(self) -> None:
        self.compare(
            """
            def func(arg):
                # type: bad
                pass
            def func(arg):
                # type: bad -> bad
                pass
            """,
            [],
            [2, 5],
        )

    def test_confusing_function_comments(self) -> None:
        self.compare(
            """
            def func1(
                arg  # type: int
            ):
                # type: (str) -> int
                pass
            def func2(arg1, arg2, arg3):
                # type: (int) -> int
                pass
            """,
            [],
            [2, 7],
        )

    def test_unsupported_statements(self) -> None:
        self.compare(
            """
            with foo(x==1) as f: # type: str
                print(f)
            with foo(x==1) as f:
                print(f)
            x = None  # type: Optional[str]
            for i, j in my_inter(x=1):
                i + j
            for i, j in my_inter(x=1): # type: (int, int)  # type: ignore
                i + j
            """,
            [6],
            [2, 9],
        )


class ForAndWithTestCase(BaseTestCase):
    def test_with(self) -> None:
        self.check(
            """
            with foo(x==1) as f: #type: str
                print(f)
            """,
            """
            with foo(x==1) as f: #type: str
                print(f)
            """,
        )

    def test_for(self) -> None:
        self.check(
            """
            for i, j in my_inter(x=1): # type: (int, int)  # type: ignore
                i + j
            """,
            """
            for i, j in my_inter(x=1): # type: (int, int)  # type: ignore
                i + j
            """,
        )

    def test_async_with(self) -> None:
        self.check(
            """
            async with foo(x==1) as f: #type: str
                print(f)
            """,
            """
            async with foo(x==1) as f: #type: str
                print(f)
            """,
        )

    def test_async_for(self) -> None:
        self.check(
            """
            async for i, j in my_inter(x=1): # type: (int, int)  # type: ignore
                i + j
            """,
            """
            async for i, j in my_inter(x=1): # type: (int, int)  # type: ignore
                i + j
            """,
        )


class FutureImportTestCase(BaseTestCase):
    def test_added_future_import(self) -> None:
        self.check(
            """
            # coding: utf-8

            x = None  # type: Optional[str]
            """,
            """
            # coding: utf-8

            from __future__ import annotations
            x: Optional[str] = None
            """,
            i=True,
        )

    def test_not_added_future_import(self) -> None:
        self.check(
            """
            x = 1
            """,
            """
            x = 1
            """,
            i=True,
        )
        self.check(
            """
            x, y = a, b  # type: Tuple[int, int]
            """,
            """
            x, y = a, b  # type: Tuple[int, int]
            """,
            i=True,
        )
        self.check(
            """
            def foo(arg1, arg2):
                # type: (int, str) -> None
                pass
            x = False  # type: bool
            """,
            """
            def foo(arg1: int, arg2: str) -> None:
                pass
            x: bool = False
            """,
            i=True,
        )


if __name__ == "__main__":
    unittest.main()


================================================
FILE: test-requirements.txt
================================================
bandit
black
codespell
flake8
flake8-bugbear
flake8-pyi
isort
mypy
pytest
pytest-cov>=2.4.0
pytest-mock>=3.7.0
pytest-xdist>=1.13
pyupgrade
safety
types-setuptools