Full Code of Megvii-BaseDetection/YOLOX for AI

main 6ddff4824372 cached

154 files

613.3 KB

169.4k tokens

601 symbols

1 requests

Download .txt

Showing preview only (654K chars total). Download the full file or copy to clipboard to get everything.

Repository: Megvii-BaseDetection/YOLOX
Branch: main
Commit: 6ddff4824372
Files: 154
Total size: 613.3 KB

Directory structure:
gitextract_i4lyl_zt/

├── .github/
│   └── workflows/
│       ├── ci.yaml
│       └── format_check.sh
├── .gitignore
├── .pre-commit-config.yaml
├── .readthedocs.yaml
├── LICENSE
├── MANIFEST.in
├── README.md
├── SECURITY.md
├── demo/
│   ├── MegEngine/
│   │   ├── cpp/
│   │   │   ├── README.md
│   │   │   ├── build.sh
│   │   │   └── yolox.cpp
│   │   └── python/
│   │       ├── README.md
│   │       ├── build.py
│   │       ├── convert_weights.py
│   │       ├── demo.py
│   │       ├── dump.py
│   │       └── models/
│   │           ├── __init__.py
│   │           ├── darknet.py
│   │           ├── network_blocks.py
│   │           ├── yolo_fpn.py
│   │           ├── yolo_head.py
│   │           ├── yolo_pafpn.py
│   │           └── yolox.py
│   ├── ONNXRuntime/
│   │   ├── README.md
│   │   └── onnx_inference.py
│   ├── OpenVINO/
│   │   ├── README.md
│   │   ├── cpp/
│   │   │   ├── CMakeLists.txt
│   │   │   ├── README.md
│   │   │   └── yolox_openvino.cpp
│   │   └── python/
│   │       ├── README.md
│   │       └── openvino_inference.py
│   ├── TensorRT/
│   │   ├── cpp/
│   │   │   ├── CMakeLists.txt
│   │   │   ├── README.md
│   │   │   ├── logging.h
│   │   │   └── yolox.cpp
│   │   └── python/
│   │       └── README.md
│   ├── ncnn/
│   │   ├── README.md
│   │   ├── android/
│   │   │   ├── README.md
│   │   │   ├── app/
│   │   │   │   ├── build.gradle
│   │   │   │   └── src/
│   │   │   │       └── main/
│   │   │   │           ├── AndroidManifest.xml
│   │   │   │           ├── assets/
│   │   │   │           │   └── yolox.param
│   │   │   │           ├── java/
│   │   │   │           │   └── com/
│   │   │   │           │       └── megvii/
│   │   │   │           │           └── yoloXncnn/
│   │   │   │           │               ├── MainActivity.java
│   │   │   │           │               ├── YOLOXncnn.java
│   │   │   │           │               └── yoloXncnn.java
│   │   │   │           ├── jni/
│   │   │   │           │   ├── CMakeLists.txt
│   │   │   │           │   └── yoloXncnn_jni.cpp
│   │   │   │           └── res/
│   │   │   │               ├── layout/
│   │   │   │               │   └── main.xml
│   │   │   │               └── values/
│   │   │   │                   └── strings.xml
│   │   │   ├── build.gradle
│   │   │   ├── gradle/
│   │   │   │   └── wrapper/
│   │   │   │       ├── gradle-wrapper.jar
│   │   │   │       └── gradle-wrapper.properties
│   │   │   ├── gradlew
│   │   │   ├── gradlew.bat
│   │   │   └── settings.gradle
│   │   └── cpp/
│   │       ├── README.md
│   │       └── yolox.cpp
│   └── nebullvm/
│       ├── README.md
│       └── nebullvm_optimization.py
├── docs/
│   ├── .gitignore
│   ├── Makefile
│   ├── _static/
│   │   └── css/
│   │       └── custom.css
│   ├── assignment_visualization.md
│   ├── cache.md
│   ├── conf.py
│   ├── freeze_module.md
│   ├── index.rst
│   ├── manipulate_training_image_size.md
│   ├── mlflow_integration.md
│   ├── model_zoo.md
│   ├── quick_run.md
│   ├── requirements-doc.txt
│   ├── train_custom_data.md
│   └── updates_note.md
├── exps/
│   ├── default/
│   │   ├── __init__.py
│   │   ├── yolov3.py
│   │   ├── yolox_l.py
│   │   ├── yolox_m.py
│   │   ├── yolox_nano.py
│   │   ├── yolox_s.py
│   │   ├── yolox_tiny.py
│   │   └── yolox_x.py
│   └── example/
│       ├── custom/
│       │   ├── nano.py
│       │   └── yolox_s.py
│       └── yolox_voc/
│           └── yolox_voc_s.py
├── hubconf.py
├── requirements.txt
├── setup.cfg
├── setup.py
├── tests/
│   ├── __init__.py
│   └── utils/
│       └── test_model_utils.py
├── tools/
│   ├── __init__.py
│   ├── demo.py
│   ├── eval.py
│   ├── export_onnx.py
│   ├── export_torchscript.py
│   ├── train.py
│   ├── trt.py
│   └── visualize_assign.py
└── yolox/
    ├── __init__.py
    ├── core/
    │   ├── __init__.py
    │   ├── launch.py
    │   └── trainer.py
    ├── data/
    │   ├── __init__.py
    │   ├── data_augment.py
    │   ├── data_prefetcher.py
    │   ├── dataloading.py
    │   ├── datasets/
    │   │   ├── __init__.py
    │   │   ├── coco.py
    │   │   ├── coco_classes.py
    │   │   ├── datasets_wrapper.py
    │   │   ├── mosaicdetection.py
    │   │   ├── voc.py
    │   │   └── voc_classes.py
    │   └── samplers.py
    ├── evaluators/
    │   ├── __init__.py
    │   ├── coco_evaluator.py
    │   ├── voc_eval.py
    │   └── voc_evaluator.py
    ├── exp/
    │   ├── __init__.py
    │   ├── base_exp.py
    │   ├── build.py
    │   ├── default/
    │   │   └── __init__.py
    │   └── yolox_base.py
    ├── layers/
    │   ├── __init__.py
    │   ├── cocoeval/
    │   │   ├── cocoeval.cpp
    │   │   └── cocoeval.h
    │   ├── fast_coco_eval_api.py
    │   └── jit_ops.py
    ├── models/
    │   ├── __init__.py
    │   ├── build.py
    │   ├── darknet.py
    │   ├── losses.py
    │   ├── network_blocks.py
    │   ├── yolo_fpn.py
    │   ├── yolo_head.py
    │   ├── yolo_pafpn.py
    │   └── yolox.py
    ├── tools/
    │   └── __init__.py
    └── utils/
        ├── __init__.py
        ├── allreduce_norm.py
        ├── boxes.py
        ├── checkpoint.py
        ├── compat.py
        ├── demo_utils.py
        ├── dist.py
        ├── ema.py
        ├── logger.py
        ├── lr_scheduler.py
        ├── metric.py
        ├── mlflow_logger.py
        ├── model_utils.py
        ├── setup_env.py
        └── visualize.py

================================================
FILE CONTENTS
================================================

================================================
FILE: .github/workflows/ci.yaml
================================================
# This is a basic workflow to help you get started with Actions

name: CI

# Controls when the action will run. Triggers the workflow on push or pull request
# events but only for the master branch
on:
  push:
  pull_request:

# A workflow run is made up of one or more jobs that can run sequentially or in parallel
jobs:
  # This workflow contains a single job called "build"
  build:
    # The type of runner that the job will run on
    runs-on: ubuntu-22.04
    strategy:
      matrix:
        python-version: ["3.8", "3.9", "3.10"]

    # Steps represent a sequence of tasks that will be executed as part of the job
    steps:
    # Checks-out your repository under $GITHUB_WORKSPACE, so your job can access it
    - uses: actions/checkout@v2

    - name: Set up Python ${{ matrix.python-version }}
      uses: actions/setup-python@v1
      with:
        python-version: ${{ matrix.python-version }}

    - name: Install dependencies
      run: |
        python -m pip install --upgrade pip
        pip install -r requirements.txt
        pip install isort==4.3.21
        pip install flake8==3.8.3
        pip install "importlib-metadata<5.0"
    # Runs a set of commands using the runners shell
    - name: Format check
      run: ./.github/workflows/format_check.sh


================================================
FILE: .github/workflows/format_check.sh
================================================
#!/bin/bash -e

set -e

export PYTHONPATH=$PWD:$PYTHONPATH

flake8 yolox exps tools || flake8_ret=$?
if [ "$flake8_ret" ]; then
    exit $flake8_ret
fi
echo "All flake check passed!"
isort --check-only -rc yolox exps || isort_ret=$?
if [ "$isort_ret" ]; then
    exit $isort_ret
fi
echo "All isort check passed!"


================================================
FILE: .gitignore
================================================
### Linux ###
*~

# user experiments directory
YOLOX_outputs/
datasets/
# do not ignore datasets under yolox/data
!*yolox/data/datasets/

# temporary files which can be created if a process still has a handle open of a deleted file
.fuse_hidden*

# KDE directory preferences
.directory

# Linux trash folder which might appear on any partition or disk
.Trash-*

# .nfs files are created when an open file is removed but is still being accessed
.nfs*

### PyCharm ###
# User-specific stuff
.idea

# CMake
cmake-build-*/

# Mongo Explorer plugin
.idea/**/mongoSettings.xml

# File-based project format
*.iws

# IntelliJ
out/

# mpeltonen/sbt-idea plugin
.idea_modules/

# JIRA plugin
atlassian-ide-plugin.xml

# Cursive Clojure plugin
.idea/replstate.xml

# Crashlytics plugin (for Android Studio and IntelliJ)
com_crashlytics_export_strings.xml
crashlytics.properties
crashlytics-build.properties
fabric.properties

# Editor-based Rest Client
.idea/httpRequests

# Android studio 3.1+ serialized cache file
.idea/caches/build_file_checksums.ser

# JetBrains templates
**___jb_tmp___

### Python ###
# Byte-compiled / optimized / DLL files
__pycache__/
*.py[cod]
*$py.class

# C extensions
*.so

# Distribution / packaging
.Python
build/
develop-eggs/
dist/
downloads/
eggs/
.eggs/
lib/
lib64/
parts/
sdist/
var/
wheels/
pip-wheel-metadata/
share/python-wheels/
*.egg-info/
.installed.cfg
*.egg
MANIFEST

# PyInstaller
#  Usually these files are written by a python script from a template
#  before PyInstaller builds the exe, so as to inject date/other infos into it.
*.manifest
*.spec

# Installer logs
pip-log.txt
pip-delete-this-directory.txt

# Unit test / coverage reports
htmlcov/
.tox/
.nox/
.coverage
.coverage.*
.cache
nosetests.xml
coverage.xml
*.cover
.hypothesis/
.pytest_cache/

# Translations
*.mo
*.pot

# Django stuff:
*.log
local_settings.py
db.sqlite3

# Flask stuff:
instance/
.webassets-cache

# Scrapy stuff:
.scrapy

# Sphinx documentation
docs/_build/
docs/build/

# PyBuilder
target/

# Jupyter Notebook
.ipynb_checkpoints

# IPython
profile_default/
ipython_config.py

# pyenv
.python-version

# pipenv
#   According to pypa/pipenv#598, it is recommended to include Pipfile.lock in version control.
#   However, in case of collaboration, if having platform-specific dependencies or dependencies
#   having no cross-platform support, pipenv may install dependencies that don’t work, or not
#   install all needed dependencies.
#Pipfile.lock

# celery beat schedule file
celerybeat-schedule

# SageMath parsed files
*.sage.py

# Environments
.env
.venv
env/
venv/
ENV/
env.bak/
venv.bak/

# Spyder project settings
.spyderproject
.spyproject

# Rope project settings
.ropeproject

# mkdocs documentation
/site

# mypy
.mypy_cache/
.dmypy.json
dmypy.json

# Pyre type checker
.pyre/

### Vim ###
# Swap
[._]*.s[a-v][a-z]
[._]*.sw[a-p]
[._]s[a-rt-v][a-z]
[._]ss[a-gi-z]
[._]sw[a-p]

# Session
Session.vim

# Temporary
.netrwhist
# Auto-generated tag files
tags
# Persistent undo
[._]*.un~

# output
docs/api
.code-workspace.code-workspace
*.pkl
*.npy
*.pth
*.onnx
*.engine
events.out.tfevents*

# vscode
*.code-workspace
.vscode

# vim
.vim

# OS generated files
.DS_Store
.DS_Store?
.Trashes
ehthumbs.db
Thumbs.db


================================================
FILE: .pre-commit-config.yaml
================================================
repos:
  - repo: https://github.com/pycqa/flake8
    rev: 3.8.3
    hooks:
      - id: flake8
  - repo: https://github.com/pre-commit/pre-commit-hooks
    rev: v3.1.0
    hooks:
      - id: check-added-large-files
      - id: check-docstring-first
      - id: check-executables-have-shebangs
      - id: check-json
      - id: check-yaml
        args: ["--unsafe"]
      - id: debug-statements
      - id: end-of-file-fixer
      - id: requirements-txt-fixer
      - id: trailing-whitespace
  - repo: https://github.com/jorisroovers/gitlint
    rev: v0.15.1
    hooks:
      - id: gitlint
  - repo: https://github.com/pycqa/isort
    rev: 4.3.21
    hooks:
      - id: isort

  - repo: https://github.com/PyCQA/autoflake
    rev: v1.4
    hooks:
      - id: autoflake
        name: Remove unused variables and imports
        entry: autoflake
        language: python
        args:
          [
            "--in-place",
            "--remove-all-unused-imports",
            "--remove-unused-variables",
            "--expand-star-imports",
            "--ignore-init-module-imports",
          ]
        files: \.py$


================================================
FILE: .readthedocs.yaml
================================================
# .readthedocs.yaml
# Read the Docs configuration file
# See https://docs.readthedocs.io/en/stable/config-file/v2.html for details

# Required
version: 2

# Build documentation in the docs/ directory with Sphinx
sphinx:
   configuration: docs/conf.py

# Optionally build your docs in additional formats such as PDF
formats:
   - pdf

# Optionally set the version of Python and requirements required to build your docs
python:
   version: "3.7"
   install:
   - requirements: docs/requirements-doc.txt
   - requirements: requirements.txt


================================================
FILE: LICENSE
================================================
                                 Apache License
                           Version 2.0, January 2004
                        http://www.apache.org/licenses/

   TERMS AND CONDITIONS FOR USE, REPRODUCTION, AND DISTRIBUTION

   1. Definitions.

      "License" shall mean the terms and conditions for use, reproduction,
      and distribution as defined by Sections 1 through 9 of this document.

      "Licensor" shall mean the copyright owner or entity authorized by
      the copyright owner that is granting the License.

      "Legal Entity" shall mean the union of the acting entity and all
      other entities that control, are controlled by, or are under common
      control with that entity. For the purposes of this definition,
      "control" means (i) the power, direct or indirect, to cause the
      direction or management of such entity, whether by contract or
      otherwise, or (ii) ownership of fifty percent (50%) or more of the
      outstanding shares, or (iii) beneficial ownership of such entity.

      "You" (or "Your") shall mean an individual or Legal Entity
      exercising permissions granted by this License.

      "Source" form shall mean the preferred form for making modifications,
      including but not limited to software source code, documentation
      source, and configuration files.

      "Object" form shall mean any form resulting from mechanical
      transformation or translation of a Source form, including but
      not limited to compiled object code, generated documentation,
      and conversions to other media types.

      "Work" shall mean the work of authorship, whether in Source or
      Object form, made available under the License, as indicated by a
      copyright notice that is included in or attached to the work
      (an example is provided in the Appendix below).

      "Derivative Works" shall mean any work, whether in Source or Object
      form, that is based on (or derived from) the Work and for which the
      editorial revisions, annotations, elaborations, or other modifications
      represent, as a whole, an original work of authorship. For the purposes
      of this License, Derivative Works shall not include works that remain
      separable from, or merely link (or bind by name) to the interfaces of,
      the Work and Derivative Works thereof.

      "Contribution" shall mean any work of authorship, including
      the original version of the Work and any modifications or additions
      to that Work or Derivative Works thereof, that is intentionally
      submitted to Licensor for inclusion in the Work by the copyright owner
      or by an individual or Legal Entity authorized to submit on behalf of
      the copyright owner. For the purposes of this definition, "submitted"
      means any form of electronic, verbal, or written communication sent
      to the Licensor or its representatives, including but not limited to
      communication on electronic mailing lists, source code control systems,
      and issue tracking systems that are managed by, or on behalf of, the
      Licensor for the purpose of discussing and improving the Work, but
      excluding communication that is conspicuously marked or otherwise
      designated in writing by the copyright owner as "Not a Contribution."

      "Contributor" shall mean Licensor and any individual or Legal Entity
      on behalf of whom a Contribution has been received by Licensor and
      subsequently incorporated within the Work.

   2. Grant of Copyright License. Subject to the terms and conditions of
      this License, each Contributor hereby grants to You a perpetual,
      worldwide, non-exclusive, no-charge, royalty-free, irrevocable
      copyright license to reproduce, prepare Derivative Works of,
      publicly display, publicly perform, sublicense, and distribute the
      Work and such Derivative Works in Source or Object form.

   3. Grant of Patent License. Subject to the terms and conditions of
      this License, each Contributor hereby grants to You a perpetual,
      worldwide, non-exclusive, no-charge, royalty-free, irrevocable
      (except as stated in this section) patent license to make, have made,
      use, offer to sell, sell, import, and otherwise transfer the Work,
      where such license applies only to those patent claims licensable
      by such Contributor that are necessarily infringed by their
      Contribution(s) alone or by combination of their Contribution(s)
      with the Work to which such Contribution(s) was submitted. If You
      institute patent litigation against any entity (including a
      cross-claim or counterclaim in a lawsuit) alleging that the Work
      or a Contribution incorporated within the Work constitutes direct
      or contributory patent infringement, then any patent licenses
      granted to You under this License for that Work shall terminate
      as of the date such litigation is filed.

   4. Redistribution. You may reproduce and distribute copies of the
      Work or Derivative Works thereof in any medium, with or without
      modifications, and in Source or Object form, provided that You
      meet the following conditions:

      (a) You must give any other recipients of the Work or
          Derivative Works a copy of this License; and

      (b) You must cause any modified files to carry prominent notices
          stating that You changed the files; and

      (c) You must retain, in the Source form of any Derivative Works
          that You distribute, all copyright, patent, trademark, and
          attribution notices from the Source form of the Work,
          excluding those notices that do not pertain to any part of
          the Derivative Works; and

      (d) If the Work includes a "NOTICE" text file as part of its
          distribution, then any Derivative Works that You distribute must
          include a readable copy of the attribution notices contained
          within such NOTICE file, excluding those notices that do not
          pertain to any part of the Derivative Works, in at least one
          of the following places: within a NOTICE text file distributed
          as part of the Derivative Works; within the Source form or
          documentation, if provided along with the Derivative Works; or,
          within a display generated by the Derivative Works, if and
          wherever such third-party notices normally appear. The contents
          of the NOTICE file are for informational purposes only and
          do not modify the License. You may add Your own attribution
          notices within Derivative Works that You distribute, alongside
          or as an addendum to the NOTICE text from the Work, provided
          that such additional attribution notices cannot be construed
          as modifying the License.

      You may add Your own copyright statement to Your modifications and
      may provide additional or different license terms and conditions
      for use, reproduction, or distribution of Your modifications, or
      for any such Derivative Works as a whole, provided Your use,
      reproduction, and distribution of the Work otherwise complies with
      the conditions stated in this License.

   5. Submission of Contributions. Unless You explicitly state otherwise,
      any Contribution intentionally submitted for inclusion in the Work
      by You to the Licensor shall be under the terms and conditions of
      this License, without any additional terms or conditions.
      Notwithstanding the above, nothing herein shall supersede or modify
      the terms of any separate license agreement you may have executed
      with Licensor regarding such Contributions.

   6. Trademarks. This License does not grant permission to use the trade
      names, trademarks, service marks, or product names of the Licensor,
      except as required for reasonable and customary use in describing the
      origin of the Work and reproducing the content of the NOTICE file.

   7. Disclaimer of Warranty. Unless required by applicable law or
      agreed to in writing, Licensor provides the Work (and each
      Contributor provides its Contributions) on an "AS IS" BASIS,
      WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or
      implied, including, without limitation, any warranties or conditions
      of TITLE, NON-INFRINGEMENT, MERCHANTABILITY, or FITNESS FOR A
      PARTICULAR PURPOSE. You are solely responsible for determining the
      appropriateness of using or redistributing the Work and assume any
      risks associated with Your exercise of permissions under this License.

   8. Limitation of Liability. In no event and under no legal theory,
      whether in tort (including negligence), contract, or otherwise,
      unless required by applicable law (such as deliberate and grossly
      negligent acts) or agreed to in writing, shall any Contributor be
      liable to You for damages, including any direct, indirect, special,
      incidental, or consequential damages of any character arising as a
      result of this License or out of the use or inability to use the
      Work (including but not limited to damages for loss of goodwill,
      work stoppage, computer failure or malfunction, or any and all
      other commercial damages or losses), even if such Contributor
      has been advised of the possibility of such damages.

   9. Accepting Warranty or Additional Liability. While redistributing
      the Work or Derivative Works thereof, You may choose to offer,
      and charge a fee for, acceptance of support, warranty, indemnity,
      or other liability obligations and/or rights consistent with this
      License. However, in accepting such obligations, You may act only
      on Your own behalf and on Your sole responsibility, not on behalf
      of any other Contributor, and only if You agree to indemnify,
      defend, and hold each Contributor harmless for any liability
      incurred by, or claims asserted against, such Contributor by reason
      of your accepting any such warranty or additional liability.

   END OF TERMS AND CONDITIONS

   APPENDIX: How to apply the Apache License to your work.

      To apply the Apache License to your work, attach the following
      boilerplate notice, with the fields enclosed by brackets "{}"
      replaced with your own identifying information. (Don't include
      the brackets!)  The text should be enclosed in the appropriate
      comment syntax for the file format. We also recommend that a
      file or class name and description of purpose be included on the
      same "printed page" as the copyright notice for easier
      identification within third-party archives.

   Copyright (c) 2021-2022 Megvii Inc. All rights reserved.

   Licensed under the Apache License, Version 2.0 (the "License");
   you may not use this file except in compliance with the License.
   You may obtain a copy of the License at

       http://www.apache.org/licenses/LICENSE-2.0

   Unless required by applicable law or agreed to in writing, software
   distributed under the License is distributed on an "AS IS" BASIS,
   WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
   See the License for the specific language governing permissions and
   limitations under the License.


================================================
FILE: MANIFEST.in
================================================
include requirements.txt
recursive-include yolox *.cpp *.h *.cu *.cuh *.cc


================================================
FILE: README.md
================================================
<div align="center"><img src="assets/logo.png" width="350"></div>
<img src="assets/demo.png" >

## Introduction
YOLOX is an anchor-free version of YOLO, with a simpler design but better performance! It aims to bridge the gap between research and industrial communities.
For more details, please refer to our [report on Arxiv](https://arxiv.org/abs/2107.08430).

This repo is an implementation of PyTorch version YOLOX, there is also a [MegEngine implementation](https://github.com/MegEngine/YOLOX).

<img src="assets/git_fig.png" width="1000" >

## Updates!!
* 【2023/02/28】 We support assignment visualization tool, see doc [here](./docs/assignment_visualization.md).
* 【2022/04/14】 We support jit compile op.
* 【2021/08/19】 We optimize the training process with **2x** faster training and **~1%** higher performance! See [notes](docs/updates_note.md) for more details.
* 【2021/08/05】 We release [MegEngine version YOLOX](https://github.com/MegEngine/YOLOX).
* 【2021/07/28】 We fix the fatal error of [memory leak](https://github.com/Megvii-BaseDetection/YOLOX/issues/103)
* 【2021/07/26】 We now support [MegEngine](https://github.com/Megvii-BaseDetection/YOLOX/tree/main/demo/MegEngine) deployment.
* 【2021/07/20】 We have released our technical report on [Arxiv](https://arxiv.org/abs/2107.08430).

## Benchmark

#### Standard Models.

|Model |size |mAP<sup>val<br>0.5:0.95 |mAP<sup>test<br>0.5:0.95 | Speed V100<br>(ms) | Params<br>(M) |FLOPs<br>(G)| weights |
| ------        |:---: | :---:    | :---:       |:---:     |:---:  | :---: | :----: |
|[YOLOX-s](./exps/default/yolox_s.py)    |640  |40.5 |40.5      |9.8      |9.0 | 26.8 | [github](https://github.com/Megvii-BaseDetection/YOLOX/releases/download/0.1.1rc0/yolox_s.pth) |
|[YOLOX-m](./exps/default/yolox_m.py)    |640  |46.9 |47.2      |12.3     |25.3 |73.8| [github](https://github.com/Megvii-BaseDetection/YOLOX/releases/download/0.1.1rc0/yolox_m.pth) |
|[YOLOX-l](./exps/default/yolox_l.py)    |640  |49.7 |50.1      |14.5     |54.2| 155.6 | [github](https://github.com/Megvii-BaseDetection/YOLOX/releases/download/0.1.1rc0/yolox_l.pth) |
|[YOLOX-x](./exps/default/yolox_x.py)   |640   |51.1 |**51.5**  | 17.3    |99.1 |281.9 | [github](https://github.com/Megvii-BaseDetection/YOLOX/releases/download/0.1.1rc0/yolox_x.pth) |
|[YOLOX-Darknet53](./exps/default/yolov3.py)   |640  | 47.7 | 48.0 | 11.1 |63.7 | 185.3 | [github](https://github.com/Megvii-BaseDetection/YOLOX/releases/download/0.1.1rc0/yolox_darknet.pth) |

<details>
<summary>Legacy models</summary>

|Model |size |mAP<sup>test<br>0.5:0.95 | Speed V100<br>(ms) | Params<br>(M) |FLOPs<br>(G)| weights |
| ------        |:---: | :---:       |:---:     |:---:  | :---: | :----: |
|[YOLOX-s](./exps/default/yolox_s.py)    |640  |39.6      |9.8     |9.0 | 26.8 | [onedrive](https://megvii-my.sharepoint.cn/:u:/g/personal/gezheng_megvii_com/EW62gmO2vnNNs5npxjzunVwB9p307qqygaCkXdTO88BLUg?e=NMTQYw)/[github](https://github.com/Megvii-BaseDetection/storage/releases/download/0.0.1/yolox_s.pth) |
|[YOLOX-m](./exps/default/yolox_m.py)    |640  |46.4      |12.3     |25.3 |73.8| [onedrive](https://megvii-my.sharepoint.cn/:u:/g/personal/gezheng_megvii_com/ERMTP7VFqrVBrXKMU7Vl4TcBQs0SUeCT7kvc-JdIbej4tQ?e=1MDo9y)/[github](https://github.com/Megvii-BaseDetection/storage/releases/download/0.0.1/yolox_m.pth) |
|[YOLOX-l](./exps/default/yolox_l.py)    |640  |50.0  |14.5 |54.2| 155.6 | [onedrive](https://megvii-my.sharepoint.cn/:u:/g/personal/gezheng_megvii_com/EWA8w_IEOzBKvuueBqfaZh0BeoG5sVzR-XYbOJO4YlOkRw?e=wHWOBE)/[github](https://github.com/Megvii-BaseDetection/storage/releases/download/0.0.1/yolox_l.pth) |
|[YOLOX-x](./exps/default/yolox_x.py)   |640  |**51.2**      | 17.3 |99.1 |281.9 | [onedrive](https://megvii-my.sharepoint.cn/:u:/g/personal/gezheng_megvii_com/EdgVPHBziOVBtGAXHfeHI5kBza0q9yyueMGdT0wXZfI1rQ?e=tABO5u)/[github](https://github.com/Megvii-BaseDetection/storage/releases/download/0.0.1/yolox_x.pth) |
|[YOLOX-Darknet53](./exps/default/yolov3.py)   |640  | 47.4      | 11.1 |63.7 | 185.3 | [onedrive](https://megvii-my.sharepoint.cn/:u:/g/personal/gezheng_megvii_com/EZ-MV1r_fMFPkPrNjvbJEMoBLOLAnXH-XKEB77w8LhXL6Q?e=mf6wOc)/[github](https://github.com/Megvii-BaseDetection/storage/releases/download/0.0.1/yolox_darknet53.pth) |

</details>

#### Light Models.

|Model |size |mAP<sup>val<br>0.5:0.95 | Params<br>(M) |FLOPs<br>(G)| weights |
| ------        |:---:  |  :---:       |:---:     |:---:  | :---: |
|[YOLOX-Nano](./exps/default/yolox_nano.py) |416  |25.8  | 0.91 |1.08 | [github](https://github.com/Megvii-BaseDetection/YOLOX/releases/download/0.1.1rc0/yolox_nano.pth) |
|[YOLOX-Tiny](./exps/default/yolox_tiny.py) |416  |32.8 | 5.06 |6.45 | [github](https://github.com/Megvii-BaseDetection/YOLOX/releases/download/0.1.1rc0/yolox_tiny.pth) |


<details>
<summary>Legacy models</summary>

|Model |size |mAP<sup>val<br>0.5:0.95 | Params<br>(M) |FLOPs<br>(G)| weights |
| ------        |:---:  |  :---:       |:---:     |:---:  | :---: |
|[YOLOX-Nano](./exps/default/yolox_nano.py) |416  |25.3  | 0.91 |1.08 | [github](https://github.com/Megvii-BaseDetection/storage/releases/download/0.0.1/yolox_nano.pth) |
|[YOLOX-Tiny](./exps/default/yolox_tiny.py) |416  |32.8 | 5.06 |6.45 | [github](https://github.com/Megvii-BaseDetection/storage/releases/download/0.0.1/yolox_tiny_32dot8.pth) |

</details>

## Quick Start

<details>
<summary>Installation</summary>

Step1. Install YOLOX from source.
```shell
git clone git@github.com:Megvii-BaseDetection/YOLOX.git
cd YOLOX
pip3 install -v -e .  # or  python3 setup.py develop
```

</details>

<details>
<summary>Demo</summary>

Step1. Download a pretrained model from the benchmark table.

Step2. Use either -n or -f to specify your detector's config. For example:

```shell
python tools/demo.py image -n yolox-s -c /path/to/your/yolox_s.pth --path assets/dog.jpg --conf 0.25 --nms 0.45 --tsize 640 --save_result --device [cpu/gpu]
```
or
```shell
python tools/demo.py image -f exps/default/yolox_s.py -c /path/to/your/yolox_s.pth --path assets/dog.jpg --conf 0.25 --nms 0.45 --tsize 640 --save_result --device [cpu/gpu]
```
Demo for video:
```shell
python tools/demo.py video -n yolox-s -c /path/to/your/yolox_s.pth --path /path/to/your/video --conf 0.25 --nms 0.45 --tsize 640 --save_result --device [cpu/gpu]
```


</details>

<details>
<summary>Reproduce our results on COCO</summary>

Step1. Prepare COCO dataset
```shell
cd <YOLOX_HOME>
ln -s /path/to/your/COCO ./datasets/COCO
```

Step2. Reproduce our results on COCO by specifying -n:

```shell
python -m yolox.tools.train -n yolox-s -d 8 -b 64 --fp16 -o [--cache]
                               yolox-m
                               yolox-l
                               yolox-x
```
* -d: number of gpu devices
* -b: total batch size, the recommended number for -b is num-gpu * 8
* --fp16: mixed precision training
* --cache: caching imgs into RAM to accelarate training, which need large system RAM.



When using -f, the above commands are equivalent to:
```shell
python -m yolox.tools.train -f exps/default/yolox_s.py -d 8 -b 64 --fp16 -o [--cache]
                               exps/default/yolox_m.py
                               exps/default/yolox_l.py
                               exps/default/yolox_x.py
```

**Multi Machine Training**

We also support multi-nodes training. Just add the following args:
* --num\_machines: num of your total training nodes
* --machine\_rank: specify the rank of each node

Suppose you want to train YOLOX on 2 machines, and your master machines's IP is 123.123.123.123, use port 12312 and TCP.

On master machine, run
```shell
python tools/train.py -n yolox-s -b 128 --dist-url tcp://123.123.123.123:12312 --num_machines 2 --machine_rank 0
```
On the second machine, run
```shell
python tools/train.py -n yolox-s -b 128 --dist-url tcp://123.123.123.123:12312 --num_machines 2 --machine_rank 1
```

**Logging to Weights & Biases**

To log metrics, predictions and model checkpoints to [W&B](https://docs.wandb.ai/guides/integrations/other/yolox) use the command line argument `--logger wandb` and use the prefix "wandb-" to specify arguments for initializing the wandb run.

```shell
python tools/train.py -n yolox-s -d 8 -b 64 --fp16 -o [--cache] --logger wandb wandb-project <project name>
                         yolox-m
                         yolox-l
                         yolox-x
```

An example wandb dashboard is available [here](https://wandb.ai/manan-goel/yolox-nano/runs/3pzfeom0)

**Others**

See more information with the following command:
```shell
python -m yolox.tools.train --help
```

</details>


<details>
<summary>Evaluation</summary>

We support batch testing for fast evaluation:

```shell
python -m yolox.tools.eval -n  yolox-s -c yolox_s.pth -b 64 -d 8 --conf 0.001 [--fp16] [--fuse]
                               yolox-m
                               yolox-l
                               yolox-x
```
* --fuse: fuse conv and bn
* -d: number of GPUs used for evaluation. DEFAULT: All GPUs available will be used.
* -b: total batch size across on all GPUs

To reproduce speed test, we use the following command:
```shell
python -m yolox.tools.eval -n  yolox-s -c yolox_s.pth -b 1 -d 1 --conf 0.001 --fp16 --fuse
                               yolox-m
                               yolox-l
                               yolox-x
```

</details>


<details>
<summary>Tutorials</summary>

*  [Training on custom data](docs/train_custom_data.md)
*  [Caching for custom data](docs/cache.md)
*  [Manipulating training image size](docs/manipulate_training_image_size.md)
*  [Assignment visualization](docs/assignment_visualization.md)
*  [Freezing model](docs/freeze_module.md)

</details>

## Deployment


1. [MegEngine in C++ and Python](./demo/MegEngine)
2. [ONNX export and an ONNXRuntime](./demo/ONNXRuntime)
3. [TensorRT in C++ and Python](./demo/TensorRT)
4. [ncnn in C++ and Java](./demo/ncnn)
5. [OpenVINO in C++ and Python](./demo/OpenVINO)
6. [Accelerate YOLOX inference with nebullvm in Python](./demo/nebullvm)

## Third-party resources
* YOLOX for streaming perception: [StreamYOLO (CVPR 2022 Oral)](https://github.com/yancie-yjr/StreamYOLO)
* The YOLOX-s and YOLOX-nano are Integrated into [ModelScope](https://www.modelscope.cn/home). Try out the Online Demo at [YOLOX-s](https://www.modelscope.cn/models/damo/cv_cspnet_image-object-detection_yolox/summary) and [YOLOX-Nano](https://www.modelscope.cn/models/damo/cv_cspnet_image-object-detection_yolox_nano_coco/summary) respectively 🚀.
* Integrated into [Huggingface Spaces 🤗](https://huggingface.co/spaces) using [Gradio](https://github.com/gradio-app/gradio). Try out the Web Demo: [![Hugging Face Spaces](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Spaces-blue)](https://huggingface.co/spaces/Sultannn/YOLOX-Demo)
* The ncnn android app with video support: [ncnn-android-yolox](https://github.com/FeiGeChuanShu/ncnn-android-yolox) from [FeiGeChuanShu](https://github.com/FeiGeChuanShu)
* YOLOX with Tengine support: [Tengine](https://github.com/OAID/Tengine/blob/tengine-lite/examples/tm_yolox.cpp) from [BUG1989](https://github.com/BUG1989)
* YOLOX + ROS2 Foxy: [YOLOX-ROS](https://github.com/Ar-Ray-code/YOLOX-ROS) from [Ar-Ray](https://github.com/Ar-Ray-code)
* YOLOX Deploy DeepStream: [YOLOX-deepstream](https://github.com/nanmi/YOLOX-deepstream) from [nanmi](https://github.com/nanmi)
* YOLOX MNN/TNN/ONNXRuntime: [YOLOX-MNN](https://github.com/DefTruth/lite.ai.toolkit/blob/main/lite/mnn/cv/mnn_yolox.cpp)、[YOLOX-TNN](https://github.com/DefTruth/lite.ai.toolkit/blob/main/lite/tnn/cv/tnn_yolox.cpp) and [YOLOX-ONNXRuntime C++](https://github.com/DefTruth/lite.ai.toolkit/blob/main/lite/ort/cv/yolox.cpp) from [DefTruth](https://github.com/DefTruth)
* Converting darknet or yolov5 datasets to COCO format for YOLOX: [YOLO2COCO](https://github.com/RapidAI/YOLO2COCO) from [Daniel](https://github.com/znsoftm)

## Cite YOLOX
If you use YOLOX in your research, please cite our work by using the following BibTeX entry:

```latex
 @article{yolox2021,
  title={YOLOX: Exceeding YOLO Series in 2021},
  author={Ge, Zheng and Liu, Songtao and Wang, Feng and Li, Zeming and Sun, Jian},
  journal={arXiv preprint arXiv:2107.08430},
  year={2021}
}
```
## In memory of Dr. Jian Sun
Without the guidance of [Dr. Jian Sun](https://scholar.google.com/citations?user=ALVSZAYAAAAJ), YOLOX would not have been released and open sourced to the community.
The passing away of Dr. Sun is a huge loss to the Computer Vision field. We add this section here to express our remembrance and condolences to our captain Dr. Sun.
It is hoped that every AI practitioner in the world will stick to the belief of "continuous innovation to expand cognitive boundaries, and extraordinary technology to achieve product value" and move forward all the way.

<div align="center"><img src="assets/sunjian.png" width="200"></div>
没有孙剑博士的指导，YOLOX也不会问世并开源给社区使用。
孙剑博士的离去是CV领域的一大损失，我们在此特别添加了这个部分来表达对我们的“船长”孙老师的纪念和哀思。
希望世界上的每个AI从业者秉持着“持续创新拓展认知边界，非凡科技成就产品价值”的观念，一路向前。


================================================
FILE: SECURITY.md
================================================
# Security Policy

## Reporting a Vulnerability

### Types of Security Issues
We actively monitor:  
- Code vulnerabilities (RCE, XSS, authentication bypass)  
- Dependency risks (critical vulnerabilities in project dependencies, such as requirements.txt, pyproject.toml, or equivalent files)  
- Configuration flaws (insecure defaults in deployment scripts)  

### Disclosure Channels (Choose one):

1. **Encrypted Email**  
   Contact: `wangfeng19950315@163.com`  
   *Subject format: `[SECURITY] ModuleName - Brief Description`*

2. **GitHub Private Report**  
   Use GitHub's ["Report a vulnerability"](https://github.com/Megvii-BaseDetection/YOLOX/security/advisories) feature  

3. **Reporting Security Issues**  
   Please report security issues using Create new issue: https://github.com/Megvii-BaseDetection/YOLOX/issues/new


## Response Process  
1. **Acknowledgement**  
   - Initial response within **48 business hours**  
2. **Assessment**  
   - Triage using CVSS v3.1 scoring  
3. **Remediation**  
   - Critical (CVSS ≥9.0): Patch within **7 days**  
   - High (CVSS 7-8.9): Patch within **30 days**  
4. **Public Disclosure**  
   - Published via [GitHub Advisories](https://github.com/Megvii-BaseDetection/YOLOX/security/advisories)  
   - CVE assignment coordinated with [MITRE](https://cveform.mitre.org)
  
## Secure Development Practices  
- Always verify hashes when downloading dependencies:  
  ```bash
  sha256sum -c <your-dependency-hash-file>
  ```


================================================
FILE: demo/MegEngine/cpp/README.md
================================================
# YOLOX-CPP-MegEngine

Cpp file compile of YOLOX object detection base on [MegEngine](https://github.com/MegEngine/MegEngine).

## Tutorial

### Step1: install toolchain

	* host: sudo apt install gcc/g++ (gcc/g++, which version >= 6) build-essential git git-lfs gfortran libgfortran-6-dev autoconf gnupg flex bison gperf curl zlib1g-dev gcc-multilib g++-multilib cmake
 * cross build android: download [NDK](https://developer.android.com/ndk/downloads)
   	* after unzip download NDK, then export NDK_ROOT="path of NDK"

### Step2: build MegEngine

```shell
git clone https://github.com/MegEngine/MegEngine.git

# then init third_party
 
export megengine_root="path of MegEngine"
cd $megengine_root && ./third_party/prepare.sh && ./third_party/install-mkl.sh

# build example:
# build host without cuda:   
./scripts/cmake-build/host_build.sh
# or build host with cuda:
./scripts/cmake-build/host_build.sh -c
# or cross build for android aarch64: 
./scripts/cmake-build/cross_build_android_arm_inference.sh
# or cross build for android aarch64(with V8.2+fp16): 
./scripts/cmake-build/cross_build_android_arm_inference.sh -f

# after build MegEngine, you need export the `MGE_INSTALL_PATH`
# host without cuda: 
export MGE_INSTALL_PATH=${megengine_root}/build_dir/host/MGE_WITH_CUDA_OFF/MGE_INFERENCE_ONLY_ON/Release/install
# or host with cuda: 
export MGE_INSTALL_PATH=${megengine_root}/build_dir/host/MGE_WITH_CUDA_ON/MGE_INFERENCE_ONLY_ON/Release/install
# or cross build for android aarch64: 
export MGE_INSTALL_PATH=${megengine_root}/build_dir/android/arm64-v8a/Release/install
```
* you can refs [build tutorial of MegEngine](https://github.com/MegEngine/MegEngine/blob/master/scripts/cmake-build/BUILD_README.md) to build other platform, eg, windows/macos/ etc!

### Step3: build OpenCV

```shell
git clone https://github.com/opencv/opencv.git

git checkout 3.4.15 (we test at 3.4.15, if test other version, may need modify some build)
```

- patch diff for android:

```
# ```
#     diff --git a/CMakeLists.txt b/CMakeLists.txt
#     index f6a2da5310..10354312c9 100644
#     --- a/CMakeLists.txt
#     +++ b/CMakeLists.txt
#     @@ -643,7 +643,7 @@ if(UNIX)
#        if(NOT APPLE)
#          CHECK_INCLUDE_FILE(pthread.h HAVE_PTHREAD)
#          if(ANDROID)
#     -      set(OPENCV_LINKER_LIBS ${OPENCV_LINKER_LIBS} dl m log)
#     +      set(OPENCV_LINKER_LIBS ${OPENCV_LINKER_LIBS} dl m log z)
#          elseif(CMAKE_SYSTEM_NAME MATCHES "FreeBSD|NetBSD|DragonFly|OpenBSD|Haiku")
#            set(OPENCV_LINKER_LIBS ${OPENCV_LINKER_LIBS} m pthread)
#          elseif(EMSCRIPTEN)
    
# ```
```

- build for host

```shell
cd root_dir_of_opencv
mkdir -p build/install
cd build
cmake -DBUILD_JAVA=OFF -DBUILD_SHARED_LIBS=ON -DCMAKE_INSTALL_PREFIX=$PWD/install 
make install -j32
```

* build for android-aarch64

```shell
cd root_dir_of_opencv
mkdir -p build_android/install
cd build_android

cmake -DCMAKE_TOOLCHAIN_FILE="$NDK_ROOT/build/cmake/android.toolchain.cmake" -DANDROID_NDK="$NDK_ROOT"  -DANDROID_ABI=arm64-v8a -DANDROID_NATIVE_API_LEVEL=21 -DBUILD_JAVA=OFF -DBUILD_ANDROID_PROJECTS=OFF -DBUILD_ANDROID_EXAMPLES=OFF -DBUILD_SHARED_LIBS=ON -DCMAKE_INSTALL_PREFIX=$PWD/install ..

make install -j32
```

* after build OpenCV, you need export  `OPENCV_INSTALL_INCLUDE_PATH ` and `OPENCV_INSTALL_LIB_PATH`

```shell
# host build: 
export OPENCV_INSTALL_INCLUDE_PATH=${path of opencv}/build/install/include
export OPENCV_INSTALL_LIB_PATH=${path of opencv}/build/install/lib
# or cross build for android aarch64:
export OPENCV_INSTALL_INCLUDE_PATH=${path of opencv}/build_android/install/sdk/native/jni/include
export OPENCV_INSTALL_LIB_PATH=${path of opencv}/build_android/install/sdk/native/libs/arm64-v8a
```

###  Step4: build test demo

```shell
run build.sh

# if host:
export CXX=g++
./build.sh
# or cross android aarch64
export CXX=aarch64-linux-android21-clang++
./build.sh
```

### Step5: run demo

> **Note**: two ways to get `yolox_s.mge` model file
>
> * reference to python demo's `dump.py` script.
> * For users with code before 0.1.0 version, wget yolox-s weights [here](https://github.com/Megvii-BaseDetection/storage/releases/download/0.0.1/yolox_s.mge).
> * For users with code after 0.1.0 version, use [python code in megengine](../python) to generate mge file.

```shell
# if host:
LD_LIBRARY_PATH=$MGE_INSTALL_PATH/lib/:$OPENCV_INSTALL_LIB_PATH ./yolox yolox_s.mge ../../../assets/dog.jpg cuda/cpu/multithread <warmup_count> <thread_number>

# or cross android
adb push/scp $MGE_INSTALL_PATH/lib/libmegengine.so android_phone
adb push/scp $OPENCV_INSTALL_LIB_PATH/*.so android_phone
adb push/scp ./yolox yolox_s.mge android_phone
adb push/scp ../../../assets/dog.jpg android_phone

# login in android_phone by adb or ssh
# then run: 
LD_LIBRARY_PATH=. ./yolox yolox_s.mge dog.jpg cpu/multithread <warmup_count> <thread_number> <use_fast_run> <use_weight_preprocess>  <run_with_fp16>

# * <warmup_count> means warmup count, valid number >=0
# * <thread_number> means thread number, valid number >=1, only take effect `multithread` device
# * <use_fast_run> if >=1 , will use fastrun to choose best algo
# * <use_weight_preprocess> if >=1, will handle weight preprocess before exe
# * <run_with_fp16> if >=1, will run with fp16 mode
```

## Bechmark

* model info: yolox-s @ input(1,3,640,640)					

* test devices

```
  * x86_64  -- Intel(R) Xeon(R) CPU E5-2620 v4 @ 2.10GHz					
  * aarch64 -- xiamo phone mi9					
  * cuda    -- 1080TI @ cuda-10.1-cudnn-v7.6.3-TensorRT-6.0.1.5.sh @ Intel(R) Xeon(R) CPU E5-2620 v4 @ 2.10GHz
```

  | megengine @ tag1.4(fastrun + weight\_preprocess)/sec | 1 thread |
  | ---------------------------------------------------- | -------- |
  | x86\_64                                              | 0.516245 |
  | aarch64(fp32+chw44)                                  | 0.587857 |

  | CUDA @ 1080TI/sec   | 1 batch    | 2 batch   | 4 batch   | 8 batch   | 16 batch  | 32 batch | 64 batch |
  | ------------------- | ---------- | --------- | --------- | --------- | --------- | -------- | -------- |
  | megengine(fp32+chw) | 0.00813703 | 0.0132893 | 0.0236633 | 0.0444699 | 0.0864917 | 0.16895  | 0.334248 |

## Acknowledgement

* [MegEngine](https://github.com/MegEngine/MegEngine)
* [OpenCV](https://github.com/opencv/opencv)
* [NDK](https://developer.android.com/ndk)
* [CMAKE](https://cmake.org/)


================================================
FILE: demo/MegEngine/cpp/build.sh
================================================
#!/usr/bin/env bash
set -e

if [ -z $CXX ];then
    echo "please export you c++ toolchain to CXX"
    echo "for example:"
    echo "build for host:                                        export CXX=g++"
    echo "cross build for aarch64-android(always locate in NDK): export CXX=aarch64-linux-android21-clang++"
    echo "cross build for aarch64-linux:                         export CXX=aarch64-linux-gnu-g++"
    exit -1
fi

if [ -z $MGE_INSTALL_PATH ];then
    echo "please refsi ./README.md to init MGE_INSTALL_PATH env"
    exit -1
fi

if [ -z $OPENCV_INSTALL_INCLUDE_PATH ];then
    echo "please refs ./README.md to init OPENCV_INSTALL_INCLUDE_PATH env"
    exit -1
fi

if [ -z $OPENCV_INSTALL_LIB_PATH ];then
    echo "please refs ./README.md to init OPENCV_INSTALL_LIB_PATH env"
    exit -1
fi

INCLUDE_FLAG="-I$MGE_INSTALL_PATH/include -I$OPENCV_INSTALL_INCLUDE_PATH"
LINK_FLAG="-L$MGE_INSTALL_PATH/lib/ -lmegengine -L$OPENCV_INSTALL_LIB_PATH -lopencv_core -lopencv_highgui -lopencv_imgproc -lopencv_imgcodecs"
BUILD_FLAG="-static-libstdc++ -O3 -pie -fPIE -g"

if [[ $CXX =~ "android" ]]; then
    LINK_FLAG="${LINK_FLAG} -llog -lz"
fi

echo "CXX: $CXX"
echo "MGE_INSTALL_PATH: $MGE_INSTALL_PATH"
echo "INCLUDE_FLAG: $INCLUDE_FLAG"
echo "LINK_FLAG: $LINK_FLAG"
echo "BUILD_FLAG: $BUILD_FLAG"

echo "[" > compile_commands.json
echo "{" >> compile_commands.json
echo "\"directory\": \"$PWD\"," >> compile_commands.json
echo "\"command\": \"$CXX yolox.cpp -o yolox ${INCLUDE_FLAG} ${LINK_FLAG}\"," >> compile_commands.json
echo "\"file\": \"$PWD/yolox.cpp\"," >> compile_commands.json
echo "}," >> compile_commands.json
echo "]" >> compile_commands.json
$CXX yolox.cpp -o yolox ${INCLUDE_FLAG} ${LINK_FLAG} ${BUILD_FLAG}

echo "build success, output file: yolox"
if [[ $CXX =~ "android" ]]; then
    echo "try command to run:"
    echo "adb push/scp $MGE_INSTALL_PATH/lib/libmegengine.so android_phone"
    echo "adb push/scp $OPENCV_INSTALL_LIB_PATH/*.so android_phone"
    echo "adb push/scp ./yolox yolox_s.mge android_phone"
    echo "adb push/scp ../../../assets/dog.jpg android_phone"
    echo "adb/ssh to android_phone, then run: LD_LIBRARY_PATH=. ./yolox yolox_s.mge dog.jpg cpu/multithread <warmup_count> <thread_number> <use_fast_run> <use_weight_preprocess>"
else
    echo "try command to run: LD_LIBRARY_PATH=$MGE_INSTALL_PATH/lib/:$OPENCV_INSTALL_LIB_PATH ./yolox yolox_s.mge ../../../assets/dog.jpg cuda/cpu/multithread <warmup_count> <thread_number> <use_fast_run> <use_weight_preprocess>"
fi


================================================
FILE: demo/MegEngine/cpp/yolox.cpp
================================================
// Copyright (C) 2018-2021 Intel Corporation
// SPDX-License-Identifier: Apache-2.0

#include "megbrain/gopt/inference.h"
#include "megbrain/opr/search_policy/algo_chooser_helper.h"
#include "megbrain/serialization/serializer.h"
#include <iostream>
#include <iterator>
#include <memory>
#include <opencv2/opencv.hpp>
#include <stdlib.h>
#include <string>
#include <vector>

/**
 * @brief Define names based depends on Unicode path support
 */
#define NMS_THRESH 0.45
#define BBOX_CONF_THRESH 0.25

constexpr int INPUT_W = 640;
constexpr int INPUT_H = 640;

using namespace mgb;

cv::Mat static_resize(cv::Mat &img) {
  float r = std::min(INPUT_W / (img.cols * 1.0), INPUT_H / (img.rows * 1.0));
  int unpad_w = r * img.cols;
  int unpad_h = r * img.rows;
  cv::Mat re(unpad_h, unpad_w, CV_8UC3);
  cv::resize(img, re, re.size());
  cv::Mat out(INPUT_W, INPUT_H, CV_8UC3, cv::Scalar(114, 114, 114));
  re.copyTo(out(cv::Rect(0, 0, re.cols, re.rows)));
  return out;
}

void blobFromImage(cv::Mat &img, float *blob_data) {
  int channels = 3;
  int img_h = img.rows;
  int img_w = img.cols;
  for (size_t c = 0; c < channels; c++) {
    for (size_t h = 0; h < img_h; h++) {
      for (size_t w = 0; w < img_w; w++) {
        blob_data[c * img_w * img_h + h * img_w + w] =
            (float)img.at<cv::Vec3b>(h, w)[c];
      }
    }
  }
}

struct Object {
  cv::Rect_<float> rect;
  int label;
  float prob;
};

struct GridAndStride {
  int grid0;
  int grid1;
  int stride;
};

static void
generate_grids_and_stride(const int target_size, std::vector<int> &strides,
                          std::vector<GridAndStride> &grid_strides) {
  for (auto stride : strides) {
    int num_grid = target_size / stride;
    for (int g1 = 0; g1 < num_grid; g1++) {
      for (int g0 = 0; g0 < num_grid; g0++) {
        grid_strides.push_back((GridAndStride){g0, g1, stride});
      }
    }
  }
}

static void generate_yolox_proposals(std::vector<GridAndStride> grid_strides,
                                     const float *feat_ptr,
                                     float prob_threshold,
                                     std::vector<Object> &objects) {
  const int num_class = 80;
  const int num_anchors = grid_strides.size();

  for (int anchor_idx = 0; anchor_idx < num_anchors; anchor_idx++) {
    const int grid0 = grid_strides[anchor_idx].grid0;
    const int grid1 = grid_strides[anchor_idx].grid1;
    const int stride = grid_strides[anchor_idx].stride;

    const int basic_pos = anchor_idx * 85;

    float x_center = (feat_ptr[basic_pos + 0] + grid0) * stride;
    float y_center = (feat_ptr[basic_pos + 1] + grid1) * stride;
    float w = exp(feat_ptr[basic_pos + 2]) * stride;
    float h = exp(feat_ptr[basic_pos + 3]) * stride;
    float x0 = x_center - w * 0.5f;
    float y0 = y_center - h * 0.5f;

    float box_objectness = feat_ptr[basic_pos + 4];
    for (int class_idx = 0; class_idx < num_class; class_idx++) {
      float box_cls_score = feat_ptr[basic_pos + 5 + class_idx];
      float box_prob = box_objectness * box_cls_score;
      if (box_prob > prob_threshold) {
        Object obj;
        obj.rect.x = x0;
        obj.rect.y = y0;
        obj.rect.width = w;
        obj.rect.height = h;
        obj.label = class_idx;
        obj.prob = box_prob;

        objects.push_back(obj);
      }

    } // class loop

  } // point anchor loop
}

static inline float intersection_area(const Object &a, const Object &b) {
  cv::Rect_<float> inter = a.rect & b.rect;
  return inter.area();
}

static void qsort_descent_inplace(std::vector<Object> &faceobjects, int left,
                                  int right) {
  int i = left;
  int j = right;
  float p = faceobjects[(left + right) / 2].prob;

  while (i <= j) {
    while (faceobjects[i].prob > p)
      i++;

    while (faceobjects[j].prob < p)
      j--;

    if (i <= j) {
      // swap
      std::swap(faceobjects[i], faceobjects[j]);

      i++;
      j--;
    }
  }

#pragma omp parallel sections
  {
#pragma omp section
    {
      if (left < j)
        qsort_descent_inplace(faceobjects, left, j);
    }
#pragma omp section
    {
      if (i < right)
        qsort_descent_inplace(faceobjects, i, right);
    }
  }
}

static void qsort_descent_inplace(std::vector<Object> &objects) {
  if (objects.empty())
    return;

  qsort_descent_inplace(objects, 0, objects.size() - 1);
}

static void nms_sorted_bboxes(const std::vector<Object> &faceobjects,
                              std::vector<int> &picked, float nms_threshold) {
  picked.clear();

  const int n = faceobjects.size();

  std::vector<float> areas(n);
  for (int i = 0; i < n; i++) {
    areas[i] = faceobjects[i].rect.area();
  }

  for (int i = 0; i < n; i++) {
    const Object &a = faceobjects[i];

    int keep = 1;
    for (int j = 0; j < (int)picked.size(); j++) {
      const Object &b = faceobjects[picked[j]];

      // intersection over union
      float inter_area = intersection_area(a, b);
      float union_area = areas[i] + areas[picked[j]] - inter_area;
      // float IoU = inter_area / union_area
      if (inter_area / union_area > nms_threshold)
        keep = 0;
    }

    if (keep)
      picked.push_back(i);
  }
}

static void decode_outputs(const float *prob, std::vector<Object> &objects,
                           float scale, const int img_w, const int img_h) {
  std::vector<Object> proposals;
  std::vector<int> strides = {8, 16, 32};
  std::vector<GridAndStride> grid_strides;

  generate_grids_and_stride(INPUT_W, strides, grid_strides);
  generate_yolox_proposals(grid_strides, prob, BBOX_CONF_THRESH, proposals);
  qsort_descent_inplace(proposals);

  std::vector<int> picked;
  nms_sorted_bboxes(proposals, picked, NMS_THRESH);
  int count = picked.size();
  objects.resize(count);

  for (int i = 0; i < count; i++) {
    objects[i] = proposals[picked[i]];

    // adjust offset to original unpadded
    float x0 = (objects[i].rect.x) / scale;
    float y0 = (objects[i].rect.y) / scale;
    float x1 = (objects[i].rect.x + objects[i].rect.width) / scale;
    float y1 = (objects[i].rect.y + objects[i].rect.height) / scale;

    // clip
    x0 = std::max(std::min(x0, (float)(img_w - 1)), 0.f);
    y0 = std::max(std::min(y0, (float)(img_h - 1)), 0.f);
    x1 = std::max(std::min(x1, (float)(img_w - 1)), 0.f);
    y1 = std::max(std::min(y1, (float)(img_h - 1)), 0.f);

    objects[i].rect.x = x0;
    objects[i].rect.y = y0;
    objects[i].rect.width = x1 - x0;
    objects[i].rect.height = y1 - y0;
  }
}

const float color_list[80][3] = {
    {0.000, 0.447, 0.741}, {0.850, 0.325, 0.098}, {0.929, 0.694, 0.125},
    {0.494, 0.184, 0.556}, {0.466, 0.674, 0.188}, {0.301, 0.745, 0.933},
    {0.635, 0.078, 0.184}, {0.300, 0.300, 0.300}, {0.600, 0.600, 0.600},
    {1.000, 0.000, 0.000}, {1.000, 0.500, 0.000}, {0.749, 0.749, 0.000},
    {0.000, 1.000, 0.000}, {0.000, 0.000, 1.000}, {0.667, 0.000, 1.000},
    {0.333, 0.333, 0.000}, {0.333, 0.667, 0.000}, {0.333, 1.000, 0.000},
    {0.667, 0.333, 0.000}, {0.667, 0.667, 0.000}, {0.667, 1.000, 0.000},
    {1.000, 0.333, 0.000}, {1.000, 0.667, 0.000}, {1.000, 1.000, 0.000},
    {0.000, 0.333, 0.500}, {0.000, 0.667, 0.500}, {0.000, 1.000, 0.500},
    {0.333, 0.000, 0.500}, {0.333, 0.333, 0.500}, {0.333, 0.667, 0.500},
    {0.333, 1.000, 0.500}, {0.667, 0.000, 0.500}, {0.667, 0.333, 0.500},
    {0.667, 0.667, 0.500}, {0.667, 1.000, 0.500}, {1.000, 0.000, 0.500},
    {1.000, 0.333, 0.500}, {1.000, 0.667, 0.500}, {1.000, 1.000, 0.500},
    {0.000, 0.333, 1.000}, {0.000, 0.667, 1.000}, {0.000, 1.000, 1.000},
    {0.333, 0.000, 1.000}, {0.333, 0.333, 1.000}, {0.333, 0.667, 1.000},
    {0.333, 1.000, 1.000}, {0.667, 0.000, 1.000}, {0.667, 0.333, 1.000},
    {0.667, 0.667, 1.000}, {0.667, 1.000, 1.000}, {1.000, 0.000, 1.000},
    {1.000, 0.333, 1.000}, {1.000, 0.667, 1.000}, {0.333, 0.000, 0.000},
    {0.500, 0.000, 0.000}, {0.667, 0.000, 0.000}, {0.833, 0.000, 0.000},
    {1.000, 0.000, 0.000}, {0.000, 0.167, 0.000}, {0.000, 0.333, 0.000},
    {0.000, 0.500, 0.000}, {0.000, 0.667, 0.000}, {0.000, 0.833, 0.000},
    {0.000, 1.000, 0.000}, {0.000, 0.000, 0.167}, {0.000, 0.000, 0.333},
    {0.000, 0.000, 0.500}, {0.000, 0.000, 0.667}, {0.000, 0.000, 0.833},
    {0.000, 0.000, 1.000}, {0.000, 0.000, 0.000}, {0.143, 0.143, 0.143},
    {0.286, 0.286, 0.286}, {0.429, 0.429, 0.429}, {0.571, 0.571, 0.571},
    {0.714, 0.714, 0.714}, {0.857, 0.857, 0.857}, {0.000, 0.447, 0.741},
    {0.314, 0.717, 0.741}, {0.50, 0.5, 0}};

static void draw_objects(const cv::Mat &bgr,
                         const std::vector<Object> &objects) {
  static const char *class_names[] = {
      "person",        "bicycle",      "car",
      "motorcycle",    "airplane",     "bus",
      "train",         "truck",        "boat",
      "traffic light", "fire hydrant", "stop sign",
      "parking meter", "bench",        "bird",
      "cat",           "dog",          "horse",
      "sheep",         "cow",          "elephant",
      "bear",          "zebra",        "giraffe",
      "backpack",      "umbrella",     "handbag",
      "tie",           "suitcase",     "frisbee",
      "skis",          "snowboard",    "sports ball",
      "kite",          "baseball bat", "baseball glove",
      "skateboard",    "surfboard",    "tennis racket",
      "bottle",        "wine glass",   "cup",
      "fork",          "knife",        "spoon",
      "bowl",          "banana",       "apple",
      "sandwich",      "orange",       "broccoli",
      "carrot",        "hot dog",      "pizza",
      "donut",         "cake",         "chair",
      "couch",         "potted plant", "bed",
      "dining table",  "toilet",       "tv",
      "laptop",        "mouse",        "remote",
      "keyboard",      "cell phone",   "microwave",
      "oven",          "toaster",      "sink",
      "refrigerator",  "book",         "clock",
      "vase",          "scissors",     "teddy bear",
      "hair drier",    "toothbrush"};

  cv::Mat image = bgr.clone();

  for (size_t i = 0; i < objects.size(); i++) {
    const Object &obj = objects[i];

    fprintf(stderr, "%d = %.5f at %.2f %.2f %.2f x %.2f\n", obj.label, obj.prob,
            obj.rect.x, obj.rect.y, obj.rect.width, obj.rect.height);

    cv::Scalar color =
        cv::Scalar(color_list[obj.label][0], color_list[obj.label][1],
                   color_list[obj.label][2]);
    float c_mean = cv::mean(color)[0];
    cv::Scalar txt_color;
    if (c_mean > 0.5) {
      txt_color = cv::Scalar(0, 0, 0);
    } else {
      txt_color = cv::Scalar(255, 255, 255);
    }

    cv::rectangle(image, obj.rect, color * 255, 2);

    char text[256];
    sprintf(text, "%s %.1f%%", class_names[obj.label], obj.prob * 100);

    int baseLine = 0;
    cv::Size label_size =
        cv::getTextSize(text, cv::FONT_HERSHEY_SIMPLEX, 0.4, 1, &baseLine);

    cv::Scalar txt_bk_color = color * 0.7 * 255;

    int x = obj.rect.x;
    int y = obj.rect.y + 1;
    // int y = obj.rect.y - label_size.height - baseLine;
    if (y > image.rows)
      y = image.rows;
    // if (x + label_size.width > image.cols)
    // x = image.cols - label_size.width;

    cv::rectangle(
        image,
        cv::Rect(cv::Point(x, y),
                 cv::Size(label_size.width, label_size.height + baseLine)),
        txt_bk_color, -1);

    cv::putText(image, text, cv::Point(x, y + label_size.height),
                cv::FONT_HERSHEY_SIMPLEX, 0.4, txt_color, 1);
  }

  cv::imwrite("out.jpg", image);
  std::cout << "save output to out.jpg" << std::endl;
}

cg::ComputingGraph::OutputSpecItem make_callback_copy(SymbolVar dev,
                                                      HostTensorND &host) {
  auto cb = [&host](DeviceTensorND &d) { host.copy_from(d); };
  return {dev, cb};
}

int main(int argc, char *argv[]) {
  serialization::GraphLoader::LoadConfig load_config;
  load_config.comp_graph = ComputingGraph::make();
  auto &&graph_opt = load_config.comp_graph->options();
  graph_opt.graph_opt_level = 0;

  if (argc != 9) {
    std::cout << "Usage : " << argv[0]
              << " <path_to_model> <path_to_image> <device> <warmup_count> "
                 "<thread_number> <use_fast_run> <use_weight_preprocess> "
                 "<run_with_fp16>"
              << std::endl;
    return EXIT_FAILURE;
  }

  const std::string input_model{argv[1]};
  const std::string input_image_path{argv[2]};
  const std::string device{argv[3]};
  const size_t warmup_count = atoi(argv[4]);
  const size_t thread_number = atoi(argv[5]);
  const size_t use_fast_run = atoi(argv[6]);
  const size_t use_weight_preprocess = atoi(argv[7]);
  const size_t run_with_fp16 = atoi(argv[8]);

  if (device == "cuda") {
    load_config.comp_node_mapper = [](CompNode::Locator &loc) {
      loc.type = CompNode::DeviceType::CUDA;
    };
  } else if (device == "cpu") {
    load_config.comp_node_mapper = [](CompNode::Locator &loc) {
      loc.type = CompNode::DeviceType::CPU;
    };
  } else if (device == "multithread") {
    load_config.comp_node_mapper = [thread_number](CompNode::Locator &loc) {
      loc.type = CompNode::DeviceType::MULTITHREAD;
      loc.device = 0;
      loc.stream = thread_number;
    };
    std::cout << "use " << thread_number << " thread" << std::endl;
  } else {
    std::cout << "device only support cuda or cpu or multithread" << std::endl;
    return EXIT_FAILURE;
  }

  if (use_weight_preprocess) {
    std::cout << "use weight preprocess" << std::endl;
    graph_opt.graph_opt.enable_weight_preprocess();
  }
  if (run_with_fp16) {
    std::cout << "run with fp16" << std::endl;
    graph_opt.graph_opt.enable_f16_io_comp();
  }

  if (device == "cuda") {
    std::cout << "choose format for cuda" << std::endl;
  } else {
    std::cout << "choose format for non-cuda" << std::endl;
#if defined(__arm__) || defined(__aarch64__)
    if (run_with_fp16) {
      std::cout << "use chw format when enable fp16" << std::endl;
    } else {
      std::cout << "choose format for nchw44 for aarch64" << std::endl;
      graph_opt.graph_opt.enable_nchw44();
    }
#endif
#if defined(__x86_64__) || defined(__amd64__) || defined(__i386__)
    // graph_opt.graph_opt.enable_nchw88();
#endif
  }

  std::unique_ptr<serialization::InputFile> inp_file =
      serialization::InputFile::make_fs(input_model.c_str());
  auto loader = serialization::GraphLoader::make(std::move(inp_file));
  serialization::GraphLoader::LoadResult network =
      loader->load(load_config, false);

  if (use_fast_run) {
    std::cout << "use fastrun" << std::endl;
    using S = opr::mixin::AlgoChooserHelper::ExecutionPolicy::Strategy;
    S strategy = static_cast<S>(0);
    strategy = S::PROFILE | S::OPTIMIZED | strategy;
    mgb::gopt::modify_opr_algo_strategy_inplace(network.output_var_list,
                                                strategy);
  }

  auto data = network.tensor_map["data"];
  cv::Mat image = cv::imread(input_image_path);
  cv::Mat pr_img = static_resize(image);
  float *data_ptr = data->resize({1, 3, 640, 640}).ptr<float>();
  blobFromImage(pr_img, data_ptr);
  HostTensorND predict;
  std::unique_ptr<cg::AsyncExecutable> func = network.graph->compile(
      {make_callback_copy(network.output_var_map.begin()->second, predict)});

  for (auto i = 0; i < warmup_count; i++) {
    std::cout << "warmup: " << i << std::endl;
    func->execute();
    func->wait();
  }
  auto start = std::chrono::system_clock::now();
  func->execute();
  func->wait();
  auto end = std::chrono::system_clock::now();
  std::chrono::duration<double> exec_seconds = end - start;
  std::cout << "elapsed time: " << exec_seconds.count() << "s" << std::endl;

  float *predict_ptr = predict.ptr<float>();
  int img_w = image.cols;
  int img_h = image.rows;
  float scale =
      std::min(INPUT_W / (image.cols * 1.0), INPUT_H / (image.rows * 1.0));
  std::vector<Object> objects;

  decode_outputs(predict_ptr, objects, scale, img_w, img_h);
  draw_objects(image, objects);

  return EXIT_SUCCESS;
}


================================================
FILE: demo/MegEngine/python/README.md
================================================
# YOLOX-Python-MegEngine

Python version of YOLOX object detection base on [MegEngine](https://github.com/MegEngine/MegEngine).

## Tutorial

### Step1: install requirements

```
python3 -m pip install megengine -f https://megengine.org.cn/whl/mge.html
```

### Step2: convert checkpoint weights from torch's path file

```
python3 convert_weights.py -w yolox_s.pth -o yolox_s_mge.pkl
```

### Step3: run demo

This part is the same as torch's python demo, but no need to specify device.

```
python3 demo.py image -n yolox-s -c yolox_s_mge.pkl --path ../../../assets/dog.jpg --conf 0.25 --nms 0.45 --tsize 640 --save_result
```

###  [Optional]Step4: dump model for cpp inference

> **Note**: result model is dumped with `optimize_for_inference` and `enable_fuse_conv_bias_nonlinearity`.

```
python3 dump.py -n yolox-s -c yolox_s_mge.pkl --dump_path yolox_s.mge
```


================================================
FILE: demo/MegEngine/python/build.py
================================================
#!/usr/bin/env python3
# -*- coding:utf-8 -*-

import megengine as mge
import megengine.module as M

from models.yolo_fpn import YOLOFPN
from models.yolo_head import YOLOXHead
from models.yolo_pafpn import YOLOPAFPN
from models.yolox import YOLOX


def build_yolox(name="yolox-s"):
    num_classes = 80

    # value meaning: depth, width
    param_dict = {
        "yolox-nano": (0.33, 0.25),
        "yolox-tiny": (0.33, 0.375),
        "yolox-s": (0.33, 0.50),
        "yolox-m": (0.67, 0.75),
        "yolox-l": (1.0, 1.0),
        "yolox-x": (1.33, 1.25),
    }
    if name == "yolov3":
        depth = 1.0
        width = 1.0
        backbone = YOLOFPN()
        head = YOLOXHead(num_classes, width, in_channels=[128, 256, 512], act="lrelu")
        model = YOLOX(backbone, head)
    else:
        assert name in param_dict
        kwargs = {}
        depth, width = param_dict[name]
        if name == "yolox-nano":
            kwargs["depthwise"] = True
        in_channels = [256, 512, 1024]
        backbone = YOLOPAFPN(depth, width, in_channels=in_channels, **kwargs)
        head = YOLOXHead(num_classes, width, in_channels=in_channels, **kwargs)
        model = YOLOX(backbone, head)

    for m in model.modules():
        if isinstance(m, M.BatchNorm2d):
            m.eps = 1e-3

    return model


def build_and_load(weight_file, name="yolox-s"):
    model = build_yolox(name)
    model_weights = mge.load(weight_file)
    model.load_state_dict(model_weights, strict=False)
    return model


================================================
FILE: demo/MegEngine/python/convert_weights.py
================================================
#!/usr/bin/env python3
# -*- coding:utf-8 -*-
import argparse
from collections import OrderedDict

import megengine as mge
import torch


def make_parser():
    parser = argparse.ArgumentParser()
    parser.add_argument("-w", "--weights", type=str, help="path of weight file")
    parser.add_argument(
        "-o",
        "--output",
        default="weight_mge.pkl",
        type=str,
        help="path of weight file",
    )
    return parser


def numpy_weights(weight_file):
    torch_weights = torch.load(weight_file, map_location="cpu")
    if "model" in torch_weights:
        torch_weights = torch_weights["model"]
    new_dict = OrderedDict()
    for k, v in torch_weights.items():
        new_dict[k] = v.cpu().numpy()
    return new_dict


def map_weights(weight_file, output_file):
    torch_weights = numpy_weights(weight_file)

    new_dict = OrderedDict()
    for k, v in torch_weights.items():
        if "num_batches_tracked" in k:
            print("drop: {}".format(k))
            continue
        if k.endswith("bias"):
            print("bias key: {}".format(k))
            v = v.reshape(1, -1, 1, 1)
            new_dict[k] = v
        elif "dconv" in k and "conv.weight" in k:
            print("depthwise conv key: {}".format(k))
            cout, cin, k1, k2 = v.shape
            v = v.reshape(cout, 1, cin, k1, k2)
            new_dict[k] = v
        else:
            new_dict[k] = v

    mge.save(new_dict, output_file)
    print("save weights to {}".format(output_file))


def main():
    parser = make_parser()
    args = parser.parse_args()
    map_weights(args.weights, args.output)


if __name__ == "__main__":
    main()


================================================
FILE: demo/MegEngine/python/demo.py
================================================
#!/usr/bin/env python3
# -*- coding:utf-8 -*-
# Copyright (c) Megvii, Inc. and its affiliates.

import argparse
import os
import time

import cv2
import megengine as mge
import megengine.functional as F
from loguru import logger

from yolox.data.datasets import COCO_CLASSES
from yolox.utils import vis
from yolox.data.data_augment import preproc as preprocess

from build import build_and_load

IMAGE_EXT = [".jpg", ".jpeg", ".webp", ".bmp", ".png"]


def make_parser():
    parser = argparse.ArgumentParser("YOLOX Demo!")
    parser.add_argument(
        "demo", default="image", help="demo type, eg. image, video and webcam"
    )
    parser.add_argument("-n", "--name", type=str, default="yolox-s", help="model name")
    parser.add_argument("--path", default="./test.png", help="path to images or video")
    parser.add_argument("--camid", type=int, default=0, help="webcam demo camera id")
    parser.add_argument(
        "--save_result",
        action="store_true",
        help="whether to save the inference result of image/video",
    )

    parser.add_argument("-c", "--ckpt", default=None, type=str, help="ckpt for eval")
    parser.add_argument("--conf", default=None, type=float, help="test conf")
    parser.add_argument("--nms", default=None, type=float, help="test nms threshold")
    parser.add_argument("--tsize", default=None, type=int, help="test img size")
    return parser


def get_image_list(path):
    image_names = []
    for maindir, subdir, file_name_list in os.walk(path):
        for filename in file_name_list:
            apath = os.path.join(maindir, filename)
            ext = os.path.splitext(apath)[1]
            if ext in IMAGE_EXT:
                image_names.append(apath)
    return image_names


def postprocess(prediction, num_classes, conf_thre=0.7, nms_thre=0.45):
    box_corner = F.zeros_like(prediction)
    box_corner[:, :, 0] = prediction[:, :, 0] - prediction[:, :, 2] / 2
    box_corner[:, :, 1] = prediction[:, :, 1] - prediction[:, :, 3] / 2
    box_corner[:, :, 2] = prediction[:, :, 0] + prediction[:, :, 2] / 2
    box_corner[:, :, 3] = prediction[:, :, 1] + prediction[:, :, 3] / 2
    prediction[:, :, :4] = box_corner[:, :, :4]

    output = [None for _ in range(len(prediction))]
    for i, image_pred in enumerate(prediction):

        # If none are remaining => process next image
        if not image_pred.shape[0]:
            continue
        # Get score and class with highest confidence
        class_conf = F.max(image_pred[:, 5: 5 + num_classes], 1, keepdims=True)
        class_pred = F.argmax(image_pred[:, 5: 5 + num_classes], 1, keepdims=True)

        class_conf_squeeze = F.squeeze(class_conf)
        conf_mask = image_pred[:, 4] * class_conf_squeeze >= conf_thre
        detections = F.concat((image_pred[:, :5], class_conf, class_pred), 1)
        detections = detections[conf_mask]
        if not detections.shape[0]:
            continue

        nms_out_index = F.vision.nms(
            detections[:, :4], detections[:, 4] * detections[:, 5], nms_thre,
        )
        detections = detections[nms_out_index]
        if output[i] is None:
            output[i] = detections
        else:
            output[i] = F.concat((output[i], detections))

    return output


class Predictor(object):
    def __init__(
        self,
        model,
        confthre=0.01,
        nmsthre=0.65,
        test_size=(640, 640),
        cls_names=COCO_CLASSES,
        trt_file=None,
        decoder=None,
    ):
        self.model = model
        self.cls_names = cls_names
        self.decoder = decoder
        self.num_classes = 80
        self.confthre = confthre
        self.nmsthre = nmsthre
        self.test_size = test_size

    def inference(self, img):
        img_info = {"id": 0}
        if isinstance(img, str):
            img_info["file_name"] = os.path.basename(img)
            img = cv2.imread(img)
            if img is None:
                raise ValueError("test image path is invalid!")
        else:
            img_info["file_name"] = None

        height, width = img.shape[:2]
        img_info["height"] = height
        img_info["width"] = width
        img_info["raw_img"] = img

        img, ratio = preprocess(img, self.test_size)
        img_info["ratio"] = ratio
        img = F.expand_dims(mge.tensor(img), 0)

        t0 = time.time()
        outputs = self.model(img)
        outputs = postprocess(outputs, self.num_classes, self.confthre, self.nmsthre)
        logger.info("Infer time: {:.4f}s".format(time.time() - t0))
        return outputs, img_info

    def visual(self, output, img_info, cls_conf=0.35):
        ratio = img_info["ratio"]
        img = img_info["raw_img"]
        if output is None:
            return img
        output = output.numpy()

        # preprocessing: resize
        bboxes = output[:, 0:4] / ratio

        cls = output[:, 6]
        scores = output[:, 4] * output[:, 5]

        vis_res = vis(img, bboxes, scores, cls, cls_conf, self.cls_names)
        return vis_res


def image_demo(predictor, vis_folder, path, current_time, save_result):
    if os.path.isdir(path):
        files = get_image_list(path)
    else:
        files = [path]
    files.sort()
    for image_name in files:
        outputs, img_info = predictor.inference(image_name)
        result_image = predictor.visual(outputs[0], img_info)
        if save_result:
            save_folder = os.path.join(
                vis_folder, time.strftime("%Y_%m_%d_%H_%M_%S", current_time)
            )
            os.makedirs(save_folder, exist_ok=True)
            save_file_name = os.path.join(save_folder, os.path.basename(image_name))
            logger.info("Saving detection result in {}".format(save_file_name))
            cv2.imwrite(save_file_name, result_image)
        ch = cv2.waitKey(0)
        if ch == 27 or ch == ord("q") or ch == ord("Q"):
            break


def imageflow_demo(predictor, vis_folder, current_time, args):
    cap = cv2.VideoCapture(args.path if args.demo == "video" else args.camid)
    width = cap.get(cv2.CAP_PROP_FRAME_WIDTH)  # float
    height = cap.get(cv2.CAP_PROP_FRAME_HEIGHT)  # float
    fps = cap.get(cv2.CAP_PROP_FPS)
    save_folder = os.path.join(
        vis_folder, time.strftime("%Y_%m_%d_%H_%M_%S", current_time)
    )
    os.makedirs(save_folder, exist_ok=True)
    if args.demo == "video":
        save_path = os.path.join(save_folder, os.path.basename(args.path))
    else:
        save_path = os.path.join(save_folder, "camera.mp4")
    logger.info(f"video save_path is {save_path}")
    vid_writer = cv2.VideoWriter(
        save_path, cv2.VideoWriter_fourcc(*"mp4v"), fps, (int(width), int(height))
    )
    while True:
        ret_val, frame = cap.read()
        if ret_val:
            outputs, img_info = predictor.inference(frame)
            result_frame = predictor.visual(outputs[0], img_info)
            if args.save_result:
                vid_writer.write(result_frame)
            ch = cv2.waitKey(1)
            if ch == 27 or ch == ord("q") or ch == ord("Q"):
                break
        else:
            break


def main(args):
    file_name = os.path.join("./yolox_outputs", args.name)
    os.makedirs(file_name, exist_ok=True)

    if args.save_result:
        vis_folder = os.path.join(file_name, "vis_res")
        os.makedirs(vis_folder, exist_ok=True)

    confthre = 0.01
    nmsthre = 0.65
    test_size = (640, 640)
    if args.conf is not None:
        confthre = args.conf
    if args.nms is not None:
        nmsthre = args.nms
    if args.tsize is not None:
        test_size = (args.tsize, args.tsize)

    model = build_and_load(args.ckpt, name=args.name)
    model.eval()

    predictor = Predictor(model, confthre, nmsthre, test_size, COCO_CLASSES, None, None)
    current_time = time.localtime()
    if args.demo == "image":
        image_demo(predictor, vis_folder, args.path, current_time, args.save_result)
    elif args.demo == "video" or args.demo == "webcam":
        imageflow_demo(predictor, vis_folder, current_time, args)


if __name__ == "__main__":
    args = make_parser().parse_args()
    main(args)


================================================
FILE: demo/MegEngine/python/dump.py
================================================
#!/usr/bin/env python3
# -*- coding:utf-8 -*-
# Copyright (c) Megvii, Inc. and its affiliates.

import argparse

import megengine as mge
import numpy as np
from megengine import jit

from build import build_and_load


def make_parser():
    parser = argparse.ArgumentParser("YOLOX Demo Dump")
    parser.add_argument("-n", "--name", type=str, default="yolox-s", help="model name")
    parser.add_argument("-c", "--ckpt", default=None, type=str, help="ckpt for eval")
    parser.add_argument(
        "--dump_path", default="model.mge", help="path to save the dumped model"
    )
    return parser


def dump_static_graph(model, graph_name="model.mge"):
    model.eval()
    model.head.decode_in_inference = False

    data = mge.Tensor(np.random.random((1, 3, 640, 640)))

    @jit.trace(capture_as_const=True)
    def pred_func(data):
        outputs = model(data)
        return outputs

    pred_func(data)
    pred_func.dump(
        graph_name,
        arg_names=["data"],
        optimize_for_inference=True,
        enable_fuse_conv_bias_nonlinearity=True,
    )


def main(args):
    model = build_and_load(args.ckpt, name=args.name)
    dump_static_graph(model, args.dump_path)


if __name__ == "__main__":
    args = make_parser().parse_args()
    main(args)


================================================
FILE: demo/MegEngine/python/models/__init__.py
================================================
#!/usr/bin/env python3
# -*- coding:utf-8 -*-
# Copyright (c) Megvii Inc. All rights reserved.

from .darknet import CSPDarknet, Darknet
from .yolo_fpn import YOLOFPN
from .yolo_head import YOLOXHead
from .yolo_pafpn import YOLOPAFPN
from .yolox import YOLOX


================================================
FILE: demo/MegEngine/python/models/darknet.py
================================================
#!/usr/bin/env python3
# -*- encoding: utf-8 -*-
# Copyright (c) Megvii Inc. All rights reserved.

import megengine.module as M

from .network_blocks import BaseConv, CSPLayer, DWConv, Focus, ResLayer, SPPBottleneck


class Darknet(M.Module):
    # number of blocks from dark2 to dark5.
    depth2blocks = {21: [1, 2, 2, 1], 53: [2, 8, 8, 4]}

    def __init__(
        self, depth, in_channels=3, stem_out_channels=32, out_features=("dark3", "dark4", "dark5"),
    ):
        """
        Args:
            depth (int): depth of darknet used in model, usually use [21, 53] for this param.
            in_channels (int): number of input channels, for example, use 3 for RGB image.
            stem_out_channels (int): number of output channels of darknet stem.
                It decides channels of darknet layer2 to layer5.
            out_features (Tuple[str]): desired output layer name.
        """
        super().__init__()
        assert out_features, "please provide output features of Darknet"
        self.out_features = out_features
        self.stem = M.Sequential(
            BaseConv(in_channels, stem_out_channels, ksize=3, stride=1, act="lrelu"),
            *self.make_group_layer(stem_out_channels, num_blocks=1, stride=2),
        )
        in_channels = stem_out_channels * 2  # 64

        num_blocks = Darknet.depth2blocks[depth]
        # create darknet with `stem_out_channels` and `num_blocks` layers.
        # to make model structure more clear, we don't use `for` statement in python.
        self.dark2 = M.Sequential(*self.make_group_layer(in_channels, num_blocks[0], stride=2))
        in_channels *= 2  # 128
        self.dark3 = M.Sequential(*self.make_group_layer(in_channels, num_blocks[1], stride=2))
        in_channels *= 2  # 256
        self.dark4 = M.Sequential(*self.make_group_layer(in_channels, num_blocks[2], stride=2))
        in_channels *= 2  # 512

        self.dark5 = M.Sequential(
            *self.make_group_layer(in_channels, num_blocks[3], stride=2),
            *self.make_spp_block([in_channels, in_channels * 2], in_channels * 2),
        )

    def make_group_layer(self, in_channels: int, num_blocks: int, stride: int = 1):
        "starts with conv layer then has `num_blocks` `ResLayer`"
        return [
            BaseConv(in_channels, in_channels * 2, ksize=3, stride=stride, act="lrelu"),
            *[(ResLayer(in_channels * 2)) for _ in range(num_blocks)]
        ]

    def make_spp_block(self, filters_list, in_filters):
        m = M.Sequential(
            *[
                BaseConv(in_filters, filters_list[0], 1, stride=1, act="lrelu"),
                BaseConv(filters_list[0], filters_list[1], 3, stride=1, act="lrelu"),
                SPPBottleneck(
                    in_channels=filters_list[1],
                    out_channels=filters_list[0],
                    activation="lrelu"
                ),
                BaseConv(filters_list[0], filters_list[1], 3, stride=1, act="lrelu"),
                BaseConv(filters_list[1], filters_list[0], 1, stride=1, act="lrelu"),
            ]
        )
        return m

    def forward(self, x):
        outputs = {}
        x = self.stem(x)
        outputs["stem"] = x
        x = self.dark2(x)
        outputs["dark2"] = x
        x = self.dark3(x)
        outputs["dark3"] = x
        x = self.dark4(x)
        outputs["dark4"] = x
        x = self.dark5(x)
        outputs["dark5"] = x
        return {k: v for k, v in outputs.items() if k in self.out_features}


class CSPDarknet(M.Module):

    def __init__(
        self, dep_mul, wid_mul,
        out_features=("dark3", "dark4", "dark5"),
        depthwise=False, act="silu",
    ):
        super().__init__()
        assert out_features, "please provide output features of Darknet"
        self.out_features = out_features
        Conv = DWConv if depthwise else BaseConv

        base_channels = int(wid_mul * 64)  # 64
        base_depth = max(round(dep_mul * 3), 1)  # 3

        # stem
        self.stem = Focus(3, base_channels, ksize=3, act=act)

        # dark2
        self.dark2 = M.Sequential(
            Conv(base_channels, base_channels * 2, 3, 2, act=act),
            CSPLayer(
                base_channels * 2, base_channels * 2,
                n=base_depth, depthwise=depthwise, act=act
            ),
        )

        # dark3
        self.dark3 = M.Sequential(
            Conv(base_channels * 2, base_channels * 4, 3, 2, act=act),
            CSPLayer(
                base_channels * 4, base_channels * 4,
                n=base_depth * 3, depthwise=depthwise, act=act,
            ),
        )

        # dark4
        self.dark4 = M.Sequential(
            Conv(base_channels * 4, base_channels * 8, 3, 2, act=act),
            CSPLayer(
                base_channels * 8, base_channels * 8,
                n=base_depth * 3, depthwise=depthwise, act=act,
            ),
        )

        # dark5
        self.dark5 = M.Sequential(
            Conv(base_channels * 8, base_channels * 16, 3, 2, act=act),
            SPPBottleneck(base_channels * 16, base_channels * 16, activation=act),
            CSPLayer(
                base_channels * 16, base_channels * 16, n=base_depth,
                shortcut=False, depthwise=depthwise, act=act,
            ),
        )

    def forward(self, x):
        outputs = {}
        x = self.stem(x)
        outputs["stem"] = x
        x = self.dark2(x)
        outputs["dark2"] = x
        x = self.dark3(x)
        outputs["dark3"] = x
        x = self.dark4(x)
        outputs["dark4"] = x
        x = self.dark5(x)
        outputs["dark5"] = x
        return {k: v for k, v in outputs.items() if k in self.out_features}


================================================
FILE: demo/MegEngine/python/models/network_blocks.py
================================================
#!/usr/bin/env python3
# -*- encoding: utf-8 -*-
# Copyright (c) Megvii Inc. All rights reserved.

import megengine.functional as F
import megengine.module as M


class UpSample(M.Module):

    def __init__(self, scale_factor=2, mode="bilinear"):
        super().__init__()
        self.scale_factor = scale_factor
        self.mode = mode

    def forward(self, x):
        return F.vision.interpolate(x, scale_factor=self.scale_factor, mode=self.mode)


class SiLU(M.Module):
    """export-friendly version of M.SiLU()"""

    @staticmethod
    def forward(x):
        return x * F.sigmoid(x)


def get_activation(name="silu"):
    if name == "silu":
        module = SiLU()
    elif name == "relu":
        module = M.ReLU()
    elif name == "lrelu":
        module = M.LeakyReLU(0.1)
    else:
        raise AttributeError("Unsupported act type: {}".format(name))
    return module


class BaseConv(M.Module):
    """A Conv2d -> Batchnorm -> silu/leaky relu block"""

    def __init__(self, in_channels, out_channels, ksize, stride, groups=1, bias=False, act="silu"):
        super().__init__()
        # same padding
        pad = (ksize - 1) // 2
        self.conv = M.Conv2d(
            in_channels,
            out_channels,
            kernel_size=ksize,
            stride=stride,
            padding=pad,
            groups=groups,
            bias=bias,
        )
        self.bn = M.BatchNorm2d(out_channels)
        self.act = get_activation(act)

    def forward(self, x):
        return self.act(self.bn(self.conv(x)))

    def fuseforward(self, x):
        return self.act(self.conv(x))


class DWConv(M.Module):
    """Depthwise Conv + Conv"""
    def __init__(self, in_channels, out_channels, ksize, stride=1, act="silu"):
        super().__init__()
        self.dconv = BaseConv(
            in_channels, in_channels, ksize=ksize,
            stride=stride, groups=in_channels, act=act
        )
        self.pconv = BaseConv(
            in_channels, out_channels, ksize=1,
            stride=1, groups=1, act=act
        )

    def forward(self, x):
        x = self.dconv(x)
        return self.pconv(x)


class Bottleneck(M.Module):
    # Standard bottleneck
    def __init__(
        self, in_channels, out_channels, shortcut=True,
        expansion=0.5, depthwise=False, act="silu"
    ):
        super().__init__()
        hidden_channels = int(out_channels * expansion)
        Conv = DWConv if depthwise else BaseConv
        self.conv1 = BaseConv(in_channels, hidden_channels, 1, stride=1, act=act)
        self.conv2 = Conv(hidden_channels, out_channels, 3, stride=1, act=act)
        self.use_add = shortcut and in_channels == out_channels

    def forward(self, x):
        y = self.conv2(self.conv1(x))
        if self.use_add:
            y = y + x
        return y


class ResLayer(M.Module):
    "Residual layer with `in_channels` inputs."
    def __init__(self, in_channels: int):
        super().__init__()
        mid_channels = in_channels // 2
        self.layer1 = BaseConv(in_channels, mid_channels, ksize=1, stride=1, act="lrelu")
        self.layer2 = BaseConv(mid_channels, in_channels, ksize=3, stride=1, act="lrelu")

    def forward(self, x):
        out = self.layer2(self.layer1(x))
        return x + out


class SPPBottleneck(M.Module):
    """Spatial pyramid pooling layer used in YOLOv3-SPP"""
    def __init__(self, in_channels, out_channels, kernel_sizes=(5, 9, 13), activation="silu"):
        super().__init__()
        hidden_channels = in_channels // 2
        self.conv1 = BaseConv(in_channels, hidden_channels, 1, stride=1, act=activation)
        self.m = [M.MaxPool2d(kernel_size=ks, stride=1, padding=ks // 2) for ks in kernel_sizes]
        conv2_channels = hidden_channels * (len(kernel_sizes) + 1)
        self.conv2 = BaseConv(conv2_channels, out_channels, 1, stride=1, act=activation)

    def forward(self, x):
        x = self.conv1(x)
        x = F.concat([x] + [m(x) for m in self.m], axis=1)
        x = self.conv2(x)
        return x


class CSPLayer(M.Module):
    """C3 in yolov5, CSP Bottleneck with 3 convolutions"""

    def __init__(
        self, in_channels, out_channels, n=1,
        shortcut=True, expansion=0.5, depthwise=False, act="silu"
    ):
        """
        Args:
            in_channels (int): input channels.
            out_channels (int): output channels.
            n (int): number of Bottlenecks. Default value: 1.
        """
        # ch_in, ch_out, number, shortcut, groups, expansion
        super().__init__()
        hidden_channels = int(out_channels * expansion)  # hidden channels
        self.conv1 = BaseConv(in_channels, hidden_channels, 1, stride=1, act=act)
        self.conv2 = BaseConv(in_channels, hidden_channels, 1, stride=1, act=act)
        self.conv3 = BaseConv(2 * hidden_channels, out_channels, 1, stride=1, act=act)
        module_list = [
            Bottleneck(hidden_channels, hidden_channels, shortcut, 1.0, depthwise, act=act)
            for _ in range(n)
        ]
        self.m = M.Sequential(*module_list)

    def forward(self, x):
        x_1 = self.conv1(x)
        x_2 = self.conv2(x)
        x_1 = self.m(x_1)
        x = F.concat((x_1, x_2), axis=1)
        return self.conv3(x)


class Focus(M.Module):
    """Focus width and height information into channel space."""

    def __init__(self, in_channels, out_channels, ksize=1, stride=1, act="silu"):
        super().__init__()
        self.conv = BaseConv(in_channels * 4, out_channels, ksize, stride, act=act)

    def forward(self, x):
        # shape of x (b,c,w,h) -> y(b,4c,w/2,h/2)
        patch_top_left = x[..., ::2, ::2]
        patch_top_right = x[..., ::2, 1::2]
        patch_bot_left = x[..., 1::2, ::2]
        patch_bot_right = x[..., 1::2, 1::2]
        x = F.concat(
            (patch_top_left, patch_bot_left, patch_top_right, patch_bot_right,), axis=1,
        )
        return self.conv(x)


================================================
FILE: demo/MegEngine/python/models/yolo_fpn.py
================================================
#!/usr/bin/env python3
# -*- encoding: utf-8 -*-
# Copyright (c) Megvii Inc. All rights reserved.

import megengine.functional as F
import megengine.module as M

from .darknet import Darknet
from .network_blocks import BaseConv, UpSample


class YOLOFPN(M.Module):
    """
    YOLOFPN module. Darknet 53 is the default backbone of this model.
    """

    def __init__(
        self, depth=53, in_features=["dark3", "dark4", "dark5"],
    ):
        super().__init__()

        self.backbone = Darknet(depth)
        self.in_features = in_features

        # out 1
        self.out1_cbl = self._make_cbl(512, 256, 1)
        self.out1 = self._make_embedding([256, 512], 512 + 256)

        # out 2
        self.out2_cbl = self._make_cbl(256, 128, 1)
        self.out2 = self._make_embedding([128, 256], 256 + 128)

        # upsample
        self.upsample = UpSample(scale_factor=2, mode="bilinear")

    def _make_cbl(self, _in, _out, ks):
        return BaseConv(_in, _out, ks, stride=1, act="lrelu")

    def _make_embedding(self, filters_list, in_filters):
        m = M.Sequential(
            *[
                self._make_cbl(in_filters, filters_list[0], 1),
                self._make_cbl(filters_list[0], filters_list[1], 3),

                self._make_cbl(filters_list[1], filters_list[0], 1),

                self._make_cbl(filters_list[0], filters_list[1], 3),
                self._make_cbl(filters_list[1], filters_list[0], 1),
            ]
        )
        return m

    def forward(self, inputs):
        """
        Args:
            inputs (Tensor): input image.

        Returns:
            Tuple[Tensor]: FPN output features..
        """
        #  backbone
        out_features = self.backbone(inputs)
        x2, x1, x0 = [out_features[f] for f in self.in_features]

        #  yolo branch 1
        x1_in = self.out1_cbl(x0)
        x1_in = self.upsample(x1_in)
        x1_in = F.concat([x1_in, x1], 1)
        out_dark4 = self.out1(x1_in)

        #  yolo branch 2
        x2_in = self.out2_cbl(out_dark4)
        x2_in = self.upsample(x2_in)
        x2_in = F.concat([x2_in, x2], 1)
        out_dark3 = self.out2(x2_in)

        outputs = (out_dark3, out_dark4, x0)
        return outputs


================================================
FILE: demo/MegEngine/python/models/yolo_head.py
================================================
#!/usr/bin/env python3
# -*- coding:utf-8 -*-
# Copyright (c) Megvii Inc. All rights reserved.

import megengine.functional as F
import megengine.module as M

from .network_blocks import BaseConv, DWConv


def meshgrid(x, y):
    """meshgrid wrapper for megengine"""
    assert len(x.shape) == 1
    assert len(y.shape) == 1
    mesh_shape = (y.shape[0], x.shape[0])
    mesh_x = F.broadcast_to(x, mesh_shape)
    mesh_y = F.broadcast_to(y.reshape(-1, 1), mesh_shape)
    return mesh_x, mesh_y


class YOLOXHead(M.Module):
    def __init__(
        self, num_classes, width=1.0, strides=[8, 16, 32],
        in_channels=[256, 512, 1024], act="silu", depthwise=False
    ):
        """
        Args:
            act (str): activation type of conv. Defalut value: "silu".
            depthwise (bool): whether apply depthwise conv in conv branch. Defalut value: False.
        """
        super().__init__()

        self.n_anchors = 1
        self.num_classes = num_classes
        self.decode_in_inference = True  # save for matching

        self.cls_convs = []
        self.reg_convs = []
        self.cls_preds = []
        self.reg_preds = []
        self.obj_preds = []
        self.stems = []
        Conv = DWConv if depthwise else BaseConv

        for i in range(len(in_channels)):
            self.stems.append(
                BaseConv(
                    in_channels=int(in_channels[i] * width),
                    out_channels=int(256 * width),
                    ksize=1,
                    stride=1,
                    act=act,
                )
            )
            self.cls_convs.append(
                M.Sequential(
                    *[
                        Conv(
                            in_channels=int(256 * width),
                            out_channels=int(256 * width),
                            ksize=3,
                            stride=1,
                            act=act,
                        ),
                        Conv(
                            in_channels=int(256 * width),
                            out_channels=int(256 * width),
                            ksize=3,
                            stride=1,
                            act=act,
                        ),
                    ]
                )
            )
            self.reg_convs.append(
                M.Sequential(
                    *[
                        Conv(
                            in_channels=int(256 * width),
                            out_channels=int(256 * width),
                            ksize=3,
                            stride=1,
                            act=act,
                        ),
                        Conv(
                            in_channels=int(256 * width),
                            out_channels=int(256 * width),
                            ksize=3,
                            stride=1,
                            act=act,
                        ),
                    ]
                )
            )
            self.cls_preds.append(
                M.Conv2d(
                    in_channels=int(256 * width),
                    out_channels=self.n_anchors * self.num_classes,
                    kernel_size=1,
                    stride=1,
                    padding=0,
                )
            )
            self.reg_preds.append(
                M.Conv2d(
                    in_channels=int(256 * width),
                    out_channels=4,
                    kernel_size=1,
                    stride=1,
                    padding=0,
                )
            )
            self.obj_preds.append(
                M.Conv2d(
                    in_channels=int(256 * width),
                    out_channels=self.n_anchors * 1,
                    kernel_size=1,
                    stride=1,
                    padding=0,
                )
            )

        self.use_l1 = False
        self.strides = strides
        self.grids = [F.zeros(1)] * len(in_channels)

    def forward(self, xin, labels=None, imgs=None):
        outputs = []
        assert not self.training

        for k, (cls_conv, reg_conv, stride_this_level, x) in enumerate(
            zip(self.cls_convs, self.reg_convs, self.strides, xin)
        ):
            x = self.stems[k](x)
            cls_x = x
            reg_x = x

            cls_feat = cls_conv(cls_x)
            cls_output = self.cls_preds[k](cls_feat)

            reg_feat = reg_conv(reg_x)
            reg_output = self.reg_preds[k](reg_feat)
            obj_output = self.obj_preds[k](reg_feat)
            output = F.concat([reg_output, F.sigmoid(obj_output), F.sigmoid(cls_output)], 1)
            outputs.append(output)

        self.hw = [x.shape[-2:] for x in outputs]
        # [batch, n_anchors_all, 85]
        outputs = F.concat([F.flatten(x, start_axis=2) for x in outputs], axis=2)
        outputs = F.transpose(outputs, (0, 2, 1))
        if self.decode_in_inference:
            return self.decode_outputs(outputs)
        else:
            return outputs

    def get_output_and_grid(self, output, k, stride, dtype):
        grid = self.grids[k]

        batch_size = output.shape[0]
        n_ch = 5 + self.num_classes
        hsize, wsize = output.shape[-2:]
        if grid.shape[2:4] != output.shape[2:4]:
            yv, xv = meshgrid([F.arange(hsize), F.arange(wsize)])
            grid = F.stack((xv, yv), 2).reshape(1, 1, hsize, wsize, 2).type(dtype)
            self.grids[k] = grid

        output = output.view(batch_size, self.n_anchors, n_ch, hsize, wsize)
        output = (
            output.permute(0, 1, 3, 4, 2)
            .reshape(batch_size, self.n_anchors * hsize * wsize, -1)
        )
        grid = grid.view(1, -1, 2)
        output[..., :2] = (output[..., :2] + grid) * stride
        output[..., 2:4] = F.exp(output[..., 2:4]) * stride
        return output, grid

    def decode_outputs(self, outputs):
        grids = []
        strides = []
        for (hsize, wsize), stride in zip(self.hw, self.strides):
            xv, yv = meshgrid(F.arange(hsize), F.arange(wsize))
            grid = F.stack((xv, yv), 2).reshape(1, -1, 2)
            grids.append(grid)
            shape = grid.shape[:2]
            strides.append(F.full((*shape, 1), stride))

        grids = F.concat(grids, axis=1)
        strides = F.concat(strides, axis=1)

        outputs[..., :2] = (outputs[..., :2] + grids) * strides
        outputs[..., 2:4] = F.exp(outputs[..., 2:4]) * strides
        return outputs


================================================
FILE: demo/MegEngine/python/models/yolo_pafpn.py
================================================
#!/usr/bin/env python3
# -*- encoding: utf-8 -*-
# Copyright (c) Megvii Inc. All rights reserved.

import megengine.module as M
import megengine.functional as F

from .darknet import CSPDarknet
from .network_blocks import BaseConv, CSPLayer, DWConv, UpSample


class YOLOPAFPN(M.Module):
    """
    YOLOv3 model. Darknet 53 is the default backbone of this model.
    """

    def __init__(
        self, depth=1.0, width=1.0, in_features=("dark3", "dark4", "dark5"),
        in_channels=[256, 512, 1024], depthwise=False, act="silu",
    ):
        super().__init__()
        self.backbone = CSPDarknet(depth, width, depthwise=depthwise, act=act)
        self.in_features = in_features
        self.in_channels = in_channels
        Conv = DWConv if depthwise else BaseConv

        self.upsample = UpSample(scale_factor=2, mode="bilinear")
        self.lateral_conv0 = BaseConv(
            int(in_channels[2] * width), int(in_channels[1] * width), 1, 1, act=act
        )
        self.C3_p4 = CSPLayer(
            int(2 * in_channels[1] * width),
            int(in_channels[1] * width),
            round(3 * depth),
            False,
            depthwise=depthwise,
            act=act,
        )  # cat

        self.reduce_conv1 = BaseConv(
            int(in_channels[1] * width), int(in_channels[0] * width), 1, 1, act=act
        )
        self.C3_p3 = CSPLayer(
            int(2 * in_channels[0] * width),
            int(in_channels[0] * width),
            round(3 * depth),
            False,
            depthwise=depthwise,
            act=act,
        )

        # bottom-up conv
        self.bu_conv2 = Conv(
            int(in_channels[0] * width), int(in_channels[0] * width), 3, 2, act=act
        )
        self.C3_n3 = CSPLayer(
            int(2 * in_channels[0] * width),
            int(in_channels[1] * width),
            round(3 * depth),
            False,
            depthwise=depthwise,
            act=act,
        )

        # bottom-up conv
        self.bu_conv1 = Conv(
            int(in_channels[1] * width), int(in_channels[1] * width), 3, 2, act=act
        )
        self.C3_n4 = CSPLayer(
            int(2 * in_channels[1] * width),
            int(in_channels[2] * width),
            round(3 * depth),
            False,
            depthwise=depthwise,
            act=act,
        )

    def forward(self, input):
        """
        Args:
            inputs: input images.

        Returns:
            Tuple[Tensor]: FPN feature.
        """

        #  backbone
        out_features = self.backbone(input)
        features = [out_features[f] for f in self.in_features]
        [x2, x1, x0] = features

        fpn_out0 = self.lateral_conv0(x0)  # 1024->512/32
        f_out0 = self.upsample(fpn_out0)  # 512/16
        f_out0 = F.concat([f_out0, x1], 1)  # 512->1024/16
        f_out0 = self.C3_p4(f_out0)  # 1024->512/16

        fpn_out1 = self.reduce_conv1(f_out0)  # 512->256/16
        f_out1 = self.upsample(fpn_out1)  # 256/8
        f_out1 = F.concat([f_out1, x2], 1)  # 256->512/8
        pan_out2 = self.C3_p3(f_out1)  # 512->256/8

        p_out1 = self.bu_conv2(pan_out2)  # 256->256/16
        p_out1 = F.concat([p_out1, fpn_out1], 1)  # 256->512/16
        pan_out1 = self.C3_n3(p_out1)  # 512->512/16

        p_out0 = self.bu_conv1(pan_out1)  # 512->512/32
        p_out0 = F.concat([p_out0, fpn_out0], 1)  # 512->1024/32
        pan_out0 = self.C3_n4(p_out0)  # 1024->1024/32

        outputs = (pan_out2, pan_out1, pan_out0)
        return outputs


================================================
FILE: demo/MegEngine/python/models/yolox.py
================================================
#!/usr/bin/env python3
# -*- encoding: utf-8 -*-
# Copyright (c) Megvii Inc. All rights reserved.

import megengine.module as M

from .yolo_head import YOLOXHead
from .yolo_pafpn import YOLOPAFPN


class YOLOX(M.Module):
    """
    YOLOX model module. The module list is defined by create_yolov3_modules function.
    The network returns loss values from three YOLO layers during training
    and detection results during test.
    """

    def __init__(self, backbone=None, head=None):
        super().__init__()
        if backbone is None:
            backbone = YOLOPAFPN()
        if head is None:
            head = YOLOXHead(80)

        self.backbone = backbone
        self.head = head

    def forward(self, x):
        # fpn output content features of [dark3, dark4, dark5]
        fpn_outs = self.backbone(x)
        assert not self.training
        outputs = self.head(fpn_outs)

        return outputs


================================================
FILE: demo/ONNXRuntime/README.md
================================================
## YOLOX-ONNXRuntime in Python

This doc introduces how to convert your pytorch model into onnx, and how to run an onnxruntime demo to verify your convertion.

### Step1: Install onnxruntime

run the following command to install onnxruntime:
```shell
pip install onnxruntime
```

### Step2: Get ONNX models

Users might download our pre-generated ONNX models or convert their own models to ONNX.

#### Download ONNX models.

| Model | Parameters | GFLOPs | Test Size | mAP | Weights |
|:------| :----: | :----: | :---: | :---: | :---: |
|  YOLOX-Nano |  0.91M  | 1.08 | 416x416 | 25.8 |[github](https://github.com/Megvii-BaseDetection/YOLOX/releases/download/0.1.1rc0/yolox_nano.onnx) |
|  YOLOX-Tiny | 5.06M     | 6.45 | 416x416 |32.8 | [github](https://github.com/Megvii-BaseDetection/YOLOX/releases/download/0.1.1rc0/yolox_tiny.onnx) |
|  YOLOX-S | 9.0M | 26.8 | 640x640 |40.5 | [github](https://github.com/Megvii-BaseDetection/YOLOX/releases/download/0.1.1rc0/yolox_s.onnx) |
|  YOLOX-M | 25.3M | 73.8 | 640x640 |47.2 | [github](https://github.com/Megvii-BaseDetection/YOLOX/releases/download/0.1.1rc0/yolox_m.onnx) |
|  YOLOX-L | 54.2M | 155.6 | 640x640 |50.1 | [github](https://github.com/Megvii-BaseDetection/YOLOX/releases/download/0.1.1rc0/yolox_l.onnx) |
|  YOLOX-Darknet53| 63.72M | 185.3 | 640x640 |48.0 | [github](https://github.com/Megvii-BaseDetection/YOLOX/releases/download/0.1.1rc0/yolox_darknet.onnx) |
|  YOLOX-X | 99.1M | 281.9 | 640x640 |51.5 | [github](https://github.com/Megvii-BaseDetection/YOLOX/releases/download/0.1.1rc0/yolox_x.onnx) |

#### Convert Your Model to ONNX

First, you should move to <YOLOX_HOME> by:
```shell
cd <YOLOX_HOME>
```
Then, you can:

1. Convert a standard YOLOX model by -n:
```shell
python3 tools/export_onnx.py --output-name yolox_s.onnx -n yolox-s -c yolox_s.pth
```
Notes:
* -n: specify a model name. The model name must be one of the [yolox-s,m,l,x and yolox-nano, yolox-tiny, yolov3]
* -c: the model you have trained
* -o: opset version, default 11. **However, if you will further convert your onnx model to [OpenVINO](https://github.com/Megvii-BaseDetection/YOLOX/demo/OpenVINO/), please specify the opset version to 10.**
* --no-onnxsim: disable onnxsim
* To customize an input shape for onnx model,  modify the following code in tools/export.py:

    ```python
    dummy_input = torch.randn(1, 3, exp.test_size[0], exp.test_size[1])
    ```

1. Convert a standard YOLOX model by -f. When using -f, the above command is equivalent to:

```shell
python3 tools/export_onnx.py --output-name yolox_s.onnx -f exps/default/yolox_s.py -c yolox_s.pth
```

3. To convert your customized model, please use -f:

```shell
python3 tools/export_onnx.py --output-name your_yolox.onnx -f exps/your_dir/your_yolox.py -c your_yolox.pth
```

### Step3: ONNXRuntime Demo

Step1.
```shell
cd <YOLOX_HOME>/demo/ONNXRuntime
```

Step2. 
```shell
python3 onnx_inference.py -m <ONNX_MODEL_PATH> -i <IMAGE_PATH> -o <OUTPUT_DIR> -s 0.3 --input_shape 640,640
```
Notes:
* -m: your converted onnx model
* -i: input_image
* -s: score threshold for visualization.
* --input_shape: should be consistent with the shape you used for onnx convertion.


================================================
FILE: demo/ONNXRuntime/onnx_inference.py
================================================
#!/usr/bin/env python3
# Copyright (c) Megvii, Inc. and its affiliates.

import argparse
import os

import cv2
import numpy as np

import onnxruntime

from yolox.data.data_augment import preproc as preprocess
from yolox.data.datasets import COCO_CLASSES
from yolox.utils import mkdir, multiclass_nms, demo_postprocess, vis


def make_parser():
    parser = argparse.ArgumentParser("onnxruntime inference sample")
    parser.add_argument(
        "-m",
        "--model",
        type=str,
        default="yolox.onnx",
        help="Input your onnx model.",
    )
    parser.add_argument(
        "-i",
        "--image_path",
        type=str,
        default='test_image.png',
        help="Path to your input image.",
    )
    parser.add_argument(
        "-o",
        "--output_dir",
        type=str,
        default='demo_output',
        help="Path to your output directory.",
    )
    parser.add_argument(
        "-s",
        "--score_thr",
        type=float,
        default=0.3,
        help="Score threshould to filter the result.",
    )
    parser.add_argument(
        "--input_shape",
        type=str,
        default="640,640",
        help="Specify an input shape for inference.",
    )
    return parser


if __name__ == '__main__':
    args = make_parser().parse_args()

    input_shape = tuple(map(int, args.input_shape.split(',')))
    origin_img = cv2.imread(args.image_path)
    img, ratio = preprocess(origin_img, input_shape)

    session = onnxruntime.InferenceSession(args.model)

    ort_inputs = {session.get_inputs()[0].name: img[None, :, :, :]}
    output = session.run(None, ort_inputs)
    predictions = demo_postprocess(output[0], input_shape)[0]

    boxes = predictions[:, :4]
    scores = predictions[:, 4:5] * predictions[:, 5:]

    boxes_xyxy = np.ones_like(boxes)
    boxes_xyxy[:, 0] = boxes[:, 0] - boxes[:, 2]/2.
    boxes_xyxy[:, 1] = boxes[:, 1] - boxes[:, 3]/2.
    boxes_xyxy[:, 2] = boxes[:, 0] + boxes[:, 2]/2.
    boxes_xyxy[:, 3] = boxes[:, 1] + boxes[:, 3]/2.
    boxes_xyxy /= ratio
    dets = multiclass_nms(boxes_xyxy, scores, nms_thr=0.45, score_thr=0.1)
    if dets is not None:
        final_boxes, final_scores, final_cls_inds = dets[:, :4], dets[:, 4], dets[:, 5]
        origin_img = vis(origin_img, final_boxes, final_scores, final_cls_inds,
                         conf=args.score_thr, class_names=COCO_CLASSES)

    mkdir(args.output_dir)
    output_path = os.path.join(args.output_dir, os.path.basename(args.image_path))
    cv2.imwrite(output_path, origin_img)


================================================
FILE: demo/OpenVINO/README.md
================================================
## YOLOX for OpenVINO

* [C++ Demo](./cpp)
* [Python Demo](./python)

================================================
FILE: demo/OpenVINO/cpp/CMakeLists.txt
================================================
cmake_minimum_required(VERSION 3.4.1)
set(CMAKE_CXX_STANDARD 14)

project(yolox_openvino_demo)

find_package(OpenCV REQUIRED)
find_package(InferenceEngine REQUIRED)
find_package(ngraph REQUIRED)

include_directories(
    ${OpenCV_INCLUDE_DIRS}
    ${CMAKE_CURRENT_SOURCE_DIR}
    ${CMAKE_CURRENT_BINARY_DIR}
)

add_executable(yolox_openvino yolox_openvino.cpp)

target_link_libraries(
     yolox_openvino
    ${InferenceEngine_LIBRARIES}
    ${NGRAPH_LIBRARIES}
    ${OpenCV_LIBS} 
)

================================================
FILE: demo/OpenVINO/cpp/README.md
================================================
# YOLOX-OpenVINO in C++

This tutorial includes a C++ demo for OpenVINO, as well as some converted models.

### Download OpenVINO models.

| Model | Parameters | GFLOPs | Test Size | mAP | Weights |
|:------| :----: | :----: | :---: | :---: | :---: |
|  [YOLOX-Nano](../../../exps/default/nano.py) |  0.91M  | 1.08 | 416x416 | 25.8 | [github](https://github.com/Megvii-BaseDetection/YOLOX/releases/download/0.1.1rc0/yolox_nano_openvino.tar.gz) |
|  [YOLOX-Tiny](../../../exps/default/yolox_tiny.py) | 5.06M     | 6.45 | 416x416 |32.8 | [github](https://github.com/Megvii-BaseDetection/YOLOX/releases/download/0.1.1rc0/yolox_tiny_openvino.tar.gz) |
|  [YOLOX-S](../../../exps/default/yolox_s.py) | 9.0M | 26.8 | 640x640 |40.5 | [github](https://github.com/Megvii-BaseDetection/YOLOX/releases/download/0.1.1rc0/yolox_s_openvino.tar.gz) |
|  [YOLOX-M](../../../exps/default/yolox_m.py) | 25.3M | 73.8 | 640x640 |47.2 | [github](https://github.com/Megvii-BaseDetection/YOLOX/releases/download/0.1.1rc0/yolox_m_openvino.tar.gz) |
|  [YOLOX-L](../../../exps/default/yolox_l.py) | 54.2M | 155.6 | 640x640 |50.1 | [github](https://github.com/Megvii-BaseDetection/YOLOX/releases/download/0.1.1rc0/yolox_l_openvino.tar.gz) |
|  [YOLOX-Darknet53](../../../exps/default/yolov3.py) | 63.72M | 185.3 | 640x640 |48.0 | [github](https://github.com/Megvii-BaseDetection/YOLOX/releases/download/0.1.1rc0/yolox_dark_openvino.tar.gz) | 
|  [YOLOX-X](../../../exps/default/yolox_x.py) | 99.1M | 281.9 | 640x640 |51.5 | [github](https://github.com/Megvii-BaseDetection/YOLOX/releases/download/0.1.1rc0/yolox_x_openvino.tar.gz) |

## Install OpenVINO Toolkit

Please visit [Openvino Homepage](https://docs.openvinotoolkit.org/latest/get_started_guides.html) for more details.

## Set up the Environment

### For Linux

**Option1. Set up the environment tempororally. You need to run this command everytime you start a new shell window.**

```shell
source /opt/intel/openvino_2021/bin/setupvars.sh
```

**Option2. Set up the environment permenantly.**

*Step1.* For Linux:
```shell
vim ~/.bashrc 
```

*Step2.* Add the following line into your file:

```shell
source /opt/intel/openvino_2021/bin/setupvars.sh
```

*Step3.* Save and exit the file, then run:

```shell
source ~/.bashrc
```


## Convert model

1. Export ONNX model
   
   Please refer to the [ONNX tutorial](../../ONNXRuntime). **Note that you should set --opset to 10, otherwise your next step will fail.**

2. Convert ONNX to OpenVINO 

   ``` shell
   cd <INSTSLL_DIR>/openvino_2021/deployment_tools/model_optimizer
   ```

   Install requirements for convert tool

   ```shell
   sudo ./install_prerequisites/install_prerequisites_onnx.sh
   ```

   Then convert model.
   ```shell
   python3 mo.py --input_model <ONNX_MODEL> --input_shape <INPUT_SHAPE> [--data_type FP16]
   ```
   For example:
   ```shell
   python3 mo.py --input_model yolox_tiny.onnx --input_shape [1,3,416,416] --data_type FP16
   ```  

   Make sure the input shape is consistent with [those](yolox_openvino.cpp#L24-L25) in cpp file. 

## Build 

### Linux
```shell
source /opt/intel/openvino_2021/bin/setupvars.sh
mkdir build
cd build
cmake ..
make
```

## Demo

### c++

```shell
./yolox_openvino <XML_MODEL_PATH> <IMAGE_PATH> <DEVICE>
```


================================================
FILE: demo/OpenVINO/cpp/yolox_openvino.cpp
================================================
// Copyright (C) 2018-2021 Intel Corporation
// SPDX-License-Identifier: Apache-2.0
//

#include <iterator>
#include <memory>
#include <string>
#include <vector>
#include <opencv2/opencv.hpp>
#include <iostream>
#include <inference_engine.hpp>

using namespace InferenceEngine;

/**
 * @brief Define names based depends on Unicode path support
 */
#define tcout                  std::cout
#define file_name_t            std::string
#define imread_t               cv::imread
#define NMS_THRESH 0.45
#define BBOX_CONF_THRESH 0.3

static const int INPUT_W = 416;
static const int INPUT_H = 416;
static const int NUM_CLASSES = 80; // COCO has 80 classes. Modify this value on your own dataset.

cv::Mat static_resize(cv::Mat& img) {
    float r = std::min(INPUT_W / (img.cols*1.0), INPUT_H / (img.rows*1.0));
    // r = std::min(r, 1.0f);
    int unpad_w = r * img.cols;
    int unpad_h = r * img.rows;
    cv::Mat re(unpad_h, unpad_w, CV_8UC3);
    cv::resize(img, re, re.size());
    //cv::Mat out(INPUT_W, INPUT_H, CV_8UC3, cv::Scalar(114, 114, 114));
    cv::Mat out(INPUT_H, INPUT_W, CV_8UC3, cv::Scalar(114, 114, 114));
    re.copyTo(out(cv::Rect(0, 0, re.cols, re.rows)));
    return out;
}

void blobFromImage(cv::Mat& img, Blob::Ptr& blob){
    int channels = 3;
    int img_h = img.rows;
    int img_w = img.cols;
    InferenceEngine::MemoryBlob::Ptr mblob = InferenceEngine::as<InferenceEngine::MemoryBlob>(blob);
    if (!mblob) 
    {
        THROW_IE_EXCEPTION << "We expect blob to be inherited from MemoryBlob in matU8ToBlob, "
            << "but by fact we were not able to cast inputBlob to MemoryBlob";
    }
    // locked memory holder should be alive all time while access to its buffer happens
    auto mblobHolder = mblob->wmap();

    float *blob_data = mblobHolder.as<float *>();

    for (size_t c = 0; c < channels; c++) 
    {
        for (size_t  h = 0; h < img_h; h++) 
        {
            for (size_t w = 0; w < img_w; w++) 
            {
                blob_data[c * img_w * img_h + h * img_w + w] =
                    (float)img.at<cv::Vec3b>(h, w)[c];
            }
        }
    }
}


struct Object
{
    cv::Rect_<float> rect;
    int label;
    float prob;
};

struct GridAndStride
{
    int grid0;
    int grid1;
    int stride;
};

static void generate_grids_and_stride(const int target_w, const int target_h, std::vector<int>& strides, std::vector<GridAndStride>& grid_strides)
{
    for (auto stride : strides)
    {
        int num_grid_w = target_w / stride;
        int num_grid_h = target_h / stride;
        for (int g1 = 0; g1 < num_grid_h; g1++)
        {
            for (int g0 = 0; g0 < num_grid_w; g0++)
            {
                grid_strides.push_back((GridAndStride){g0, g1, stride});
            }
        }
    }
}


static void generate_yolox_proposals(std::vector<GridAndStride> grid_strides, const float* feat_ptr, float prob_threshold, std::vector<Object>& objects)
{

    const int num_anchors = grid_strides.size();

    for (int anchor_idx = 0; anchor_idx < num_anchors; anchor_idx++)
    {
        const int grid0 = grid_strides[anchor_idx].grid0;
        const int grid1 = grid_strides[anchor_idx].grid1;
        const int stride = grid_strides[anchor_idx].stride;

	const int basic_pos = anchor_idx * (NUM_CLASSES + 5);

        // yolox/models/yolo_head.py decode logic
        //  outputs[..., :2] = (outputs[..., :2] + grids) * strides
        //  outputs[..., 2:4] = torch.exp(outputs[..., 2:4]) * strides
        float x_center = (feat_ptr[basic_pos + 0] + grid0) * stride;
        float y_center = (feat_ptr[basic_pos + 1] + grid1) * stride;
        float w = exp(feat_ptr[basic_pos + 2]) * stride;
        float h = exp(feat_ptr[basic_pos + 3]) * stride;
        float x0 = x_center - w * 0.5f;
        float y0 = y_center - h * 0.5f;

        float box_objectness = feat_ptr[basic_pos + 4];
        for (int class_idx = 0; class_idx < NUM_CLASSES; class_idx++)
        {
            float box_cls_score = feat_ptr[basic_pos + 5 + class_idx];
            float box_prob = box_objectness * box_cls_score;
            if (box_prob > prob_threshold)
            {
                Object obj;
                obj.rect.x = x0;
                obj.rect.y = y0;
                obj.rect.width = w;
                obj.rect.height = h;
                obj.label = class_idx;
                obj.prob = box_prob;

                objects.push_back(obj);
            }

        } // class loop

    } // point anchor loop
}

static inline float intersection_area(const Object& a, const Object& b)
{
    cv::Rect_<float> inter = a.rect & b.rect;
    return inter.area();
}

static void qsort_descent_inplace(std::vector<Object>& faceobjects, int left, int right)
{
    int i = left;
    int j = right;
    float p = faceobjects[(left + right) / 2].prob;

    while (i <= j)
    {
        while (faceobjects[i].prob > p)
            i++;

        while (faceobjects[j].prob < p)
            j--;

        if (i <= j)
        {
            // swap
            std::swap(faceobjects[i], faceobjects[j]);

            i++;
            j--;
        }
    }

    #pragma omp parallel sections
    {
        #pragma omp section
        {
            if (left < j) qsort_descent_inplace(faceobjects, left, j);
        }
        #pragma omp section
        {
            if (i < right) qsort_descent_inplace(faceobjects, i, right);
        }
    }
}


static void qsort_descent_inplace(std::vector<Object>& objects)
{
    if (objects.empty())
        return;

    qsort_descent_inplace(objects, 0, objects.size() - 1);
}

static void nms_sorted_bboxes(const std::vector<Object>& faceobjects, std::vector<int>& picked, float nms_threshold)
{
    picked.clear();

    const int n = faceobjects.size();

    std::vector<float> areas(n);
    for (int i = 0; i < n; i++)
    {
        areas[i] = faceobjects[i].rect.area();
    }

    for (int i = 0; i < n; i++)
    {
        const Object& a = faceobjects[i];

        int keep = 1;
        for (int j = 0; j < (int)picked.size(); j++)
        {
            const Object& b = faceobjects[picked[j]];

            // intersection over union
            float inter_area = intersection_area(a, b);
            float union_area = areas[i] + areas[picked[j]] - inter_area;
            // float IoU = inter_area / union_area
            if (inter_area / union_area > nms_threshold)
                keep = 0;
        }

        if (keep)
            picked.push_back(i);
    }
}


static void decode_outputs(const float* prob, std::vector<Object>& objects, float scale, const int img_w, const int img_h) {
        std::vector<Object> proposals;
        std::vector<int> strides = {8, 16, 32};
        std::vector<GridAndStride> grid_strides;

        generate_grids_and_stride(INPUT_W, INPUT_H, strides, grid_strides);
        generate_yolox_proposals(grid_strides, prob,  BBOX_CONF_THRESH, proposals);
        qsort_descent_inplace(proposals);

        std::vector<int> picked;
        nms_sorted_bboxes(proposals, picked, NMS_THRESH);
        int count = picked.size();
        objects.resize(count);

        for (int i = 0; i < count; i++)
        {
            objects[i] = proposals[picked[i]];

            // adjust offset to original unpadded
            float x0 = (objects[i].rect.x) / scale;
            float y0 = (objects[i].rect.y) / scale;
            float x1 = (objects[i].rect.x + objects[i].rect.width) / scale;
            float y1 = (objects[i].rect.y + objects[i].rect.height) / scale;

            // clip
            x0 = std::max(std::min(x0, (float)(img_w - 1)), 0.f);
            y0 = std::max(std::min(y0, (float)(img_h - 1)), 0.f);
            x1 = std::max(std::min(x1, (float)(img_w - 1)), 0.f);
            y1 = std::max(std::min(y1, (float)(img_h - 1)), 0.f);

            objects[i].rect.x = x0;
            objects[i].rect.y = y0;
            objects[i].rect.width = x1 - x0;
            objects[i].rect.height = y1 - y0;
        }
}

const float color_list[80][3] =
{
    {0.000, 0.447, 0.741},
    {0.850, 0.325, 0.098},
    {0.929, 0.694, 0.125},
    {0.494, 0.184, 0.556},
    {0.466, 0.674, 0.188},
    {0.301, 0.745, 0.933},
    {0.635, 0.078, 0.184},
    {0.300, 0.300, 0.300},
    {0.600, 0.600, 0.600},
    {1.000, 0.000, 0.000},
    {1.000, 0.500, 0.000},
    {0.749, 0.749, 0.000},
    {0.000, 1.000, 0.000},
    {0.000, 0.000, 1.000},
    {0.667, 0.000, 1.000},
    {0.333, 0.333, 0.000},
    {0.333, 0.667, 0.000},
    {0.333, 1.000, 0.000},
    {0.667, 0.333, 0.000},
    {0.667, 0.667, 0.000},
    {0.667, 1.000, 0.000},
    {1.000, 0.333, 0.000},
    {1.000, 0.667, 0.000},
    {1.000, 1.000, 0.000},
    {0.000, 0.333, 0.500},
    {0.000, 0.667, 0.500},
    {0.000, 1.000, 0.500},
    {0.333, 0.000, 0.500},
    {0.333, 0.333, 0.500},
    {0.333, 0.667, 0.500},
    {0.333, 1.000, 0.500},
    {0.667, 0.000, 0.500},
    {0.667, 0.333, 0.500},
    {0.667, 0.667, 0.500},
    {0.667, 1.000, 0.500},
    {1.000, 0.000, 0.500},
    {1.000, 0.333, 0.500},
    {1.000, 0.667, 0.500},
    {1.000, 1.000, 0.500},
    {0.000, 0.333, 1.000},
    {0.000, 0.667, 1.000},
    {0.000, 1.000, 1.000},
    {0.333, 0.000, 1.000},
    {0.333, 0.333, 1.000},
    {0.333, 0.667, 1.000},
    {0.333, 1.000, 1.000},
    {0.667, 0.000, 1.000},
    {0.667, 0.333, 1.000},
    {0.667, 0.667, 1.000},
    {0.667, 1.000, 1.000},
    {1.000, 0.000, 1.000},
    {1.000, 0.333, 1.000},
    {1.000, 0.667, 1.000},
    {0.333, 0.000, 0.000},
    {0.500, 0.000, 0.000},
    {0.667, 0.000, 0.000},
    {0.833, 0.000, 0.000},
    {1.000, 0.000, 0.000},
    {0.000, 0.167, 0.000},
    {0.000, 0.333, 0.000},
    {0.000, 0.500, 0.000},
    {0.000, 0.667, 0.000},
    {0.000, 0.833, 0.000},
    {0.000, 1.000, 0.000},
    {0.000, 0.000, 0.167},
    {0.000, 0.000, 0.333},
    {0.000, 0.000, 0.500},
    {0.000, 0.000, 0.667},
    {0.000, 0.000, 0.833},
    {0.000, 0.000, 1.000},
    {0.000, 0.000, 0.000},
    {0.143, 0.143, 0.143},
    {0.286, 0.286, 0.286},
    {0.429, 0.429, 0.429},
    {0.571, 0.571, 0.571},
    {0.714, 0.714, 0.714},
    {0.857, 0.857, 0.857},
    {0.000, 0.447, 0.741},
    {0.314, 0.717, 0.741},
    {0.50, 0.5, 0}
};

static void draw_objects(const cv::Mat& bgr, const std::vector<Object>& objects)
{
    static const char* class_names[] = {
        "person", "bicycle", "car", "motorcycle", "airplane", "bus", "train", "truck", "boat", "traffic light",
        "fire hydrant", "stop sign", "parking meter", "bench", "bird", "cat", "dog", "horse", "sheep", "cow",
        "elephant", "bear", "zebra", "giraffe", "backpack", "umbrella", "handbag", "tie", "suitcase", "frisbee",
        "skis", "snowboard", "sports ball", "kite", "baseball bat", "baseball glove", "skateboard", "surfboard",
        "tennis racket", "bottle", "wine glass", "cup", "fork", "knife", "spoon", "bowl", "banana", "apple",
        "sandwich", "orange", "broccoli", "carrot", "hot dog", "pizza", "donut", "cake", "chair", "couch",
        "potted plant", "bed", "dining table", "toilet", "tv", "laptop", "mouse", "remote", "keyboard", "cell phone",
        "microwave", "oven", "toaster", "sink", "refrigerator", "book", "clock", "vase", "scissors", "teddy bear",
        "hair drier", "toothbrush"
    };

    cv::Mat image = bgr.clone();

    for (size_t i = 0; i < objects.size(); i++)
    {
        const Object& obj = objects[i];

        fprintf(stderr, "%d = %.5f at %.2f %.2f %.2f x %.2f\n", obj.label, obj.prob,
                obj.rect.x, obj.rect.y, obj.rect.width, obj.rect.height);

        cv::Scalar color = cv::Scalar(color_list[obj.label][0], color_list[obj.label][1], color_list[obj.label][2]);
        float c_mean = cv::mean(color)[0];
        cv::Scalar txt_color;
        if (c_mean > 0.5){
            txt_color = cv::Scalar(0, 0, 0);
        }else{
            txt_color = cv::Scalar(255, 255, 255);
        }

        cv::rectangle(image, obj.rect, color * 255, 2);

        char text[256];
        sprintf(text, "%s %.1f%%", class_names[obj.label], obj.prob * 100);

        int baseLine = 0;
        cv::Size label_size = cv::getTextSize(text, cv::FONT_HERSHEY_SIMPLEX, 0.4, 1, &baseLine);

        cv::Scalar txt_bk_color = color * 0.7 * 255;

        int x = obj.rect.x;
        int y = obj.rect.y + 1;
        //int y = obj.rect.y - label_size.height - baseLine;
        if (y > image.rows)
            y = image.rows;
        //if (x + label_size.width > image.cols)
            //x = image.cols - label_size.width;

        cv::rectangle(image, cv::Rect(cv::Point(x, y), cv::Size(label_size.width, label_size.height + baseLine)),
                      txt_bk_color, -1);

        cv::putText(image, text, cv::Point(x, y + label_size.height),
                    cv::FONT_HERSHEY_SIMPLEX, 0.4, txt_color, 1);
    }

    cv::imwrite("_demo.jpg" , image);
    fprintf(stderr, "save vis file\n");
    /* cv::imshow("image", image); */
    /* cv::waitKey(0); */
}


int main(int argc, char* argv[]) {
    try {
        // ------------------------------ Parsing and validation of input arguments
        // ---------------------------------
        if (argc != 4) {
            tcout << "Usage : " << argv[0] << " <path_to_model> <path_to_image> <device_name>" << std::endl;
            return EXIT_FAILURE;
        }

        const file_name_t input_model {argv[1]};
        const file_name_t input_image_path {argv[2]};
        const std::string device_name {argv[3]};
        // -----------------------------------------------------------------------------------------------------

        // --------------------------- Step 1. Initialize inference engine core
        // -------------------------------------
        Core ie;
        // -----------------------------------------------------------------------------------------------------

        // Step 2. Read a model in OpenVINO Intermediate Representation (.xml and
        // .bin files) or ONNX (.onnx file) format
        CNNNetwork network = ie.ReadNetwork(input_model);
        if (network.getOutputsInfo().size() != 1)
            throw std::logic_error("Sample supports topologies with 1 output only");
        if (network.getInputsInfo().size() != 1)
            throw std::logic_error("Sample supports topologies with 1 input only");
        // -----------------------------------------------------------------------------------------------------

        // --------------------------- Step 3. Configure input & output
        // ---------------------------------------------
        // --------------------------- Prepare input blobs
        // -----------------------------------------------------
        InputInfo::Ptr input_info = network.getInputsInfo().begin()->second;
        std::string input_name = network.getInputsInfo().begin()->first;

        /* Mark input as resizable by setting of a resize algorithm.
         * In this case we will be able to set an input blob of any shape to an
         * infer request. Resize and layout conversions are executed automatically
         * during inference */
        //input_info->getPreProcess().setResizeAlgorithm(RESIZE_BILINEAR);
        //input_info->setLayout(Layout::NHWC);
        //input_info->setPrecision(Precision::FP32);

        // --------------------------- Prepare output blobs
        // ----------------------------------------------------
        if (network.getOutputsInfo().empty()) {
            std::cerr << "Network outputs info is empty" << std::endl;
            return EXIT_FAILURE;
        }
        DataPtr output_info = network.getOutputsInfo().begin()->second;
        std::string output_name = network.getOutputsInfo().begin()->first;

        output_info->setPrecision(Precision::FP32);
        // -----------------------------------------------------------------------------------------------------

        // --------------------------- Step 4. Loading a model to the device
        // ------------------------------------------
        ExecutableNetwork executable_network = ie.LoadNetwork(network, device_name);
        // -----------------------------------------------------------------------------------------------------

        // --------------------------- Step 5. Create an infer request
        // -------------------------------------------------
        InferRequest infer_request = executable_network.CreateInferRequest();
        // -----------------------------------------------------------------------------------------------------

        // --------------------------- Step 6. Prepare input
        // --------------------------------------------------------
        /* Read input image to a blob and set it to an infer request without resize
         * and layout conversions. */
        cv::Mat image = imread_t(input_image_path);
	    cv::Mat pr_img = static_resize(image);
        Blob::Ptr imgBlob = infer_request.GetBlob(input_name);     // just wrap Mat data by Blob::Ptr
	    blobFromImage(pr_img, imgBlob);

        // infer_request.SetBlob(input_name, imgBlob);  // infer_request accepts input blob of any size
        // -----------------------------------------------------------------------------------------------------

        // --------------------------- Step 7. Do inference
        // --------------------------------------------------------
        /* Running the request synchronously */
        infer_request.Infer();
        // -----------------------------------------------------------------------------------------------------

        // --------------------------- Step 8. Process output
        // ------------------------------------------------------
        const Blob::Ptr output_blob = infer_request.GetBlob(output_name);
        MemoryBlob::CPtr moutput = as<MemoryBlob>(output_blob);
        if (!moutput) {
            throw std::logic_error("We expect output to be inherited from MemoryBlob, "
                                   "but by fact we were not able to cast output to MemoryBlob");
        }
        // locked memory holder should be alive all time while access to its buffer
        // happens
        auto moutputHolder = moutput->rmap();
        const float* net_pred = moutputHolder.as<const PrecisionTrait<Precision::FP32>::value_type*>();
        
	    int img_w = image.cols;
        int img_h = image.rows;
	    float scale = std::min(INPUT_W / (image.cols*1.0), INPUT_H / (image.rows*1.0));
        std::vector<Object> objects;

        decode_outputs(net_pred, objects, scale, img_w, img_h);
        draw_objects(image, objects);

            // -----------------------------------------------------------------------------------------------------
        } catch (const std::exception& ex) {
            std::cerr << ex.what() << std::endl;
            return EXIT_FAILURE;
    }
    return EXIT_SUCCESS;
}


================================================
FILE: demo/OpenVINO/python/README.md
================================================
# YOLOX-OpenVINO in Python

This tutorial includes a Python demo for OpenVINO, as well as some converted models.

### Download OpenVINO models.

| Model | Parameters | GFLOPs | Test Size | mAP | Weights |
|:------| :----: | :----: | :---: | :---: | :---: |
|  [YOLOX-Nano](../../../exps/default/nano.py) |  0.91M  | 1.08 | 416x416 | 25.8 | [github](https://github.com/Megvii-BaseDetection/YOLOX/releases/download/0.1.1rc0/yolox_nano_openvino.tar.gz) |
|  [YOLOX-Tiny](../../../exps/default/yolox_tiny.py) | 5.06M     | 6.45 | 416x416 |32.8 | [github](https://github.com/Megvii-BaseDetection/YOLOX/releases/download/0.1.1rc0/yolox_tiny_openvino.tar.gz) |
|  [YOLOX-S](../../../exps/default/yolox_s.py) | 9.0M | 26.8 | 640x640 |40.5 | [github](https://github.com/Megvii-BaseDetection/YOLOX/releases/download/0.1.1rc0/yolox_s_openvino.tar.gz) |
|  [YOLOX-M](../../../exps/default/yolox_m.py) | 25.3M | 73.8 | 640x640 |47.2 | [github](https://github.com/Megvii-BaseDetection/YOLOX/releases/download/0.1.1rc0/yolox_m_openvino.tar.gz) |
|  [YOLOX-L](../../../exps/default/yolox_l.py) | 54.2M | 155.6 | 640x640 |50.1 | [github](https://github.com/Megvii-BaseDetection/YOLOX/releases/download/0.1.1rc0/yolox_l_openvino.tar.gz) |
|  [YOLOX-Darknet53](../../../exps/default/yolov3.py) | 63.72M | 185.3 | 640x640 |48.0 | [github](https://github.com/Megvii-BaseDetection/YOLOX/releases/download/0.1.1rc0/yolox_dark_openvino.tar.gz) | 
|  [YOLOX-X](../../../exps/default/yolox_x.py) | 99.1M | 281.9 | 640x640 |51.5 | [github](https://github.com/Megvii-BaseDetection/YOLOX/releases/download/0.1.1rc0/yolox_x_openvino.tar.gz) |

## Install OpenVINO Toolkit

Please visit [Openvino Homepage](https://docs.openvinotoolkit.org/latest/get_started_guides.html) for more details.

## Set up the Environment

### For Linux

**Option1. Set up the environment tempororally. You need to run this command everytime you start a new shell window.**

```shell
source /opt/intel/openvino_2021/bin/setupvars.sh
```

**Option2. Set up the environment permenantly.**

*Step1.* For Linux:
```shell
vim ~/.bashrc
```

*Step2.* Add the following line into your file:

```shell
source /opt/intel/openvino_2021/bin/setupvars.sh
```

*Step3.* Save and exit the file, then run:

```shell
source ~/.bashrc
```


## Convert model

1. Export ONNX model

   Please refer to the [ONNX tutorial](https://github.com/Megvii-BaseDetection/YOLOX/demo/ONNXRuntime). **Note that you should set --opset to 10, otherwise your next step will fail.**

2. Convert ONNX to OpenVINO

   ``` shell
   cd <INSTSLL_DIR>/openvino_2021/deployment_tools/model_optimizer
   ```

   Install requirements for convert tool

   ```shell
   sudo ./install_prerequisites/install_prerequisites_onnx.sh
   ```

   Then convert model.
   ```shell
   python3 mo.py --input_model <ONNX_MODEL> --input_shape <INPUT_SHAPE> [--data_type FP16]
   ```
   For example:
   ```shell
   python3 mo.py --input_model yolox.onnx --input_shape [1,3,640,640] --data_type FP16 --output_dir converted_output
   ```

## Demo

### python

```shell
python openvino_inference.py -m <XML_MODEL_PATH> -i <IMAGE_PATH> 
```
or
```shell
python openvino_inference.py -m <XML_MODEL_PATH> -i <IMAGE_PATH> -o <OUTPUT_DIR> -s <SCORE_THR> -d <DEVICE>
```



================================================
FILE: demo/OpenVINO/python/openvino_inference.py
================================================
#!/usr/bin/env python3
# -*- coding: utf-8 -*-
# Copyright (C) 2018-2021 Intel Corporation
# SPDX-License-Identifier: Apache-2.0
# Copyright (c) Megvii, Inc. and its affiliates.

import argparse
import logging as log
import os
import sys

import cv2
import numpy as np

from openvino.inference_engine import IECore

from yolox.data.data_augment import preproc as preprocess
from yolox.data.datasets import COCO_CLASSES
from yolox.utils import mkdir, multiclass_nms, demo_postprocess, vis


def parse_args() -> argparse.Namespace:
    """Parse and return command line arguments"""
    parser = argparse.ArgumentParser(add_help=False)
    args = parser.add_argument_group('Options')
    args.add_argument(
        '-h',
        '--help',
        action='help',
        help='Show this help message and exit.')
    args.add_argument(
        '-m',
        '--model',
        required=True,
        type=str,
        help='Required. Path to an .xml or .onnx file with a trained model.')
    args.add_argument(
        '-i',
        '--input',
        required=True,
        type=str,
        help='Required. Path to an image file.')
    args.add_argument(
        '-o',
        '--output_dir',
        type=str,
        default='demo_output',
        help='Path to your output dir.')
    args.add_argument(
        '-s',
        '--score_thr',
        type=float,
        default=0.3,
        help="Score threshould to visualize the result.")
    args.add_argument(
        '-d',
        '--device',
        default='CPU',
        type=str,
        help='Optional. Specify the target device to infer on; CPU, GPU, \
              MYRIAD, HDDL or HETERO: is acceptable. The sample will look \
              for a suitable plugin for device specified. Default value \
              is CPU.')
    args.add_argument(
        '--labels',
        default=None,
        type=str,
        help='Option:al. Path to a labels mapping file.')
    args.add_argument(
        '-nt',
        '--number_top',
        default=10,
        type=int,
        help='Optional. Number of top results.')
    return parser.parse_args()


def main():
    log.basicConfig(format='[ %(levelname)s ] %(message)s', level=log.INFO, stream=sys.stdout)
    args = parse_args()

    # ---------------------------Step 1. Initialize inference engine core--------------------------------------------------
    log.info('Creating Inference Engine')
    ie = IECore()

    # ---------------------------Step 2. Read a model in OpenVINO Intermediate Representation or ONNX format---------------
    log.info(f'Reading the network: {args.model}')
    # (.xml and .bin files) or (.onnx file)
    net = ie.read_network(model=args.model)

    if len(net.input_info) != 1:
        log.error('Sample supports only single input topologies')
        return -1
    if len(net.outputs) != 1:
        log.error('Sample supports only single output topologies')
        return -1

    # ---------------------------Step 3. Configure input & output----------------------------------------------------------
    log.info('Configuring input and output blobs')
    # Get names of input and output blobs
    input_blob = next(iter(net.input_info))
    out_blob = next(iter(net.outputs))

    # Set input and output precision manually
    net.input_info[input_blob].precision = 'FP32'
    net.outputs[out_blob].precision = 'FP16'

    # Get a number of classes recognized by a model
    num_of_classes = max(net.outputs[out_blob].shape)

    # ---------------------------Step 4. Loading model to the device-------------------------------------------------------
    log.info('Loading the model to the plugin')
    exec_net = ie.load_network(network=net, device_name=args.device)

    # ---------------------------Step 5. Create infer request--------------------------------------------------------------
    # load_network() method of the IECore class with a specified number of requests (default 1) returns an ExecutableNetwork
    # instance which stores infer requests. So you already created Infer requests in the previous step.

    # ---------------------------Step 6. Prepare input---------------------------------------------------------------------
    origin_img = cv2.imread(args.input)
    _, _, h, w = net.input_info[input_blob].input_data.shape
    image, ratio = preprocess(origin_img, (h, w))

    # ---------------------------Step 7. Do inference----------------------------------------------------------------------
    log.info('Starting inference in synchronous mode')
    res = exec_net.infer(inputs={input_blob: image})

    # ---------------------------Step 8. Process output--------------------------------------------------------------------
    res = res[out_blob]

    predictions = demo_postprocess(res, (h, w))[0]

    boxes = predictions[:, :4]
    scores = predictions[:, 4, None] * predictions[:, 5:]

    boxes_xyxy = np.ones_like(boxes)
    boxes_xyxy[:, 0] = boxes[:, 0] - boxes[:, 2]/2.
    boxes_xyxy[:, 1] = boxes[:, 1] - boxes[:, 3]/2.
    boxes_xyxy[:, 2] = boxes[:, 0] + boxes[:, 2]/2.
    boxes_xyxy[:, 3] = boxes[:, 1] + boxes[:, 3]/2.
    boxes_xyxy /= ratio
    dets = multiclass_nms(boxes_xyxy, scores, nms_thr=0.45, score_thr=0.1)

    if dets is not None:
        final_boxes = dets[:, :4]
        final_scores, final_cls_inds = dets[:, 4], dets[:, 5]
        origin_img = vis(origin_img, final_boxes, final_scores, final_cls_inds,
                         conf=args.score_thr, class_names=COCO_CLASSES)

    mkdir(args.output_dir)
    output_path = os.path.join(args.output_dir, os.path.basename(args.input))
    cv2.imwrite(output_path, origin_img)


if __name__ == '__main__':
    sys.exit(main())


================================================
FILE: demo/TensorRT/cpp/CMakeLists.txt
================================================
cmake_minimum_required(VERSION 2.6)

project(yolox)

add_definitions(-std=c++11)

option(CUDA_USE_STATIC_CUDA_RUNTIME OFF)
set(CMAKE_CXX_STANDARD 11)
set(CMAKE_BUILD_TYPE Debug)

find_package(CUDA REQUIRED)

include_directories(${PROJECT_SOURCE_DIR}/include)
# include and link dirs of cuda and tensorrt, you need adapt them if yours are different
# cuda
include_directories(/data/cuda/cuda-10.2/cuda/include)
link_directories(/data/cuda/cuda-10.2/cuda/lib64)
# cudnn
include_directories(/data/cuda/cuda-10.2/cudnn/v8.0.4/include)
link_directories(/data/cuda/cuda-10.2/cudnn/v8.0.4/lib64)
# tensorrt
include_directories(/data/cuda/cuda-10.2/TensorRT/v7.2.1.6/include)
link_directories(/data/cuda/cuda-10.2/TensorRT/v7.2.1.6/lib)

set(CMAKE_CXX_FLAGS "${CMAKE_CXX_FLAGS} -std=c++11 -Wall -Ofast -Wfatal-errors -D_MWAITXINTRIN_H_INCLUDED")

find_package(OpenCV)
include_directories(${OpenCV_INCLUDE_DIRS})

add_executable(yolox ${PROJECT_SOURCE_DIR}/yolox.cpp)
target_link_libraries(yolox nvinfer)
target_link_libraries(yolox cudart)
target_link_libraries(yolox ${OpenCV_LIBS})

add_definitions(-O2 -pthread)



================================================
FILE: demo/TensorRT/cpp/README.md
================================================
# YOLOX-TensorRT in C++

As YOLOX models are easy to convert to tensorrt using [torch2trt gitrepo](https://github.com/NVIDIA-AI-IOT/torch2trt), 
our C++ demo does not include the model converting or constructing like other tenorrt demos.


## Step 1: Prepare serialized engine file

Follow the trt [python demo README](https://github.com/Megvii-BaseDetection/YOLOX/blob/main/demo/TensorRT/python/README.md) to convert and save the serialized engine file.

Check the 'model_trt.engine' file generated from Step 1, which will be automatically saved at the current demo dir.


## Step 2: build the demo

Please follow the [TensorRT Installation Guide](https://docs.nvidia.com/deeplearning/tensorrt/install-guide/index.html) to install TensorRT.

And you should set the TensorRT path and CUDA path in CMakeLists.txt.

If you train your custom dataset, you may need to modify the value of `num_class`.

```c++
const int num_class = 80;
```

Install opencv with ```sudo apt-get install libopencv-dev``` (we don't need a higher version of opencv like v3.3+). 

build the demo:

```shell
mkdir build
cd build
cmake ..
make
```

Then run the demo:

```shell
./yolox ../model_trt.engine -i ../../../../assets/dog.jpg
```

or

```shell
./yolox <path/to/your/engine_file> -i <path/to/image>
```

NOTE: for `trtexec` users, modify `INPUT_BLOB_NAME` and `OUTPUT_BLOB_NAME` as the following code.
```
const char* INPUT_BLOB_NAME = "images";
const char* OUTPUT_BLOB_NAME = "output";
```

Here is the command to convert the small onnx model to tensorrt engine file:
```
trtexec --onnx=yolox_s.onnx --saveEngine=yolox_s.trt
```


================================================
FILE: demo/TensorRT/cpp/logging.h
================================================
/*
 * Copyright (c) 2019, NVIDIA CORPORATION. All rights reserved.
 *
 * Licensed under the Apache License, Version 2.0 (the "License");
 * you may not use this file except in compliance with the License.
 * You may obtain a copy of the License at
 *
 *     http://www.apache.org/licenses/LICENSE-2.0
 *
 * Unless required by applicable law or agreed to in writing, software
 * distributed under the License is distributed on an "AS IS" BASIS,
 * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
 * See the License for the specific language governing permissions and
 * limitations under the License.
 */

#ifndef TENSORRT_LOGGING_H
#define TENSORRT_LOGGING_H

#include "NvInferRuntimeCommon.h"
#include <cassert>
#include <ctime>
#include <iomanip>
#include <iostream>
#include <ostream>
#include <sstream>
#include <string>

using Severity = nvinfer1::ILogger::Severity;

class LogStreamConsumerBuffer : public std::stringbuf
{
public:
    LogStreamConsumerBuffer(std::ostream& stream, const std::string& prefix, bool shouldLog)
        : mOutput(stream)
        , mPrefix(prefix)
        , mShouldLog(shouldLog)
    {
    }

    LogStreamConsumerBuffer(LogStreamConsumerBuffer&& other)
        : mOutput(other.mOutput)
    {
    }

    ~LogStreamConsumerBuffer()
    {
        // std::streambuf::pbase() gives a pointer to the beginning of the buffered part of the output sequence
        // std::streambuf::pptr() gives a pointer to the current position of the output sequence
        // if the pointer to the beginning is not equal to the pointer to the current position,
        // call putOutput() to log the output to the stream
        if (pbase() != pptr())
        {
            putOutput();
        }
    }

    // synchronizes the stream buffer and returns 0 on success
    // synchronizing the stream buffer consists of inserting the buffer contents into the stream,
    // resetting the buffer and flushing the stream
    virtual int sync()
    {
        putOutput();
        return 0;
    }

    void putOutput()
    {
        if (mShouldLog)
        {
            // prepend timestamp
            std::time_t timestamp = std::time(nullptr);
            tm* tm_local = std::localtime(&timestamp);
            std::cout << "[";
            std::cout << std::setw(2) << std::setfill('0') << 1 + tm_local->tm_mon << "/";
            std::cout << std::setw(2) << std::setfill('0') << tm_local->tm_mday << "/";
            std::cout << std::setw(4) << std::setfill('0') << 1900 + tm_local->tm_year << "-";
            std::cout << std::setw(2) << std::setfill('0') << tm_local->tm_hour << ":";
            std::cout << std::setw(2) << std::setfill('0') << tm_local->tm_min << ":";
            std::cout << std::setw(2) << std::setfill('0') << tm_local->tm_sec << "] ";
            // std::stringbuf::str() gets the string contents of the buffer
            // insert the buffer contents pre-appended by the appropriate prefix into the stream
            mOutput << mPrefix << str();
            // set the buffer to empty
            str("");
            // flush the stream
            mOutput.flush();
        }
    }

    void setShouldLog(bool shouldLog)
    {
        mShouldLog = shouldLog;
    }

private:
    std::ostream& mOutput;
    std::string mPrefix;
    bool mShouldLog;
};

//!
//! \class LogStreamConsumerBase
//! \brief Convenience object used to initialize LogStreamConsumerBuffer before std::ostream in LogStreamConsumer
//!
class LogStreamConsumerBase
{
public:
    LogStreamConsumerBase(std::ostream& stream, const std::string& prefix, bool shouldLog)
        : mBuffer(stream, prefix, shouldLog)
    {
    }

protected:
    LogStreamConsumerBuffer mBuffer;
};

//!
//! \class LogStreamConsumer
//! \brief Convenience object used to facilitate use of C++ stream syntax when logging messages.
//!  Order of base classes is LogStreamConsumerBase and then std::ostream.
//!  This is because the LogStreamConsumerBase class is used to initialize the LogStreamConsumerBuffer member field
//!  in LogStreamConsumer and then the address of the buffer is passed to std::ostream.
//!  This is necessary to prevent the address of an uninitialized buffer from being passed to std::ostream.
//!  Please do not change the order of the parent classes.
//!
class LogStreamConsumer : protected LogStreamConsumerBase, public std::ostream
{
public:
    //! \brief Creates a LogStreamConsumer which logs messages with level severity.
    //!  Reportable severity determines if the messages are severe enough to be logged.
    LogStreamConsumer(Severity reportableSeverity, Severity severity)
        : LogStreamConsumerBase(severityOstream(severity), severityPrefix(severity), severity <= reportableSeverity)
        , std::ostream(&mBuffer) // links the stream buffer with the stream
        , mShouldLog(severity <= reportableSeverity)
        , mSeverity(severity)
    {
    }

    LogStreamConsumer(LogStreamConsumer&& other)
        : LogStreamConsumerBase(severityOstream(other.mSeverity), severityPrefix(other.mSeverity), other.mShouldLog)
        , std::ostream(&mBuffer) // links the stream buffer with the stream
        , mShouldLog(other.mShouldLog)
        , mSeverity(other.mSeverity)
    {
    }

    void setReportableSeverity(Severity reportableSeverity)
    {
        mShouldLog = mSeverity <= reportableSeverity;
        mBuffer.setShouldLog(mShouldLog);
    }

private:
    static std::ostream& severityOstream(Severity severity)
    {
        return severity >= Severity::kINFO ? std::cout : std::cerr;
    }

    static std::string severityPrefix(Severity severity)
    {
        switch (severity)
        {
        case Severity::kINTERNAL_ERROR: return "[F] ";
        case Severity::kERROR: return "[E] ";
        case Severity::kWARNING: return "[W] ";
        case Severity::kINFO: return "[I] ";
        case Severity::kVERBOSE: return "[V] ";
        default: assert(0); return "";
        }
    }

    bool mShouldLog;
    Severity mSeverity;
};

//! \class Logger
//!
//! \brief Class which manages logging of TensorRT tools and samples
//!
//! \details This class provides a common interface for TensorRT tools and samples to log information to the console,
//! and supports logging two types of messages:
//!
//! - Debugging messages with an associated severity (info, warning, error, or internal error/fatal)
//! - Test pass/fail messages
//!
//! The advantage of having all samples use this class for logging as opposed to emitting directly to stdout/stderr is
//! that the logic for controlling the verbosity and formatting of sample output is centralized in one location.
//!
//! In the future, this class could be extended to support dumping test results to a file in some standard format
//! (for example, JUnit XML), and providing additional metadata (e.g. timing the duration of a test run).
//!
//! TODO: For backwards compatibility with existing samples, this class inherits directly from the nvinfer1::ILogger
//! interface, which is problematic since there isn't a clean separation between messages coming from the TensorRT
//! library and messages coming from the sample.
//!
//! In the future (once all samples are updated to use Logger::getTRTLogger() to access the ILogger) we can refactor the
//! class to eliminate the inheritance and instead make the nvinfer1::ILogger implementation a member of the Logger
//! object.

class Logger : public nvinfer1::ILogger
{
public:
    Logger(Severity severity = Severity::kWARNING)
        : mReportableSeverity(severity)
    {
    }

    //!
    //! \enum TestResult
    //! \brief Represents the state of a given test
    //!
    enum class TestResult
    {
        kRUNNING, //!< The test is running
        kPASSED,  //!< The test passed
        kFAILED,  //!< The test failed
        kWAIVED   //!< The test was waived
    };

    //!
    //! \brief Forward-compatible method for retrieving the nvinfer::ILogger associated with this Logger
    //! \return The nvinfer1::ILogger associated with this Logger
    //!
    //! TODO Once all samples are updated to use this method to register the logger with TensorRT,
    //! we can eliminate the inheritance of Logger from ILogger
    //!
    nvinfer1::ILogger& getTRTLogger()
    {
        return *this;
    }

    //!
    //! \brief Implementation of the nvinfer1::ILogger::log() virtual method
    //!
    //! Note samples should not be calling this function directly; it will eventually go away once we eliminate the
    //! inheritance from nvinfer1::ILogger
    //!
    void log(Severity severity, const char* msg) noexcept override
    {
        LogStreamConsumer(mReportableSeverity, severity) << "[TRT] " << std::string(msg) << std::endl;
    }

    //!
    //! \brief Method for controlling the verbosity of logging output
    //!
    //! \param severity The logger will only emit messages that have severity of this level or higher.
    //!
    void setReportableSeverity(Severity severity)
    {
        mReportableSeverity = severity;
    }

    //!
    //! \brief Opaque handle that holds logging information for a particular test
    //!
    //! This object is an opaque handle to information used by the Logger to print test results.
    //! The sample must call Logger::defineTest() in order to obtain a TestAtom that can be used
    //! with Logger::reportTest{Start,End}().
    //!
    class TestAtom
    {
    public:
        TestAtom(TestAtom&&) = default;

    private:
        friend class Logger;

        TestAtom(bool started, const std::string& name, const std::string& cmdline)
            : mStarted(started)
            , mName(name)
            , mCmdline(cmdline)
        {
        }

        bool mStarted;
        std::string mName;
        std::string mCmdline;
    };

    //!
    //! \brief Define a test for logging
    //!
    //! \param[in] name The name of the test.  This should be a string starting with
    //!                  "TensorRT" and containing dot-separated strings containing
    //!                  the characters [A-Za-z0-9_].
    //!                  For example, "TensorRT.sample_googlenet"
    //! \param[in] cmdline The command line used to reproduce the test
    //
    //! \return a TestAtom that can be used in Logger::reportTest{Start,End}().
    //!
    static TestAtom defineTest(const std::string& name, const std::string& cmdline)
    {
        return TestAtom(false, name, cmdline);
    }

    //!
    //! \brief A convenience overloaded version of defineTest() that accepts an array of command-line arguments
    //!        as input
    //!
    //! \param[in] name The name of the test
    //! \param[in] argc The number of command-line arguments
    //! \param[in] argv The array of command-line arguments (given as C strings)
    //!
    //! \return a TestAtom that can be used in Logger::reportTest{Start,End}().
    static TestAtom defineTest(const std::string& name, int argc, char const* const* argv)
    {
        auto cmdline = genCmdlineString(argc, argv);
        return defineTest(name, cmdline);
    }

    //!
    //! \brief Report that a test has started.
    //!
    //! \pre reportTestStart() has not been called yet for the given testAtom
    //!
    //! \param[in] testAtom The handle to the test that has started
    //!
    static void reportTestStart(TestAtom& testAtom)
    {
        reportTestResult(testAtom, TestResult::kRUNNING);
        assert(!testAtom.mStarted);
        testAtom.mStarted = true;
    }

    //!
    //! \brief Report that a test has ended.
    //!
    //! \pre reportTestStart() has been called for the given testAtom
    //!
    //! \param[in] testAtom The handle to the test that has ended
    //! \param[in] result The result of the test. Should be one of TestResult::kPASSED,
    //!                   TestResult::kFAILED, TestResult::kWAIVED
    //!
    static void reportTestEnd(const TestAtom& testAtom, TestResult result)
    {
        assert(result != TestResult::kRUNNING);
        assert(testAtom.mStarted);
        reportTestResult(testAtom, result);
    }

    static int reportPass(const TestAtom& testAtom)
    {
        reportTestEnd(testAtom, TestResult::kPASSED);
        return EXIT_SUCCESS;
    }

    static int reportFail(const TestAtom& testAtom)
    {
        reportTestEnd(testAtom, TestResult::kFAILED);
        return EXIT_FAILURE;
    }

    static int reportWaive(const TestAtom& testAtom)
    {
        reportTestEnd(testAtom, TestResult::kWAIVED);
        return EXIT_SUCCESS;
    }

    static int reportTest(const TestAtom& testAtom, bool pass)
    {
        return pass ? reportPass(testAtom) : reportFail(testAtom);
    }

    Severity getReportableSeverity() const
    {
        return mReportableSeverity;
    }

private:
    //!
    //! \brief returns an appropriate string for prefixing a log message with the given severity
    //!
    static const char* severityPrefix(Severity severity)
    {
        switch (severity)
        {
        case Severity::kINTERNAL_ERROR: return "[F] ";
        case Severity::kERROR: return "[E] ";
        case Severity::kWARNING: return "[W] ";
        case Severity::kINFO: return "[I] ";
        case Severity::kVERBOSE: return "[V] ";
        default: assert(0); return "";
        }
    }

    //!
    //! \brief returns an appropriate string for prefixing a test result message with the given result
    //!
    static const char* testResultString(TestResult result)
    {
        switch (result)
        {
        case TestResult::kRUNNING: return "RUNNING";
        case TestResult::kPASSED: return "PASSED";
        case TestResult::kFAILED: return "FAILED";
        case TestResult::kWAIVED: return "WAIVED";
        default: assert(0); return "";
        }
    }

    //!
    //! \brief returns an appropriate output stream (cout or cerr) to use with the given severity
    //!
    static std::ostream& severityOstream(Severity severity)
    {
        return severity >= Severity::kINFO ? std::cout : std::cerr;
    }

    //!
    //! \brief method that implements logging test results
    //!
    static void reportTestResult(const TestAtom& testAtom, TestResult result)
    {
        severityOstream(Severity::kINFO) << "&&&& " << testResultString(result) << " " << testAtom.mName << " # "
                                         << testAtom.mCmdline << std::endl;
    }

    //!
    //! \brief generate a command line string from the given (argc, argv) values
    //!
    static std::string genCmdlineString(int argc, char const* const* argv)
    {
        std::stringstream ss;
        for (int i = 0; i < argc; i++)
        {
            if (i > 0)
                ss << " ";
            ss << argv[i];
        }
        return ss.str();
    }

    Severity mReportableSeverity;
};

namespace
{

//!
//! \brief produces a LogStreamConsumer object that can be used to log messages of severity kVERBOSE
//!
//! Example usage:
//!
//!     LOG_VERBOSE(logger) << "hello world" << std::endl;
//!
inline LogStreamConsumer LOG_VERBOSE(const Logger& logger)
{
    return LogStreamConsumer(logger.getReportableSeverity(), Severity::kVERBOSE);
}

//!
//! \brief produces a LogStreamConsumer object that can be used to log messages of severity kINFO
//!
//! Example usage:
//!
//!     LOG_INFO(logger) << "hello world" << std::endl;
//!
inline LogStreamConsumer LOG_INFO(const Logger& logger)
{
    return LogStreamConsumer(logger.getReportableSeverity(), Severity::kINFO);
}

//!
//! \brief produces a LogStreamConsumer object that can be used to log messages of severity kWARNING
//!
//! Example usage:
//!
//!     LOG_WARN(logger) << "hello world" << std::endl;
//!
inline LogStreamConsumer LOG_WARN(const Logger& logger)
{
    return LogStreamConsumer(logger.getReportableSeverity(), Severity::kWARNING);
}

//!
//! \brief produces a LogStreamConsumer object that can be used to log messages of severity kERROR
//!
//! Example usage:
//!
//!     LOG_ERROR(logger) << "hello world" << std::endl;
//!
inline LogStreamConsumer LOG_ERROR(const Logger& logger)
{
    return LogStreamConsumer(logger.getReportableSeverity(), Severity::kERROR);
}

//!
//! \brief produces a LogStreamConsumer object that can be used to log messages of severity kINTERNAL_ERROR
//         ("fatal" severity)
//!
//! Example usage:
//!
//!     LOG_FATAL(logger) << "hello world" << std::endl;
//!
inline LogStreamConsumer LOG_FATAL(const Logger& logger)
{
    return LogStreamConsumer(logger.getReportableSeverity(), Severity::kINTERNAL_ERROR);
}

} // anonymous namespace

#endif // TENSORRT_LOGGING_H


================================================
FILE: demo/TensorRT/cpp/yolox.cpp
================================================
#include <fstream>
#include <iostream>
#include <sstream>
#include <numeric>
#include <chrono>
#include <vector>
#include <opencv2/opencv.hpp>
#include <dirent.h>
#include "NvInfer.h"
#include "cuda_runtime_api.h"
#include "logging.h"

#define CHECK(status) \
    do\
    {\
        auto ret = (status);\
        if (ret != 0)\
        {\
            std::cerr << "Cuda failure: " << ret << std::endl;\
            abort();\
        }\
    } while (0)

#define DEVICE 0  // GPU id
#define NMS_THRESH 0.45
#define BBOX_CONF_THRESH 0.3

using namespace nvinfer1;

// stuff we know about the network and the input/output blobs
static const int INPUT_W = 640;
static const int INPUT_H = 640;
static const int NUM_CLASSES = 80;
const char* INPUT_BLOB_NAME = "input_0";
const char* OUTPUT_BLOB_NAME = "output_0";
static Logger gLogger;

cv::Mat static_resize(cv::Mat& img) {
    float r = std::min(INPUT_W / (img.cols*1.0), INPUT_H / (img.rows*1.0));
    // r = std::min(r, 1.0f);
    int unpad_w = r * img.cols;
    int unpad_h = r * img.rows;
    cv::Mat re(unpad_h, unpad_w, CV_8UC3);
    cv::resize(img, re, re.size());
    cv::Mat out(INPUT_H, INPUT_W, CV_8UC3, cv::Scalar(114, 114, 114));
    re.copyTo(out(cv::Rect(0, 0, re.cols, re.rows)));
    return out;
}

struct Object
{
    cv::Rect_<float> rect;
    int label;
    float prob;
};

struct GridAndStride
{
    int grid0;
    int grid1;
    int stride;
};

static void generate_grids_and_stride(std::vector<int>& strides, std::vector<GridAndStride>& grid_strides)
{
    for (auto stride : strides)
    {
        int num_grid_y = INPUT_H / stride;
        int num_grid_x = INPUT_W / stride;
        for (int g1 = 0; g1 < num_grid_y; g1++)
        {
            for (int g0 = 0; g0 < num_grid_x; g0++)
            {
                grid_strides.push_back((GridAndStride){g0, g1, stride});
            }
        }
    }
}

static inline float intersection_area(const Object& a, const Object& b)
{
    cv::Rect_<float> inter = a.rect & b.rect;
    return inter.area();
}

static void qsort_descent_inplace(std::vector<Object>& faceobjects, int left, int right)
{
    int i = left;
    int j = right;
    float p = faceobjects[(left + right) / 2].prob;

    while (i <= j)
    {
        while (faceobjects[i].prob > p)
            i++;

        while (faceobjects[j].prob < p)
            j--;

        if (i <= j)
        {
            // swap
            std::swap(faceobjects[i], faceobjects[j]);

            i++;
            j--;
        }
    }

    #pragma omp parallel sections
    {
        #pragma omp section
        {
            if (left < j) qsort_descent_inplace(faceobjects, left, j);
        }
        #pragma omp section
        {
            if (i < right) qsort_descent_inplace(faceobjects, i, right);
        }
    }
}

static void qsort_descent_inplace(std::vector<Object>& objects)
{
    if (objects.empty())
        return;

    qsort_descent_inplace(objects, 0, objects.size() - 1);
}

static void nms_sorted_bboxes(const std::vector<Object>& faceobjects, std::vector<int>& picked, float nms_threshold)
{
    picked.clear();

    const int n = faceobjects.size();

    std::vector<float> areas(n);
    for (int i = 0; i < n; i++)
    {
        areas[i] = faceobjects[i].rect.area();
    }

    for (int i = 0; i < n; i++)
    {
        const Object& a = faceobjects[i];

        int keep = 1;
        for (int j = 0; j < (int)picked.size(); j++)
        {
            const Object& b = faceobjects[picked[j]];

            // intersection over union
            float inter_area = intersection_area(a, b);
            float union_area = areas[i] + areas[picked[j]] - inter_area;
            // float IoU = inter_area / union_area
            if (inter_area / union_area > nms_threshold)
                keep = 0;
        }

        if (keep)
            picked.push_back(i);
    }
}


static void generate_yolox_proposals(std::vector<GridAndStride> grid_strides, float* feat_blob, float prob_threshold, std::vector<Object>& objects)
{

    const int num_anchors = grid_strides.size();

    for (int anchor_idx = 0; anchor_idx < num_anchors; anchor_idx++)
    {
        const int grid0 = grid_strides[anchor_idx].grid0;
        const int grid1 = grid_strides[anchor_idx].grid1;
        const int stride = grid_strides[anchor_idx].stride;

        const int basic_pos = anchor_idx * (NUM_CLASSES + 5);

        // yolox/models/yolo_head.py decode logic
        float x_center = (feat_blob[basic_pos+0] + grid0) * stride;
        float y_center = (feat_blob[basic_pos+1] + grid1) * stride;
        float w = exp(feat_blob[basic_pos+2]) * stride;
        float h = exp(feat_blob[basic_pos+3]) * stride;
        float x0 = x_center - w * 0.5f;
        float y0 = y_center - h * 0.5f;

        float box_objectness = feat_blob[basic_pos+4];
        for (int class_idx = 0; class_idx < NUM_CLASSES; class_idx++)
        {
            float box_cls_score = feat_blob[basic_pos + 5 + class_idx];
            float box_prob = box_objectness * box_cls_score;
            if (box_prob > prob_threshold)
            {
                Object obj;
                obj.rect.x = x0;
                obj.rect.y = y0;
                obj.rect.width = w;
                obj.rect.height = h;
                obj.label = class_idx;
                obj.prob = box_prob;

                objects.push_back(obj);
            }

        } // class loop

    } // point anchor loop
}

float* blobFromImage(cv::Mat& img){
    float* blob = new float[img.total()*3];
    int channels = 3;
    int img_h = img.rows;
    int img_w = img.cols;
    for (size_t c = 0; c < channels; c++) 
    {
        for (size_t  h = 0; h < img_h; h++) 
        {
            for (size_t w = 0; w < img_w; w++) 
            {
                blob[c * img_w * img_h + h * img_w + w] =
                    (float)img.at<cv::Vec3b>(h, w)[c];
            }
        }
    }
    return blob;
}


static void decode_outputs(float* prob, std::vector<Object>& objects, float scale, const int img_w, const int img_h) {
        std::vector<Object> proposals;
        std::vector<int> strides = {8, 16, 32};
        std::vector<GridAndStride> grid_strides;
        generate_grids_and_stride(strides, grid_strides);
        generate_yolox_proposals(grid_strides, prob,  BBOX_CONF_THRESH, proposals);
        std::cout << "num of boxes before nms: " << proposals.size() << std::endl;

        qsort_descent_inplace(proposals);

        std::vector<int> picked;
        nms_sorted_bboxes(proposals, picked, NMS_THRESH);


        int count = picked.size();

        std::cout << "num of boxes: " << count << std::endl;

        objects.resize(count);
        for (int i = 0; i < count; i++)
        {
            objects[i] = proposals[picked[i]];

            // adjust offset to original unpadded
            float x0 = (objects[i].rect.x) / scale;
            float y0 = (objects[i].rect.y) / scale;
            float x1 = (objects[i].rect.x + objects[i].rect.width) / scale;
            float y1 = (objects[i].rect.y + objects[i].rect.height) / scale;

            // clip
            x0 = std::max(std::min(x0, (float)(img_w - 1)), 0.f);
            y0 = std::max(std::min(y0, (float)(img_h - 1)), 0.f);
            x1 = std::max(std::min(x1, (float)(img_w - 1)), 0.f);
            y1 = std::max(std::min(y1, (float)(img_h - 1)), 0.f);

            objects[i].rect.x = x0;
            objects[i].rect.y = y0;
            objects[i].rect.width = x1 - x0;
            objects[i].rect.height = y1 - y0;
        }
}

const float color_list[80][3] =
{
    {0.000, 0.447, 0.741},
    {0.850, 0.325, 0.098},
    {0.929, 0.694, 0.125},
    {0.494, 0.184, 0.556},
    {0.466, 0.674, 0.188},
    {0.301, 0.745, 0.933},
    {0.635, 0.078, 0.184},
    {0.300, 0.300, 0.300},
    {0.600, 0.600, 0.600},
    {1.000, 0.000, 0.000},
    {1.000, 0.500, 0.000},
    {0.749, 0.749, 0.000},
    {0.000, 1.000, 0.000},
    {0.000, 0.000, 1.000},
    {0.667, 0.000, 1.000},
    {0.333, 0.333, 0.000},
    {0.333, 0.667, 0.000},
    {0.333, 1.000, 0.000},
    {0.667, 0.333, 0.000},
    {0.667, 0.667, 0.000},
    {0.667, 1.000, 0.000},
    {1.000, 0.333, 0.000},
    {1.000, 0.667, 0.000},
    {1.000, 1.000, 0.000},
    {0.000, 0.333, 0.500},
    {0.000, 0.667, 0.500},
    {0.000, 1.000, 0.500},
    {0.333, 0.000, 0.500},
    {0.333, 0.333, 0.500},
    {0.333, 0.667, 0.500},
    {0.333, 1.000, 0.500},
    {0.667, 0.000, 0.500},
    {0.667, 0.333, 0.500},
    {0.667, 0.667, 0.500},
    {0.667, 1.000, 0.500},
    {1.000, 0.000, 0.500},
    {1.000, 0.333, 0.500},
    {1.000, 0.667, 0.500},
    {1.000, 1.000, 0.500},
    {0.000, 0.333, 1.000},
    {0.000, 0.667, 1.000},
    {0.000, 1.000, 1.000},
    {0.333, 0.000, 1.000},
    {0.333, 0.333, 1.000},
    {0.333, 0.667, 1.000},
    {0.333, 1.000, 1.000},
    {0.667, 0.000, 1.000},
    {0.667, 0.333, 1.000},
    {0.667, 0.667, 1.000},
    {0.667, 1.000, 1.000},
    {1.000, 0.000, 1.000},
    {1.000, 0.333, 1.000},
    {1.000, 0.667, 1.000},
    {0.333, 0.000, 0.000},
    {0.500, 0.000, 0.000},
    {0.667, 0.000, 0.000},
    {0.833, 0.000, 0.000},
    {1.000, 0.000, 0.000},
    {0.000, 0.167, 0.000},
    {0.000, 0.333, 0.000},
    {0.000, 0.500, 0.000},
    {0.000, 0.667, 0.000},
    {0.000, 0.833, 0.000},
    {0.000, 1.000, 0.000},
    {0.000, 0.000, 0.167},
    {0.000, 0.000, 0.333},
    {0.000, 0.000, 0.500},
    {0.000, 0.000, 0.667},
    {0.000, 0.000, 0.833},
    {0.000, 0.000, 1.000},
    {0.000, 0.000, 0.000},
    {0.143, 0.143, 0.143},
    {0.286, 0.286, 0.286},
    {0.429, 0.429, 0.429},
    {0.571, 0.571, 0.571},
    {0.714, 0.714, 0.714},
    {0.857, 0.857, 0.857},
    {0.000, 0.447, 0.741},
    {0.314, 0.717, 0.741},
    {0.50, 0.5, 0}
};

static void draw_objects(const cv::Mat& bgr, const std::vector<Object>& objects, std::string f)
{
    static const char* class_names[] = {
        "person", "bicycle", "car", "motorcycle", "airplane", "bus", "train", "truck", "boat", "traffic light",
        "fire hydrant", "stop sign", "parking meter", "bench", "bird", "cat", "dog", "horse", "sheep", "cow",
        "elephant", "bear", "zebra", "giraffe", "backpack", "umbrella", "handbag", "tie", "suitcase", "frisbee",
        "skis", "snowboard", "sports ball", "kite", "baseball bat", "baseball glove", "skateboard", "surfboard",
        "tennis racket", "bottle", "wine glass", "cup", "fork", "knife", "spoon", "bowl", "banana", "apple",
        "sandwich", "orange", "broccoli", "carrot", "hot dog", "pizza", "donut", "cake", "chair", "couch",
        "potted plant", "bed", "dining table", "toilet", "tv", "laptop", "mouse", "remote", "keyboard", "cell phone",
        "microwave", "oven", "toaster", "sink", "refrigerator", "book", "clock", "vase", "scissors", "teddy bear",
        "hair drier", "toothbrush"
    };

    cv::Mat image = bgr.clone();

    for (size_t i = 0; i < objects.size(); i++)
    {
        const Object& obj = objects[i];

        fprintf(stderr, "%d = %.5f at %.2f %.2f %.2f x %.2f\n", obj.label, obj.prob,
                obj.rect.x, obj.rect.y, obj.rect.width, obj.rect.height);

        cv::Scalar color = cv::Scalar(color_list[obj.label][0], color_list[obj.label][1], color_list[obj.label][2]);
        float c_mean = cv::mean(color)[0];
        cv::Scalar txt_color;
        if (c_mean > 0.5){
            txt_color = cv::Scalar(0, 0, 0);
        }else{
            txt_color = cv::Scalar(255, 255, 255);
        }

        cv::rectangle(image, obj.rect, color * 255, 2);

        char text[256];
        sprintf(text, "%s %.1f%%", class_names[obj.label], obj.prob * 100);

        int baseLine = 0;
        cv::Size label_size = cv::getTextSize(text, cv::FONT_HERSHEY_SIMPLEX, 0.4, 1, &baseLine);

        cv::Scalar txt_bk_color = color * 0.7 * 255;

        int x = obj.rect.x;
        int y = obj.rect.y + 1;
        //int y = obj.rect.y - label_size.height - baseLine;
        if (y > image.rows)
            y = image.rows;
        //if (x + label_size.width > image.cols)
            //x = image.cols - label_size.width;

        cv::rectangle(image, cv::Rect(cv::Point(x, y), cv::Size(label_size.width, label_size.height + baseLine)),
                      txt_bk_color, -1);

        cv::putText(image, text, cv::Point(x, y + label_size.height),
                    cv::FONT_HERSHEY_SIMPLEX, 0.4, txt_color, 1);
    }

    cv::imwrite("det_res.jpg", image);
    fprintf(stderr, "save vis file\n");
    /* cv::imshow("image", image); */
    /* cv::waitKey(0); */
}


void doInference(IExecutionContext& context, float* input, float* output, const int output_size, cv::Size input_shape) {
    const ICudaEngine& engine = context.getEngine();

    // Pointers to input and output device buffers to pass to engine.
    // Engine requires exactly IEngine::getNbBindings() number of buffers.
    assert(engine.getNbBindings() == 2);
    void* buffers[2];

    // In order to bind the buffers, we need to know the names of the input and output tensors.
    // Note that indices are guaranteed to be less than IEngine::getNbBindings()
    const int inputIndex = engine.getBindingIndex(INPUT_BLOB_NAME);

    assert(engine.getBindingDataType(inputIndex) == nvinfer1::DataType::kFLOAT);
    const int outputIndex = engine.getBindingIndex(OUTPUT_BLOB_NAME);
    assert(engine.getBindingDataType(outputIndex) == nvinfer1::DataType::kFLOAT);
    int mBatchSize = engine.getMaxBatchSize();

    // Create GPU buffers on device
    CHECK(cudaMalloc(&buffers[inputIndex], 3 * input_shape.height * input_shape.width * sizeof(float)));
    CHECK(cudaMalloc(&buffers[outputIndex], output_size*sizeof(float)));

    // Create stream
    cudaStream_t stream;
    CHECK(cudaStreamCreate(&stream));

    // DMA input batch data to device, infer on the batch asynchronously, and DMA output back to host
    CHECK(cudaMemcpyAsync(buffers[inputIndex], input, 3 * input_shape.height * input_shape.width * sizeof(float), cudaMemcpyHostToDevice, stream));
    context.enqueue(1, buffers, stream, nullptr);
    CHECK(cudaMemcpyAsync(output, buffers[outputIndex], output_size * sizeof(float), cudaMemcpyDeviceToHost, stream));
    cudaStreamSynchronize(stream);

    // Release stream and buffers
    cudaStreamDestroy(stream);
    CHECK(cudaFree(buffers[inputIndex]));
    CHECK(cudaFree(buffers[outputIndex]));
}

int main(int argc, char** argv) {
    cudaSetDevice(DEVICE);
    // create a model using the API directly and serialize it to a stream
    char *trtModelStream{nullptr};
    size_t size{0};

    if (argc == 4 && std::string(argv[2]) == "-i") {
        const std::string engine_file_path {argv[1]};
        std::ifstream file(engine_file_path, std::ios::binary);
        if (file.good()) {
            file.seekg(0, file.end);
            size = file.tellg();
            file.seekg(0, file.beg);
            trtModelStream = new char[size];
            assert(trtModelStream);
            file.read(trtModelStream, size);
            file.close();
        }
    } else {
        std::cerr << "arguments not right!" << std::endl;
        std::cerr << "run 'python3 yolox/deploy/trt.py -n yolox-{tiny, s, m, l, x}' to serialize model first!" << std::endl;
        std::cerr << "Then use the following command:" << std::endl;
        std::cerr << "./yolox ../model_trt.engine -i ../../../assets/dog.jpg  // deserialize file and run inference" << std::endl;
        return -1;
    }
    const std::string input_image_path {argv[3]};

    //std::vector<std::string> file_names;
    //if (read_files_in_dir(argv[2], file_names) < 0) {
        //std::cout << "read_files_in_dir failed." << std::endl;
        //return -1;
    //}

    IRuntime* runtime = createInferRuntime(gLogger);
    assert(runtime != nullptr);
    ICudaEngine* engine = runtime->deserializeCudaEngine(trtModelStream, size);
    assert(engine != nullptr); 
    IExecutionContext* context = engine->createExecutionContext();
    assert(context != nullptr);
    delete[] trtModelStream;
    auto out_dims = engine->getBindingDimensions(1);
    auto output_size = 1;
    for(int j=0;j<out_dims.nbDims;j++) {
        output_size *= out_dims.d[j];
    }
    static float* prob = new float[output_size];

    cv::Mat img = cv::imread(input_image_path);
    int img_w = img.cols;
    int img_h = img.rows;
    cv::Mat pr_img = static_resize(img);
    std::cout << "blob image" << std::endl;

    float* blob;
    blob = blobFromImage(pr_img);
    float scale = std::min(INPUT_W / (img.cols*1.0), INPUT_H / (img.rows*1.0));

    // run inference
    auto start = std::chrono::system_clock::now();
    doInference(*context, blob, prob, output_size, pr_img.size());
    auto end = std::chrono::system_clock::now();
    std::cout << std::chrono::duration_cast<std::chrono::milliseconds>(end - start).count() << "ms" << std::endl;

    std::vector<Object> objects;
    decode_outputs(prob, objects, scale, img_w, img_h);
    draw_objects(img, objects, input_image_path);
    // delete the pointer to the float
    delete blob;
    // destroy the engine
    context->destroy();
    engine->destroy();
    runtime->destroy();
    return 0;
}


================================================
FILE: demo/TensorRT/python/README.md
================================================
# YOLOX-TensorRT in Python

This tutorial includes a Python demo for TensorRT.

## Install TensorRT Toolkit

Please follow the [TensorRT Installation Guide](https://docs.nvidia.com/deeplearning/tensorrt/install-guide/index.html) and [torch2trt gitrepo](https://github.com/NVIDIA-AI-IOT/torch2trt) to install TensorRT and torch2trt.

## Convert model

YOLOX models can be easily conveted to TensorRT models using torch2trt

   If you want to convert our model, use the flag -n to specify a model name:
   ```shell
   python tools/trt.py -n <YOLOX_MODEL_NAME> -c <YOLOX_CHECKPOINT>
   ```
   For example:
   ```shell
   python tools/trt.py -n yolox-s -c your_ckpt.pth
   ```
   <YOLOX_MODEL_NAME> can be: yolox-nano, yolox-tiny. yolox-s, yolox-m, yolox-l, yolox-x.

   If you want to convert your customized model, use the flag -f to specify you exp file:
   ```shell
   python tools/trt.py -f <YOLOX_EXP_FILE> -c <YOLOX_CHECKPOINT>
   ```
   For example:
   ```shell
   python tools/trt.py -f /path/to/your/yolox/exps/yolox_s.py -c your_ckpt.pth
   ```
   *yolox_s.py* can be any exp file modified by you.

The converted model and the serialized engine file (for C++ demo) will be saved on your experiment output dir.  

## Demo

The TensorRT python demo is merged on our pytorch demo file, so you can run the pytorch demo command with ```--trt```.

```shell
python tools/demo.py image -n yolox-s --trt --save_result
```
or
```shell
python tools/demo.py image -f exps/default/yolox_s.py --trt --save_result
```



================================================
FILE: demo/ncnn/README.md
================================================
# YOLOX-ncnn

Compile files of YOLOX object detection base on [ncnn](https://github.com/Tencent/ncnn).  
YOLOX is included in ncnn now, you could also try building from ncnn, it's better.

## Acknowledgement

* [ncnn](https://github.com/Tencent/ncnn)


================================================
FILE: demo/ncnn/android/README.md
================================================
# YOLOX-Android-ncnn

Andoird app of YOLOX object detection base on [ncnn](https://github.com/Tencent/ncnn)


## Tutorial

### Step1

Download ncnn-android-vulkan.zip from [releases of ncnn](https://github.com/Tencent/ncnn/releases). This repo uses
[20210525 release](https://github.com/Tencent/ncnn/releases/download/20210525/ncnn-20210525-android-vulkan.zip) for building.

### Step2

After downloading, please extract your zip file. Then, there are two ways to finish this step:
* put your extracted directory into **app/src/main/jni**
* change the **ncnn_DIR** path in **app/src/main/jni/CMakeLists.txt** to your extracted directory

### Step3
Download example param and bin file from [onedrive](https://megvii-my.sharepoint.cn/:u:/g/personal/gezheng_megvii_com/ESXBH_GSSmFMszWJ6YG2VkQB5cWDfqVWXgk0D996jH0rpQ?e=qzEqUh) or [github](https://github.com/Megvii-BaseDetection/storage/releases/download/0.0.1/yolox_s_ncnn.tar.gz). Unzip the file to **app/src/main/assets**.

### Step4
Open this project with Android Studio, build it and enjoy!

## Reference

* [ncnn-android-yolov5](https://github.com/nihui/ncnn-android-yolov5)


================================================
FILE: demo/ncnn/android/app/build.gradle
================================================
apply plugin: 'com.android.application'

android {
    compileSdkVersion 24
    buildToolsVersion "29.0.2"

    defaultConfig {
        applicationId "com.megvii.yoloXncnn"
        archivesBaseName = "$applicationId"

        ndk {
            moduleName "ncnn"
            abiFilters "armeabi-v7a", "arm64-v8a"
        }
        minSdkVersion 24
    }

    externalNativeBuild {
        cmake {
            version "3.10.2"
            path file('src/main/jni/CMakeLists.txt')
        }
    }
}


================================================
FILE: demo/ncnn/android/app/src/main/AndroidManifest.xml
================================================
<?xml version="1.0" encoding="utf-8"?>
<manifest xmlns:android="http://schemas.android.com/apk/res/android"
      package="com.megvii.yoloXncnn"
      android:versionCode="1"
      android:versionName="1.1">
    <application android:label="@string/app_name" >
        <activity android:name="MainActivity"
                  android:label="@string/app_name">
            <intent-filter>
                <action android:name="android.intent.action.MAIN" />
                <category android:name="android.intent.category.LAUNCHER" />
            </intent-filter>
        </activity>
    </application>
</manifest> 


================================================
FILE: demo/ncnn/android/app/src/main/assets/yolox.param
================================================
7767517
220 250
Input                    images                   0 1 images
YoloV5Focus              focus                    1 1 images 503
Convolution              Conv_41                  1 1 503 877 0=32 1=3 4=1 5=1 6=3456
Swish                    Mul_43                   1 1 877 507
Convolution              Conv_44                  1 1 507 880 0=64 1=3 3=2 4=1 5=1 6=18432
Swish                    Mul_46                   1 1 880 511
Split                    splitncnn_0              1 2 511 511_splitncnn_0 511_splitncnn_1
Convolution              Conv_47                  1 1 511_splitncnn_1 883 0=32 1=1 5=1 6=2048
Swish                    Mul_49                   1 1 883 515
Split                    splitncnn_1              1 2 515 515_splitncnn_0 515_splitncnn_1
Convolution              Conv_50                  1 1 511_splitncnn_0 886 0=32 1=1 5=1 6=2048
Swish                    Mul_52                   1 1 886 519
Convolution              Conv_53                  1 1 515_splitncnn_1 889 0=32 1=1 5=1 6=1024
Swish                    Mul_55                   1 1 889 523
Convolution              Conv_56                  1 1 523 892 0=32 1=3 4=1 5=1 6=9216
Swish                    Mul_58                   1 1 892 527
BinaryOp                 Add_59                   2 1 527 515_splitncnn_0 528
Concat                   Concat_60                2 1 528 519 529
Convolution              Conv_61                  1 1 529 895 0=64 1=1 5=1 6=4096
Swish                    Mul_63                   1 1 895 533
Convolution              Conv_64                  1 1 533 898 0=128 1=3 3=2 4=1 5=1 6=73728
Swish                    Mul_66                   1 1 898 537
Split                    splitncnn_2              1 2 537 537_splitncnn_0 537_splitncnn_1
Convolution              Conv_67                  1 1 537_splitncnn_1 901 0=64 1=1 5=1 6=8192
Swish                    Mul_69                   1 1 901 541
Split                    splitncnn_3              1 2 541 541_splitncnn_0 541_splitncnn_1
Convolution              Conv_70                  1 1 537_splitncnn_0 904 0=64 1=1 5=1 6=8192
Swish                    Mul_72                   1 1 904 545
Convolution              Conv_73                  1 1 541_splitncnn_1 907 0=64 1=1 5=1 6=4096
Swish                    Mul_75                   1 1 907 549
Convolution              Conv_76                  1 1 549 910 0=64 1=3 4=1 5=1 6=36864
Swish                    Mul_78                   1 1 910 553
BinaryOp                 Add_79                   2 1 553 541_splitncnn_0 554
Split                    splitncnn_4              1 2 554 554_splitncnn_0 554_splitncnn_1
Convolution              Conv_80                  1 1 554_splitncnn_1 913 0=64 1=1 5=1 6=4096
Swish                    Mul_82                   1 1 913 558
Convolution              Conv_83                  1 1 558 916 0=64 1=3 4=1 5=1 6=36864
Swish                    Mul_85                   1 1 916 562
BinaryOp                 Add_86                   2 1 562 554_splitncnn_0 563
Split                    splitncnn_5              1 2 563 563_splitncnn_0 563_splitncnn_1
Convolution              Conv_87                  1 1 563_splitncnn_1 919 0=64 1=1 5=1 6=4096
Swish                    Mul_89                   1 1 919 567
Convolution              Conv_90                  1 1 567 922 0=64 1=3 4=1 5=1 6=36864
Swish                    Mul_92                   1 1 922 571
BinaryOp                 Add_93                   2 1 571 563_splitncnn_0 572
Concat                   Concat_94                2 1 572 545 573
Convolution              Conv_95                  1 1 573 925 0=128 1=1 5=1 6=16384
Swish                    Mul_97                   1 1 925 577
Split                    splitncnn_6              1 2 577 577_splitncnn_0 577_splitncnn_1
Convolution              Conv_98                  1 1 577_splitncnn_1 928 0=256 1=3 3=2 4=1 5=1 6=294912
Swish                    Mul_100                  1 1 928 581
Split                    splitncnn_7              1 2 581 581_splitncnn_0 581_splitncnn_1
Convolution              Conv_101                 1 1 581_splitncnn_1 931 0=128 1=1 5=1 6=32768
Swish                    Mul_103                  1 1 931 585
Split                    splitncnn_8              1 2 585 585_splitncnn_0 585_splitncnn_1
Convolution              Conv_104                 1 1 581_splitncnn_0 934 0=128 1=1 5=1 6=32768
Swish                    Mul_106                  1 1 934 589
Convolution              Conv_107                 1 1 585_splitncnn_1 937 0=128 1=1 5=1 6=16384
Swish                    Mul_109                  1 1 937 593
Convolution              Conv_110                 1 1 593 940 0=128 1=3 4=1 5=1 6=147456
Swish                    Mul_112                  1 1 940 597
BinaryOp                 Add_113                  2 1 597 585_splitncnn_0 598
Split                    splitncnn_9              1 2 598 598_splitncnn_0 598_splitncnn_1
Convolution              Conv_114                 1 1 598_splitncnn_1 943 0=128 1=1 5=1 6=16384
Swish                    Mul_116                  1 1 943 602
Convolution              Conv_117                 1 1 602 946 0=128 1=3 4=1 5=1 6=147456
Swish                    Mul_119                  1 1 946 606
BinaryOp                 Add_120                  2 1 606 598_splitncnn_0 607
Split                    splitncnn_10             1 2 607 607_splitncnn_0 607_splitncnn_1
Convolution              Conv_121                 1 1 607_splitncnn_1 949 0=128 1=1 5=1 6=16384
Swish                    Mul_123                  1 1 949 611
Convolution              Conv_124                 1 1 611 952 0=128 1=3 4=1 5=1 6=147456
Swish                    Mul_126                  1 1 952 615
BinaryOp                 Add_127                  2 1 615 607_splitncnn_0 616
Concat                   Concat_128               2 1 616 589 617
Convolution              Conv_129                 1 1 617 955 0=256 1=1 5=1 6=65536
Swish                    Mul_131                  1 1 955 621
Split                    splitncnn_11             1 2 621 621_splitncnn_0 621_splitncnn_1
Convolution              Conv_132                 1 1 621_splitncnn_1 958 0=512 1=3 3=2 4=1 5=1 6=1179648
Swish                    Mul_134                  1 1 958 625
Convolution              Conv_135                 1 1 625 961 0=256 1=1 5=1 6=131072
Swish                    Mul_137                  1 1 961 629
Split                    splitncnn_12             1 4 629 629_splitncnn_0 629_splitncnn_1 629_splitncnn_2 629_splitncnn_3
Pooling                  MaxPool_138              1 1 629_splitncnn_3 630 1=5 3=2 5=1
Pooling                  MaxPool_139              1 1 629_splitncnn_2 631 1=9 3=4 5=1
Pooling                  MaxPool_140              1 1 629_splitncnn_1 632 1=13 3=6 5=1
Concat                   Concat_141               4 1 629_splitncnn_0 630 631 632 633
Convolution              Conv_142                 1 1 633 964 0=512 1=1 5=1 6=524288
Swish                    Mul_144                  1 1 964 637
Split                    splitncnn_13             1 2 637 637_splitncnn_0 637_splitncnn_1
Convolution              Conv_145                 1 1 637_splitncnn_1 967 0=256 1=1 5=1 6=131072
Swish                    Mul_147                  1 1 967 641
Convolution              Conv_148                 1 1 637_splitncnn_0 970 0=256 1=1 5=1 6=131072
Swish                    Mul_150                  1 1 970 645
Convolution              Conv_151                 1 1 641 973 0=256 1=1 5=1 6=65536
Swish                    Mul_153                  1 1 973 649
Convolution              Conv_154                 1 1 649 976 0=256 1=3 4=1 5=1 6=589824
Swish                    Mul_156                  1 1 976 653
Concat                   Concat_157               2 1 653 645 654
Convolution              Conv_158                 1 1 654 979 0=512 1=1 5=1 6=262144
Swish                    Mul_160                  1 1 979 658
Convolution              Conv_161                 1 1 658 982 0=256 1=1 5=1 6=131072
Swish                    Mul_163                  1 1 982 662
Split                    splitncnn_14             1 2 662 662_splitncnn_0 662_splitncnn_1
Interp                   Resize_165               1 1 662_splitncnn_1 667 0=1 1=2.000000e+00 2=2.000000e+00
Concat                   Concat_166               2 1 667 621_splitncnn_0 668
Split                    splitncnn_15             1 2 668 668_splitncnn_0 668_splitncnn_1
Convolution              Conv_167                 1 1 668_splitncnn_1 985 0=128 1=1 5=1 6=65536
Swish                    Mul_169                  1 1 985 672
Convolution              Conv_170                 1 1 668_splitncnn_0 988 0=128 1=1 5=1 6=65536
Swish                    Mul_172                  1 1 988 676
Convolution              Conv_173                 1 1 672 991 0=128 1=1 5=1 6=16384
Swish                    Mul_175                  1 1 991 680
Convolution              Conv_176                 1 1 680 994 0=128 1=3 4=1 5=1 6=147456
Swish                    Mul_178                  1 1 994 684
Concat                   Concat_179               2 1 684 676 685
Convolution              Conv_180                 1 1 685 997 0=256 1=1 5=1 6=65536
Swish                    Mul_182                  1 1 997 689
Convolution              Conv_183                 1 1 689 1000 0=128 1=1 5=1 6=32768
Swish                    Mul_185                  1 1 1000 693
Split                    splitncnn_16             1 2 693 693_splitncnn_0 693_splitncnn_1
Interp                   Resize_187               1 1 693_splitncnn_1 698 0=1 1=2.000000e+00 2=2.000000e+00
Concat                   Concat_188               2 1 698 577_splitncnn_0 699
Split                    splitncnn_17             1 2 699 699_splitncnn_0 699_splitncnn_1
Convolution              Conv_189                 1 1 699_splitncnn_1 1003 0=64 1=1 5=1 6=16384
Swish                    Mul_191                  1 1 1003 703
Convolution              Conv_192                 1 1 699_splitncnn_0 1006 0=64 1=1 5=1 6=16384
Swish                    Mul_194                  1 1 1006 707
Convolution              Conv_195                 1 1 703 1009 0=64 1=1 5=1 6=4096
Swish                    Mul_197                  1 1 1009 711
Convolution              Conv_198                 1 1 711 1012 0=64 1=3 4=1 5=1 6=36864
Swish                    Mul_200                  1 1 1012 715
Concat                   Concat_201               2 1 715 707 716
Convolution              Conv_202                 1 1 716 1015 0=128 1=1 5=1 6=16384
Swish                    Mul_204                  1 1 1015 720
Split                    splitncnn_18             1 2 720 720_splitncnn_0 720_splitncnn_1
Convolution              Conv_205                 1 1 720_splitncnn_1 1018 0=128 1=3 3=2 4=1 5=1 6=147456
Swish                    Mul_207                  1 1 1018 724
Concat                   Concat_208               2 1 724 693_splitncnn_0 725
Split                    splitncnn_19             1 2 725 725_splitncnn_0 725_splitncnn_1
Convolution              Conv_209                 1 1 725_splitncnn_1 1021 0=128 1=1 5=1 6=32768
Swish                    Mul_211                  1 1 1021 729
Convolution              Conv_212                 1 1 725_splitncnn_0 1024 0=128 1=1 5=1 6=32768
Swish                    Mul_214                  1 1 1024 733
Convolution              Conv_215                 1 1 729 1027 0=128 1=1 5=1 6=16384
Swish                    Mul_217                  1 1 1027 737
Convolution              Conv_218                 1 1 737 1030 0=128 1=3 4=1 5=1 6=147456
Swish                    Mul_220                  1 1 1030 741
Concat                   Concat_221               2 1 741 733 742
Convolution              Conv_222                 1 1 742 1033 0=256 1=1 5=1 6=65536
Swish                    Mul_224                  1 1 1033 746
Split                    splitncnn_20             1 2 746 746_splitncnn_0 746_splitncnn_1
Convolution              Conv_225                 1 1 746_splitncnn_1 1036 0=256 1=3 3=2 4=1 5=1 6=589824
Swish                    Mul_227                  1 1 1036 750
Concat                   Concat_228               2 1 750 662_splitncnn_0 751
Split                    splitncnn_21             1 2 751 751_splitncnn_0 751_splitncnn_1
Convolution              Conv_229                 1 1 751_splitncnn_1 1039 0=256 1=1 5=1 6=131072
Swish                    Mul_231                  1 1 1039 755
Convolution              Conv_232                 1 1 751_splitncnn_0 1042 0=256 1=1 5=1 6=131072
Swish                    Mul_234                  1 1 1042 759
Convolution              Conv_235                 1 1 755 1045 0=256 1=1 5=1 6=65536
Swish                    Mul_237                  1 1 1045 763
Convolution              Conv_238                 1 1 763 1048 0=256 1=3 4=1 5=1 6=589824
Swish                    Mul_240                  1 1 1048 767
Concat                   Concat_241               2 1 767 759 768
Convolution              Conv_242                 1 1 768 1051 0=512 1=1 5=1 6=262144
Swish                    Mul_244                  1 1 1051 772
Convolution              Conv_245                 1 1 720_splitncnn_0 1054 0=128 1=1 5=1 6=16384
Swish                    Mul_247                  1 1 1054 776
Split                    splitncnn_22             1 2 776 776_splitncnn_0 776_splitncnn_1
Convolution              Conv_248                 1 1 776_splitncnn_1 1057 0=128 1=3 4=1 5=1 6=147456
Swish                    Mul_250                  1 1 1057 780
Convolution              Conv_251                 1 1 780 1060 0=128 1=3 4=1 5=1 6=147456
Swish                    Mul_253                  1 1 1060 784
Convolution              Conv_254                 1 1 784 797 0=80 1=1 5=1 6=10240 9=4
Convolution              Conv_255                 1 1 776_splitncnn_0 1063 0=128 1=3 4=1 5=1 6=147456
Swish                    Mul_257                  1 1 1063 789
Convolution              Conv_258                 1 1 789 1066 0=128 1=3 4=1 5=1 6=147456
Swish                    Mul_260                  1 1 1066 793
Split                    splitncnn_23             1 2 793 793_splitncnn_0 793_splitncnn_1
Convolution              Conv_261                 1 1 793_splitncnn_1 794 0=4 1=1 5=1 6=512
Convolution              Conv_262                 1 1 793_splitncnn_0 796 0=1 1=1 5=1 6=128 9=4
Concat                   Concat_265               3 1 794 796 797 798
Convolution              Conv_266                 1 1 746_splitncnn_0 1069 0=128 1=1 5=1 6=32768
Swish                    Mul_268                  1 1 1069 802
Split                    splitncnn_24             1 2 802 802_splitncnn_0 802_splitncnn_1
Convolution              Conv_269                 1 1 802_splitncnn_1 1072 0=128 1=3 4=1 5=1 6=147456
Swish                    Mul_271                  1 1 1072 806
Convolution              Conv_272                 1 1 806 1075 0=128 1=3 4=1 5=1 6=147456
Swish                    Mul_274                  1 1 1075 810
Convolution              Conv_275                 1 1 810 823 0=80 1=1 5=1 6=10240 9=4
Convolution              Conv_276                 1 1 802_splitncnn_0 1078 0=128 1=3 4=1 5=1 6=147456
Swish                    Mul_278                  1 1 1078 815
Convolution              Conv_279                 1 1 815 1081 0=128 1=3 4=1 5=1 6=147456
Swish                    Mul_281                  1 1 1081 819
Split                    splitncnn_25             1 2 819 819_splitncnn_0 819_splitncnn_1
Convolution              Conv_282                 1 1 819_splitncnn_1 820 0=4 1=1 5=1 6=512
Convolution              Conv_283                 1 1 819_splitncnn_0 822 0=1 1=1 5=1 6=128 9=4
Concat                   Concat_286               3 1 820 822 823 824
Convolution              Conv_287                 1 1 772 1084 0=128 1=1 5=1 6=65536
Swish                    Mul_289                  1 1 1084 828
Split                    splitncnn_26             1 2 828 828_splitncnn_0 828_splitncnn_1
Convolution              Conv_290                 1 1 828_splitncnn_1 1087 0=128 1=3 4=1 5=1 6=147456
Swish                    Mul_292                  1 1 1087 832
Convolution              Conv_293                 1 1 832 1090 0=128 1=3 4=1 5=1 6=147456
Swish                    Mul_295                  1 1 1090 836
Convolution              Conv_296                 1 1 836 849 0=80 1=1 5=1 6=10240 9=4
Convolution              Conv_297                 1 1 828_splitncnn_0 1093 0=128 1=3 4=1 5=1 6=147456
Swish                    Mul_299                  1 1 1093 841
Convolution              Conv_300                 1 1 841 1096 0=128 1=3 4=1 5=1 6=147456
Swish                    Mul_302                  1 1 1096 845
Split                    splitncnn_27             1 2 845 845_splitncnn_0 845_splitncnn_1
Convolution              Conv_303                 1 1 845_splitncnn_1 846 0=4 1=1 5=1 6=512
Convolution              Conv_304                 1 1 845_splitncnn_0 848 0=1 1=1 5=1 6=128 9=4
Concat                   Concat_307               3 1 846 848 849 850
Reshape                  Reshape_315              1 1 798 858 0=-1 1=85
Reshape                  Reshape_323              1 1 824 866 0=-1 1=85
Reshape                  Reshape_331              1 1 850 874 0=-1 1=85
Concat                   Concat_332               3 1 858 866 874 875 0=1
Permute                  Transpose_333            1 1 875 output 0=1


================================================
FILE: demo/ncnn/android/app/src/main/java/com/megvii/yoloXncnn/MainActivity.java
================================================
// Some code in this file is based on:
// https://github.com/nihui/ncnn-android-yolov5/blob/master/app/src/main/java/com/tencent/yolov5ncnn/MainActivity.java
// Copyright (C) 2020 THL A29 Limited, a Tencent company. All rights reserved.
// Copyright (C) Megvii, Inc. and its affiliates. All rights reserved.

package com.megvii.yoloXncnn;

import android.app.Activity;
import android.content.Intent;
import android.graphics.Bitmap;
import android.graphics.BitmapFactory;
import android.graphics.Canvas;
import android.graphics.Color;
import android.graphics.Paint;
import android.media.ExifInterface;
import android.graphics.Matrix;
import android.net.Uri;
import android.os.Bundle;
import android.util.Log;
import android.view.View;
import android.widget.Button;
import android.widget.ImageView;

import java.io.FileNotFoundException;
import java.io.InputStream;
import java.io.IOException;

public class MainActivity extends Activity
{
    private static final int SELECT_IMAGE = 1;

    private ImageView imageView;
    private Bitmap bitmap = null;
    private Bitmap yourSelectedImage = null;

    private YOLOXncnn yoloX = new YOLOXncnn();

    /** Called when the activity is first created. */
    @Override
    public void onCreate(Bundle savedInstanceState)
    {
        super.onCreate(savedInstanceState);
        setContentView(R.layout.main);

        boolean ret_init = yoloX.Init(getAssets());
        if (!ret_init)
        {
            Log.e("MainActivity", "yoloXncnn Init failed");
        }

        imageView = (ImageView) findViewById(R.id.imageView);

        Button buttonImage = (Button) findViewById(R.id.buttonImage);
        buttonImage.setOnClickListener(new View.OnClickListener() {
            @Override
            public void onClick(View arg0) {
                Intent i = new Intent(Intent.ACTION_PICK);
                i.setType("image/*");
                startActivityForResult(i, SELECT_IMAGE);
            }
        });

        Button buttonDetect = (Button) findViewById(R.id.buttonDetect);
        buttonDetect.setOnClickListener(new View.OnClickListener() {
            @Override
            public void onClick(View arg0) {
                if (yourSelectedImage == null)
                    return;
                YOLOXncnn.Obj[] objects = yoloX.Detect(yourSelectedImage, false);

                showObjects(objects);
            }
        });

        Button buttonDetectGPU = (Button) findViewById(R.id.buttonDetectGPU);
        buttonDetectGPU.setOnClickListener(new View.OnClickListener() {
            @Override
            public void onClick(View arg0) {
                if (yourSelectedImage == null)
                    return;

                YOLOXncnn.Obj[] objects = yoloX.Detect(yourSelectedImage, true);

                showObjects(objects);
            }
        });
    }

    private void showObjects(YOLOXncnn.Obj[] objects)
    {
        if (objects == null)
        {
            imageView.setImageBitmap(bitmap);
            return;
        }

        // draw objects on bitmap
        Bitmap rgba = bitmap.copy(Bitmap.Config.ARGB_8888, true);

        final int[] colors = new int[] {
            Color.rgb( 54,  67, 244),
            Color.rgb( 99,  30, 233),
            Color.rgb(176,  39,

Download .txt

gitextract_i4lyl_zt/

├── .github/
│   └── workflows/
│       ├── ci.yaml
│       └── format_check.sh
├── .gitignore
├── .pre-commit-config.yaml
├── .readthedocs.yaml
├── LICENSE
├── MANIFEST.in
├── README.md
├── SECURITY.md
├── demo/
│   ├── MegEngine/
│   │   ├── cpp/
│   │   │   ├── README.md
│   │   │   ├── build.sh
│   │   │   └── yolox.cpp
│   │   └── python/
│   │       ├── README.md
│   │       ├── build.py
│   │       ├── convert_weights.py
│   │       ├── demo.py
│   │       ├── dump.py
│   │       └── models/
│   │           ├── __init__.py
│   │           ├── darknet.py
│   │           ├── network_blocks.py
│   │           ├── yolo_fpn.py
│   │           ├── yolo_head.py
│   │           ├── yolo_pafpn.py
│   │           └── yolox.py
│   ├── ONNXRuntime/
│   │   ├── README.md
│   │   └── onnx_inference.py
│   ├── OpenVINO/
│   │   ├── README.md
│   │   ├── cpp/
│   │   │   ├── CMakeLists.txt
│   │   │   ├── README.md
│   │   │   └── yolox_openvino.cpp
│   │   └── python/
│   │       ├── README.md
│   │       └── openvino_inference.py
│   ├── TensorRT/
│   │   ├── cpp/
│   │   │   ├── CMakeLists.txt
│   │   │   ├── README.md
│   │   │   ├── logging.h
│   │   │   └── yolox.cpp
│   │   └── python/
│   │       └── README.md
│   ├── ncnn/
│   │   ├── README.md
│   │   ├── android/
│   │   │   ├── README.md
│   │   │   ├── app/
│   │   │   │   ├── build.gradle
│   │   │   │   └── src/
│   │   │   │       └── main/
│   │   │   │           ├── AndroidManifest.xml
│   │   │   │           ├── assets/
│   │   │   │           │   └── yolox.param
│   │   │   │           ├── java/
│   │   │   │           │   └── com/
│   │   │   │           │       └── megvii/
│   │   │   │           │           └── yoloXncnn/
│   │   │   │           │               ├── MainActivity.java
│   │   │   │           │               ├── YOLOXncnn.java
│   │   │   │           │               └── yoloXncnn.java
│   │   │   │           ├── jni/
│   │   │   │           │   ├── CMakeLists.txt
│   │   │   │           │   └── yoloXncnn_jni.cpp
│   │   │   │           └── res/
│   │   │   │               ├── layout/
│   │   │   │               │   └── main.xml
│   │   │   │               └── values/
│   │   │   │                   └── strings.xml
│   │   │   ├── build.gradle
│   │   │   ├── gradle/
│   │   │   │   └── wrapper/
│   │   │   │       ├── gradle-wrapper.jar
│   │   │   │       └── gradle-wrapper.properties
│   │   │   ├── gradlew
│   │   │   ├── gradlew.bat
│   │   │   └── settings.gradle
│   │   └── cpp/
│   │       ├── README.md
│   │       └── yolox.cpp
│   └── nebullvm/
│       ├── README.md
│       └── nebullvm_optimization.py
├── docs/
│   ├── .gitignore
│   ├── Makefile
│   ├── _static/
│   │   └── css/
│   │       └── custom.css
│   ├── assignment_visualization.md
│   ├── cache.md
│   ├── conf.py
│   ├── freeze_module.md
│   ├── index.rst
│   ├── manipulate_training_image_size.md
│   ├── mlflow_integration.md
│   ├── model_zoo.md
│   ├── quick_run.md
│   ├── requirements-doc.txt
│   ├── train_custom_data.md
│   └── updates_note.md
├── exps/
│   ├── default/
│   │   ├── __init__.py
│   │   ├── yolov3.py
│   │   ├── yolox_l.py
│   │   ├── yolox_m.py
│   │   ├── yolox_nano.py
│   │   ├── yolox_s.py
│   │   ├── yolox_tiny.py
│   │   └── yolox_x.py
│   └── example/
│       ├── custom/
│       │   ├── nano.py
│       │   └── yolox_s.py
│       └── yolox_voc/
│           └── yolox_voc_s.py
├── hubconf.py
├── requirements.txt
├── setup.cfg
├── setup.py
├── tests/
│   ├── __init__.py
│   └── utils/
│       └── test_model_utils.py
├── tools/
│   ├── __init__.py
│   ├── demo.py
│   ├── eval.py
│   ├── export_onnx.py
│   ├── export_torchscript.py
│   ├── train.py
│   ├── trt.py
│   └── visualize_assign.py
└── yolox/
    ├── __init__.py
    ├── core/
    │   ├── __init__.py
    │   ├── launch.py
    │   └── trainer.py
    ├── data/
    │   ├── __init__.py
    │   ├── data_augment.py
    │   ├── data_prefetcher.py
    │   ├── dataloading.py
    │   ├── datasets/
    │   │   ├── __init__.py
    │   │   ├── coco.py
    │   │   ├── coco_classes.py
    │   │   ├── datasets_wrapper.py
    │   │   ├── mosaicdetection.py
    │   │   ├── voc.py
    │   │   └── voc_classes.py
    │   └── samplers.py
    ├── evaluators/
    │   ├── __init__.py
    │   ├── coco_evaluator.py
    │   ├── voc_eval.py
    │   └── voc_evaluator.py
    ├── exp/
    │   ├── __init__.py
    │   ├── base_exp.py
    │   ├── build.py
    │   ├── default/
    │   │   └── __init__.py
    │   └── yolox_base.py
    ├── layers/
    │   ├── __init__.py
    │   ├── cocoeval/
    │   │   ├── cocoeval.cpp
    │   │   └── cocoeval.h
    │   ├── fast_coco_eval_api.py
    │   └── jit_ops.py
    ├── models/
    │   ├── __init__.py
    │   ├── build.py
    │   ├── darknet.py
    │   ├── losses.py
    │   ├── network_blocks.py
    │   ├── yolo_fpn.py
    │   ├── yolo_head.py
    │   ├── yolo_pafpn.py
    │   └── yolox.py
    ├── tools/
    │   └── __init__.py
    └── utils/
        ├── __init__.py
        ├── allreduce_norm.py
        ├── boxes.py
        ├── checkpoint.py
        ├── compat.py
        ├── demo_utils.py
        ├── dist.py
        ├── ema.py
        ├── logger.py
        ├── lr_scheduler.py
        ├── metric.py
        ├── mlflow_logger.py
        ├── model_utils.py
        ├── setup_env.py
        └── visualize.py

Download .txt

SYMBOL INDEX (601 symbols across 85 files)

FILE: demo/MegEngine/cpp/yolox.cpp
  function static_resize (line 26) | cv::Mat static_resize(cv::Mat &img) {
  function blobFromImage (line 37) | void blobFromImage(cv::Mat &img, float *blob_data) {
  type Object (line 51) | struct Object {
  type GridAndStride (line 57) | struct GridAndStride {
  function generate_grids_and_stride (line 63) | static void
  function generate_yolox_proposals (line 76) | static void generate_yolox_proposals(std::vector<GridAndStride> grid_str...
  function intersection_area (line 118) | static inline float intersection_area(const Object &a, const Object &b) {
  function qsort_descent_inplace (line 123) | static void qsort_descent_inplace(std::vector<Object> &faceobjects, int ...
  function qsort_descent_inplace (line 160) | static void qsort_descent_inplace(std::vector<Object> &objects) {
  function nms_sorted_bboxes (line 167) | static void nms_sorted_bboxes(const std::vector<Object> &faceobjects,
  function decode_outputs (line 198) | static void decode_outputs(const float *prob, std::vector<Object> &objects,
  function draw_objects (line 264) | static void draw_objects(const cv::Mat &bgr,
  function make_callback_copy (line 347) | cg::ComputingGraph::OutputSpecItem make_callback_copy(SymbolVar dev,
  function main (line 353) | int main(int argc, char *argv[]) {

FILE: demo/MegEngine/python/build.py
  function build_yolox (line 13) | def build_yolox(name="yolox-s"):
  function build_and_load (line 49) | def build_and_load(weight_file, name="yolox-s"):

FILE: demo/MegEngine/python/convert_weights.py
  function make_parser (line 10) | def make_parser():
  function numpy_weights (line 23) | def numpy_weights(weight_file):
  function map_weights (line 33) | def map_weights(weight_file, output_file):
  function main (line 57) | def main():

FILE: demo/MegEngine/python/demo.py
  function make_parser (line 23) | def make_parser():
  function get_image_list (line 44) | def get_image_list(path):
  function postprocess (line 55) | def postprocess(prediction, num_classes, conf_thre=0.7, nms_thre=0.45):
  class Predictor (line 92) | class Predictor(object):
    method __init__ (line 93) | def __init__(
    method inference (line 111) | def inference(self, img):
    method visual (line 136) | def visual(self, output, img_info, cls_conf=0.35):
  function image_demo (line 153) | def image_demo(predictor, vis_folder, path, current_time, save_result):
  function imageflow_demo (line 175) | def imageflow_demo(predictor, vis_folder, current_time, args):
  function main (line 206) | def main(args):

FILE: demo/MegEngine/python/dump.py
  function make_parser (line 14) | def make_parser():
  function dump_static_graph (line 24) | def dump_static_graph(model, graph_name="model.mge"):
  function main (line 44) | def main(args):

FILE: demo/MegEngine/python/models/darknet.py
  class Darknet (line 10) | class Darknet(M.Module):
    method __init__ (line 14) | def __init__(
    method make_group_layer (line 49) | def make_group_layer(self, in_channels: int, num_blocks: int, stride: ...
    method make_spp_block (line 56) | def make_spp_block(self, filters_list, in_filters):
    method forward (line 72) | def forward(self, x):
  class CSPDarknet (line 87) | class CSPDarknet(M.Module):
    method __init__ (line 89) | def __init__(
    method forward (line 142) | def forward(self, x):

FILE: demo/MegEngine/python/models/network_blocks.py
  class UpSample (line 9) | class UpSample(M.Module):
    method __init__ (line 11) | def __init__(self, scale_factor=2, mode="bilinear"):
    method forward (line 16) | def forward(self, x):
  class SiLU (line 20) | class SiLU(M.Module):
    method forward (line 24) | def forward(x):
  function get_activation (line 28) | def get_activation(name="silu"):
  class BaseConv (line 40) | class BaseConv(M.Module):
    method __init__ (line 43) | def __init__(self, in_channels, out_channels, ksize, stride, groups=1,...
    method forward (line 59) | def forward(self, x):
    method fuseforward (line 62) | def fuseforward(self, x):
  class DWConv (line 66) | class DWConv(M.Module):
    method __init__ (line 68) | def __init__(self, in_channels, out_channels, ksize, stride=1, act="si...
    method forward (line 79) | def forward(self, x):
  class Bottleneck (line 84) | class Bottleneck(M.Module):
    method __init__ (line 86) | def __init__(
    method forward (line 97) | def forward(self, x):
  class ResLayer (line 104) | class ResLayer(M.Module):
    method __init__ (line 106) | def __init__(self, in_channels: int):
    method forward (line 112) | def forward(self, x):
  class SPPBottleneck (line 117) | class SPPBottleneck(M.Module):
    method __init__ (line 119) | def __init__(self, in_channels, out_channels, kernel_sizes=(5, 9, 13),...
    method forward (line 127) | def forward(self, x):
  class CSPLayer (line 134) | class CSPLayer(M.Module):
    method __init__ (line 137) | def __init__(
    method forward (line 159) | def forward(self, x):
  class Focus (line 167) | class Focus(M.Module):
    method __init__ (line 170) | def __init__(self, in_channels, out_channels, ksize=1, stride=1, act="...
    method forward (line 174) | def forward(self, x):

FILE: demo/MegEngine/python/models/yolo_fpn.py
  class YOLOFPN (line 12) | class YOLOFPN(M.Module):
    method __init__ (line 17) | def __init__(
    method _make_cbl (line 36) | def _make_cbl(self, _in, _out, ks):
    method _make_embedding (line 39) | def _make_embedding(self, filters_list, in_filters):
    method forward (line 53) | def forward(self, inputs):

FILE: demo/MegEngine/python/models/yolo_head.py
  function meshgrid (line 11) | def meshgrid(x, y):
  class YOLOXHead (line 21) | class YOLOXHead(M.Module):
    method __init__ (line 22) | def __init__(
    method forward (line 127) | def forward(self, xin, labels=None, imgs=None):
    method get_output_and_grid (line 156) | def get_output_and_grid(self, output, k, stride, dtype):
    method decode_outputs (line 177) | def decode_outputs(self, outputs):

FILE: demo/MegEngine/python/models/yolo_pafpn.py
  class YOLOPAFPN (line 12) | class YOLOPAFPN(M.Module):
    method __init__ (line 17) | def __init__(
    method forward (line 78) | def forward(self, input):

FILE: demo/MegEngine/python/models/yolox.py
  class YOLOX (line 11) | class YOLOX(M.Module):
    method __init__ (line 18) | def __init__(self, backbone=None, head=None):
    method forward (line 28) | def forward(self, x):

FILE: demo/ONNXRuntime/onnx_inference.py
  function make_parser (line 17) | def make_parser():

FILE: demo/OpenVINO/cpp/yolox_openvino.cpp
  function static_resize (line 28) | cv::Mat static_resize(cv::Mat& img) {
  function blobFromImage (line 41) | void blobFromImage(cv::Mat& img, Blob::Ptr& blob){
  type Object (line 70) | struct Object
  type GridAndStride (line 77) | struct GridAndStride
  function generate_grids_and_stride (line 84) | static void generate_grids_and_stride(const int target_w, const int targ...
  function generate_yolox_proposals (line 101) | static void generate_yolox_proposals(std::vector<GridAndStride> grid_str...
  function intersection_area (line 147) | static inline float intersection_area(const Object& a, const Object& b)
  function qsort_descent_inplace (line 153) | static void qsort_descent_inplace(std::vector<Object>& faceobjects, int ...
  function qsort_descent_inplace (line 191) | static void qsort_descent_inplace(std::vector<Object>& objects)
  function nms_sorted_bboxes (line 199) | static void nms_sorted_bboxes(const std::vector<Object>& faceobjects, st...
  function decode_outputs (line 234) | static void decode_outputs(const float* prob, std::vector<Object>& objec...
  function draw_objects (line 355) | static void draw_objects(const cv::Mat& bgr, const std::vector<Object>& ...
  function main (line 419) | int main(int argc, char* argv[]) {

FILE: demo/OpenVINO/python/openvino_inference.py
  function parse_args (line 22) | def parse_args() -> argparse.Namespace:
  function main (line 78) | def main():

FILE: demo/TensorRT/cpp/logging.h
  function class (line 31) | class LogStreamConsumerBuffer : public std::stringbuf
  function class (line 106) | class LogStreamConsumerBase
  function std (line 160) | static std::string severityPrefix(Severity severity)
  function TestResult (line 213) | enum class TestResult
  function LogStreamConsumer (line 447) | inline LogStreamConsumer LOG_VERBOSE(const Logger& logger)
  function LogStreamConsumer (line 459) | inline LogStreamConsumer LOG_INFO(const Logger& logger)
  function LogStreamConsumer (line 471) | inline LogStreamConsumer LOG_WARN(const Logger& logger)
  function LogStreamConsumer (line 483) | inline LogStreamConsumer LOG_ERROR(const Logger& logger)
  function LogStreamConsumer (line 496) | inline LogStreamConsumer LOG_FATAL(const Logger& logger)

FILE: demo/TensorRT/cpp/yolox.cpp
  function static_resize (line 38) | cv::Mat static_resize(cv::Mat& img) {
  type Object (line 50) | struct Object
  type GridAndStride (line 57) | struct GridAndStride
  function generate_grids_and_stride (line 64) | static void generate_grids_and_stride(std::vector<int>& strides, std::ve...
  function intersection_area (line 80) | static inline float intersection_area(const Object& a, const Object& b)
  function qsort_descent_inplace (line 86) | static void qsort_descent_inplace(std::vector<Object>& faceobjects, int ...
  function qsort_descent_inplace (line 123) | static void qsort_descent_inplace(std::vector<Object>& objects)
  function nms_sorted_bboxes (line 131) | static void nms_sorted_bboxes(const std::vector<Object>& faceobjects, st...
  function generate_yolox_proposals (line 166) | static void generate_yolox_proposals(std::vector<GridAndStride> grid_str...
  function decode_outputs (line 230) | static void decode_outputs(float* prob, std::vector<Object>& objects, fl...
  function draw_objects (line 356) | static void draw_objects(const cv::Mat& bgr, const std::vector<Object>& ...
  function doInference (line 420) | void doInference(IExecutionContext& context, float* input, float* output...
  function main (line 457) | int main(int argc, char** argv) {

FILE: demo/ncnn/android/app/src/main/java/com/megvii/yoloXncnn/MainActivity.java
  class MainActivity (line 28) | public class MainActivity extends Activity
    method onCreate (line 39) | @Override
    method showObjects (line 89) | private void showObjects(YOLOXncnn.Obj[] objects)
    method onActivityResult (line 166) | @Override
    method decodeUri (line 192) | private Bitmap decodeUri(Uri selectedImage) throws FileNotFoundException

FILE: demo/ncnn/android/app/src/main/java/com/megvii/yoloXncnn/YOLOXncnn.java
  class YOLOXncnn (line 8) | public class YOLOXncnn
    method Init (line 10) | public native boolean Init(AssetManager mgr);
    class Obj (line 12) | public class Obj
    method Detect (line 22) | public native Obj[] Detect(Bitmap bitmap, boolean use_gpu);

FILE: demo/ncnn/android/app/src/main/java/com/megvii/yoloXncnn/yoloXncnn.java
  class YOLOXncnn (line 8) | public class YOLOXncnn
    method Init (line 10) | public native boolean Init(AssetManager mgr);
    class Obj (line 12) | public class Obj
    method Detect (line 22) | public native Obj[] Detect(Bitmap bitmap, boolean use_gpu);

FILE: demo/ncnn/android/app/src/main/jni/yoloXncnn_jni.cpp
  class YoloV5Focus (line 25) | class YoloV5Focus : public ncnn::Layer
    method YoloV5Focus (line 28) | YoloV5Focus()
    method forward (line 33) | virtual int forward(const ncnn::Mat& bottom_blob, ncnn::Mat& top_blob,...
  type Object (line 73) | struct Object
  type GridAndStride (line 83) | struct GridAndStride
  function intersection_area (line 90) | static inline float intersection_area(const Object& a, const Object& b)
  function qsort_descent_inplace (line 104) | static void qsort_descent_inplace(std::vector<Object>& faceobjects, int ...
  function qsort_descent_inplace (line 141) | static void qsort_descent_inplace(std::vector<Object>& faceobjects)
  function nms_sorted_bboxes (line 149) | static void nms_sorted_bboxes(const std::vector<Object>& faceobjects, st...
  function generate_grids_and_stride (line 183) | static void generate_grids_and_stride(const int target_size, std::vector...
  function generate_yolox_proposals (line 198) | static void generate_yolox_proposals(std::vector<GridAndStride> grid_str...
  function JNIEXPORT (line 261) | JNIEXPORT jint JNI_OnLoad(JavaVM* vm, void* reserved)
  function JNIEXPORT (line 270) | JNIEXPORT void JNI_OnUnload(JavaVM* vm, void* reserved)
  function JNIEXPORT (line 278) | JNIEXPORT jboolean JNICALL Java_com_megvii_yoloXncnn_YOLOXncnn_Init(JNIE...
  function JNIEXPORT (line 334) | JNIEXPORT jobjectArray JNICALL Java_com_megvii_yoloXncnn_YOLOXncnn_Detec...

FILE: demo/ncnn/cpp/yolox.cpp
  class YoloV5Focus (line 35) | class YoloV5Focus : public ncnn::Layer
    method YoloV5Focus (line 38) | YoloV5Focus()
    method forward (line 43) | virtual int forward(const ncnn::Mat& bottom_blob, ncnn::Mat& top_blob,...
  type Object (line 83) | struct Object
  type GridAndStride (line 90) | struct GridAndStride
  function intersection_area (line 97) | static inline float intersection_area(const Object& a, const Object& b)
  function qsort_descent_inplace (line 103) | static void qsort_descent_inplace(std::vector<Object>& faceobjects, int ...
  function qsort_descent_inplace (line 140) | static void qsort_descent_inplace(std::vector<Object>& objects)
  function nms_sorted_bboxes (line 148) | static void nms_sorted_bboxes(const std::vector<Object>& faceobjects, st...
  function generate_grids_and_stride (line 182) | static void generate_grids_and_stride(const int target_size, std::vector...
  function generate_yolox_proposals (line 202) | static void generate_yolox_proposals(std::vector<GridAndStride> grid_str...
  function detect_yolox (line 249) | static int detect_yolox(const cv::Mat& bgr, std::vector<Object>& objects)
  function draw_objects (line 344) | static void draw_objects(const cv::Mat& bgr, const std::vector<Object>& ...
  function main (line 393) | int main(int argc, char** argv)

FILE: docs/conf.py
  class GithubURLDomain (line 33) | class GithubURLDomain(Domain):
    method resolve_any_xref (line 43) | def resolve_any_xref(self, env, fromdocname, builder, target, node, co...
  function autodoc_skip_member (line 280) | def autodoc_skip_member(app, what, name, obj, skip, options):
  function setup (line 373) | def setup(app):

FILE: exps/default/yolov3.py
  class Exp (line 12) | class Exp(MyExp):
    method __init__ (line 13) | def __init__(self):
    method get_model (line 19) | def get_model(self, sublinear=False):

FILE: exps/default/yolox_l.py
  class Exp (line 10) | class Exp(MyExp):
    method __init__ (line 11) | def __init__(self):

FILE: exps/default/yolox_m.py
  class Exp (line 10) | class Exp(MyExp):
    method __init__ (line 11) | def __init__(self):

FILE: exps/default/yolox_nano.py
  class Exp (line 12) | class Exp(MyExp):
    method __init__ (line 13) | def __init__(self):
    method get_model (line 25) | def get_model(self, sublinear=False):

FILE: exps/default/yolox_s.py
  class Exp (line 10) | class Exp(MyExp):
    method __init__ (line 11) | def __init__(self):

FILE: exps/default/yolox_tiny.py
  class Exp (line 10) | class Exp(MyExp):
    method __init__ (line 11) | def __init__(self):

FILE: exps/default/yolox_x.py
  class Exp (line 10) | class Exp(MyExp):
    method __init__ (line 11) | def __init__(self):

FILE: exps/example/custom/nano.py
  class Exp (line 12) | class Exp(MyExp):
    method __init__ (line 13) | def __init__(self):
    method get_model (line 31) | def get_model(self, sublinear=False):

FILE: exps/example/custom/yolox_s.py
  class Exp (line 9) | class Exp(MyExp):
    method __init__ (line 10) | def __init__(self):

FILE: exps/example/yolox_voc/yolox_voc_s.py
  class Exp (line 8) | class Exp(MyExp):
    method __init__ (line 9) | def __init__(self):
    method get_dataset (line 24) | def get_dataset(self, cache: bool, cache_type: str = "ram"):
    method get_eval_dataset (line 39) | def get_eval_dataset(self, **kwargs):
    method get_evaluator (line 50) | def get_evaluator(self, batch_size, is_distributed, testdev=False, leg...

FILE: setup.py
  function get_package_dir (line 17) | def get_package_dir():
  function get_install_requirements (line 25) | def get_install_requirements():
  function get_yolox_version (line 32) | def get_yolox_version():
  function get_long_description (line 41) | def get_long_description():
  function get_ext_modules (line 47) | def get_ext_modules():
  function get_cmd_class (line 57) | def get_cmd_class():

FILE: tests/utils/test_model_utils.py
  class TestModelUtils (line 14) | class TestModelUtils(unittest.TestCase):
    method setUp (line 16) | def setUp(self):
    method test_model_state_adjust_status (line 19) | def test_model_state_adjust_status(self):
    method test_model_effect_adjust_status (line 50) | def test_model_effect_adjust_status(self):
    method test_freeze_module (line 72) | def test_freeze_module(self):

FILE: tools/demo.py
  function make_parser (line 22) | def make_parser():
  function get_image_list (line 89) | def get_image_list(path):
  class Predictor (line 100) | class Predictor(object):
    method __init__ (line 101) | def __init__(
    method inference (line 132) | def inference(self, img):
    method visual (line 168) | def visual(self, output, img_info, cls_conf=0.35):
  function image_demo (line 187) | def image_demo(predictor, vis_folder, path, current_time, save_result):
  function imageflow_demo (line 209) | def imageflow_demo(predictor, vis_folder, current_time, args):
  function main (line 244) | def main(exp, args):

FILE: tools/eval.py
  function make_parser (line 27) | def make_parser():
  function main (line 116) | def main(exp, args, num_gpu):

FILE: tools/export_onnx.py
  function make_parser (line 17) | def make_parser():
  function main (line 62) | def main():

FILE: tools/export_torchscript.py
  function make_parser (line 14) | def make_parser():
  function main (line 46) | def main():

FILE: tools/train.py
  function make_parser (line 18) | def make_parser():
  function main (line 101) | def main(exp: Exp, args):

FILE: tools/trt.py
  function make_parser (line 17) | def make_parser():
  function main (line 39) | def main():

FILE: tools/visualize_assign.py
  class AssignVisualizer (line 20) | class AssignVisualizer(Trainer):
    method __init__ (line 22) | def __init__(self, exp: Exp, args):
    method train_one_iter (line 28) | def train_one_iter(self):
    method after_train (line 54) | def after_train(self):
  function assign_vis_parser (line 58) | def assign_vis_parser():
  function main (line 65) | def main(exp: Exp, args):

FILE: yolox/core/launch.py
  function _find_free_port (line 24) | def _find_free_port():
  function launch (line 39) | def launch(
  function _distributed_worker (line 101) | def _distributed_worker(

FILE: yolox/core/trainer.py
  class Trainer (line 37) | class Trainer:
    method __init__ (line 38) | def __init__(self, exp: Exp, args):
    method train (line 74) | def train(self):
    method train_in_epoch (line 84) | def train_in_epoch(self):
    method train_in_iter (line 90) | def train_in_iter(self):
    method train_one_iter (line 96) | def train_one_iter(self):
    method before_train (line 131) | def before_train(self):
    method after_train (line 199) | def after_train(self):
    method before_epoch (line 217) | def before_epoch(self):
    method after_epoch (line 232) | def after_epoch(self):
    method before_iter (line 239) | def before_iter(self):
    method after_iter (line 242) | def after_iter(self):
    method progress_in_iter (line 308) | def progress_in_iter(self):
    method resume_train (line 311) | def resume_train(self, model):
    method evaluate_and_save_model (line 346) | def evaluate_and_save_model(self):
    method save_ckpt (line 399) | def save_ckpt(self, ckpt_name, update_best_ckpt=False, ap=None):

FILE: yolox/data/data_augment.py
  function augment_hsv (line 21) | def augment_hsv(img, hgain=5, sgain=30, vgain=30):
  function get_aug_params (line 34) | def get_aug_params(value, center=0):
  function get_affine_matrix (line 46) | def get_affine_matrix(
  function apply_affine_to_bboxes (line 82) | def apply_affine_to_bboxes(targets, target_size, M, scale):
  function random_affine (line 114) | def random_affine(
  function _mirror (line 134) | def _mirror(image, boxes, prob=0.5):
  function preproc (line 142) | def preproc(img, input_size, swap=(2, 0, 1)):
  class TrainTransform (line 161) | class TrainTransform:
    method __init__ (line 162) | def __init__(self, max_labels=50, flip_prob=0.5, hsv_prob=1.0):
    method __call__ (line 167) | def __call__(self, image, targets, input_dim):
  class ValTransform (line 213) | class ValTransform:
    method __init__ (line 231) | def __init__(self, swap=(2, 0, 1), legacy=False):
    method __call__ (line 236) | def __call__(self, img, res, input_size):

FILE: yolox/data/data_prefetcher.py
  class DataPrefetcher (line 8) | class DataPrefetcher:
    method __init__ (line 16) | def __init__(self, loader):
    method preload (line 23) | def preload(self):
    method next (line 35) | def next(self):
    method _input_cuda_for_image (line 46) | def _input_cuda_for_image(self):
    method _record_stream_for_image (line 50) | def _record_stream_for_image(input):

FILE: yolox/data/dataloading.py
  function get_yolox_datadir (line 18) | def get_yolox_datadir():
  class DataLoader (line 32) | class DataLoader(torchDataLoader):
    method __init__ (line 40) | def __init__(self, *args, **kwargs):
    method close_mosaic (line 88) | def close_mosaic(self):
  function list_collate (line 92) | def list_collate(batch):
  function worker_init_reset_seed (line 109) | def worker_init_reset_seed(worker_id):

FILE: yolox/data/datasets/coco.py
  function remove_useless_info (line 15) | def remove_useless_info(coco):
  class COCODataset (line 34) | class COCODataset(CacheDataset):
    method __init__ (line 39) | def __init__(
    method __len__ (line 86) | def __len__(self):
    method _load_coco_annotations (line 89) | def _load_coco_annotations(self):
    method load_anno_from_ids (line 92) | def load_anno_from_ids(self, id_):
    method load_anno (line 130) | def load_anno(self, index):
    method load_resized_img (line 133) | def load_resized_img(self, index):
    method load_image (line 143) | def load_image(self, index):
    method read_img (line 154) | def read_img(self, index):
    method pull_item (line 157) | def pull_item(self, index):
    method __getitem__ (line 165) | def __getitem__(self, index):

FILE: yolox/data/datasets/datasets_wrapper.py
  class ConcatDataset (line 22) | class ConcatDataset(torchConcatDataset):
    method __init__ (line 23) | def __init__(self, datasets):
    method pull_item (line 29) | def pull_item(self, idx):
  class MixConcatDataset (line 44) | class MixConcatDataset(torchConcatDataset):
    method __init__ (line 45) | def __init__(self, datasets):
    method __getitem__ (line 51) | def __getitem__(self, index):
  class Dataset (line 72) | class Dataset(torchDataset):
    method __init__ (line 80) | def __init__(self, input_dimension, mosaic=True):
    method input_dim (line 86) | def input_dim(self):
    method mosaic_getitem (line 100) | def mosaic_getitem(getitem_fn):
  class CacheDataset (line 127) | class CacheDataset(Dataset, metaclass=ABCMeta):
    method __init__ (line 147) | def __init__(
    method __del__ (line 176) | def __del__(self):
    method read_img (line 181) | def read_img(self, index):
    method cache_images (line 190) | def cache_images(
    method cal_cache_occupy (line 262) | def cal_cache_occupy(self, num_imgs):
  function cache_read_img (line 272) | def cache_read_img(use_cache=True):

FILE: yolox/data/datasets/mosaicdetection.py
  function get_mosaic_coordinate (line 16) | def get_mosaic_coordinate(mosaic_image, mosaic_index, xc, yc, w, h, inpu...
  class MosaicDetection (line 37) | class MosaicDetection(Dataset):
    method __init__ (line 40) | def __init__(
    method __len__ (line 75) | def __len__(self):
    method __getitem__ (line 79) | def __getitem__(self, idx):
    method mixup (line 162) | def mixup(self, origin_img, origin_labels, input_dim):

FILE: yolox/data/datasets/voc.py
  class AnnotationTransform (line 23) | class AnnotationTransform(object):
    method __init__ (line 37) | def __init__(self, class_to_ind=None, keep_difficult=True):
    method __call__ (line 43) | def __call__(self, target):
  class VOCDetection (line 82) | class VOCDetection(CacheDataset):
    method __init__ (line 101) | def __init__(
    method __len__ (line 151) | def __len__(self):
    method _load_coco_annotations (line 154) | def _load_coco_annotations(self):
    method load_anno_from_ids (line 157) | def load_anno_from_ids(self, index):
    method load_anno (line 171) | def load_anno(self, index):
    method load_resized_img (line 174) | def load_resized_img(self, index):
    method load_image (line 185) | def load_image(self, index):
    method read_img (line 193) | def read_img(self, index):
    method pull_item (line 196) | def pull_item(self, index):
    method __getitem__ (line 213) | def __getitem__(self, index):
    method evaluate_detections (line 221) | def evaluate_detections(self, all_boxes, output_dir=None):
    method _get_voc_results_file_template (line 245) | def _get_voc_results_file_template(self):
    method _write_voc_results_file (line 253) | def _write_voc_results_file(self, all_boxes):
    method _do_python_eval (line 278) | def _do_python_eval(self, output_dir="output", iou=0.5):

FILE: yolox/data/samplers.py
  class YoloBatchSampler (line 14) | class YoloBatchSampler(torchBatchSampler):
    method __init__ (line 21) | def __init__(self, *args, mosaic=True, **kwargs):
    method __iter__ (line 25) | def __iter__(self):
  class InfiniteSampler (line 30) | class InfiniteSampler(Sampler):
    method __init__ (line 41) | def __init__(
    method __iter__ (line 69) | def __iter__(self):
    method _infinite_indices (line 75) | def _infinite_indices(self):
    method __len__ (line 84) | def __len__(self):

FILE: yolox/evaluators/coco_evaluator.py
  function per_class_AR_table (line 31) | def per_class_AR_table(coco_eval, class_names=COCO_CLASSES, headers=["cl...
  function per_class_AP_table (line 54) | def per_class_AP_table(coco_eval, class_names=COCO_CLASSES, headers=["cl...
  class COCOEvaluator (line 79) | class COCOEvaluator:
    method __init__ (line 85) | def __init__(
    method evaluate (line 116) | def evaluate(
    method convert_to_coco_format (line 207) | def convert_to_coco_format(self, outputs, info_imgs, ids, return_outpu...
    method evaluate_prediction (line 255) | def evaluate_prediction(self, data_dict, statistics):

FILE: yolox/evaluators/voc_eval.py
  function parse_rec (line 14) | def parse_rec(filename):
  function voc_ap (line 36) | def voc_ap(rec, prec, use_07_metric=False):
  function voc_eval (line 70) | def voc_eval(

FILE: yolox/evaluators/voc_evaluator.py
  class VOCEvaluator (line 19) | class VOCEvaluator:
    method __init__ (line 24) | def __init__(self, dataloader, img_size, confthre, nmsthre, num_classes):
    method evaluate (line 41) | def evaluate(
    method convert_to_voc_format (line 120) | def convert_to_voc_format(self, outputs, info_imgs, ids):
    method evaluate_prediction (line 140) | def evaluate_prediction(self, data_dict, statistics):

FILE: yolox/exp/base_exp.py
  class BaseExp (line 16) | class BaseExp(metaclass=ABCMeta):
    method __init__ (line 19) | def __init__(self):
    method get_model (line 27) | def get_model(self) -> Module:
    method get_dataset (line 31) | def get_dataset(self, cache: bool = False, cache_type: str = "ram"):
    method get_data_loader (line 35) | def get_data_loader(
    method get_optimizer (line 41) | def get_optimizer(self, batch_size: int) -> torch.optim.Optimizer:
    method get_lr_scheduler (line 45) | def get_lr_scheduler(
    method get_evaluator (line 51) | def get_evaluator(self):
    method eval (line 55) | def eval(self, model, evaluator, weights):
    method __repr__ (line 58) | def __repr__(self):
    method merge (line 67) | def merge(self, cfg_list):

FILE: yolox/exp/build.py
  function get_exp_by_file (line 10) | def get_exp_by_file(exp_file):
  function get_exp_by_name (line 20) | def get_exp_by_name(exp_name):
  function get_exp (line 27) | def get_exp(exp_file=None, exp_name=None):

FILE: yolox/exp/default/__init__.py
  class _ExpFinder (line 17) | class _ExpFinder(abc.MetaPathFinder):
    method find_spec (line 19) | def find_spec(self, name, path, target=None):

FILE: yolox/exp/yolox_base.py
  class Exp (line 16) | class Exp(BaseExp):
    method __init__ (line 17) | def __init__(self):
    method get_model (line 111) | def get_model(self):
    method get_dataset (line 131) | def get_dataset(self, cache: bool = False, cache_type: str = "ram"):
    method get_data_loader (line 155) | def get_data_loader(self, batch_size, is_distributed, no_aug=False, ca...
    method random_resize (line 224) | def random_resize(self, data_loader, epoch, rank, is_distributed):
    method preprocess (line 245) | def preprocess(self, inputs, targets, tsize):
    method get_optimizer (line 256) | def get_optimizer(self, batch_size):
    method get_lr_scheduler (line 284) | def get_lr_scheduler(self, lr, iters_per_epoch):
    method get_eval_dataset (line 299) | def get_eval_dataset(self, **kwargs):
    method get_eval_loader (line 312) | def get_eval_loader(self, batch_size, is_distributed, **kwargs):
    method get_evaluator (line 333) | def get_evaluator(self, batch_size, is_distributed, testdev=False, leg...
    method get_trainer (line 346) | def get_trainer(self, args):
    method eval (line 352) | def eval(self, model, evaluator, is_distributed, half=False, return_ou...
  function check_exp_value (line 356) | def check_exp_value(exp: Exp):

FILE: yolox/layers/cocoeval/cocoeval.cpp
  type COCOeval (line 10) | namespace COCOeval {
    function SortInstancesByDetectionScore (line 16) | void SortInstancesByDetectionScore(
    function SortInstancesByIgnore (line 32) | void SortInstancesByIgnore(
    function MatchDetectionsToGroundTruth (line 59) | void MatchDetectionsToGroundTruth(
    function EvaluateImages (line 140) | std::vector<ImageEvaluation> EvaluateImages(
    function list_to_vec (line 201) | std::vector<T> list_to_vec(const py::list& l) {
    function BuildSortedDetectionList (line 221) | int BuildSortedDetectionList(
    function ComputePrecisionRecallCurve (line 282) | void ComputePrecisionRecallCurve(
    function Accumulate (line 370) | py::dict Accumulate(

FILE: yolox/layers/cocoeval/cocoeval.h
  function namespace (line 12) | namespace COCOeval {

FILE: yolox/layers/fast_coco_eval_api.py
  class COCOeval_opt (line 17) | class COCOeval_opt(COCOeval):
    method __init__ (line 22) | def __init__(self, *args, **kwargs):
    method evaluate (line 26) | def evaluate(self):
    method accumulate (line 125) | def accumulate(self):

FILE: yolox/layers/jit_ops.py
  class JitOp (line 14) | class JitOp:
    method __init__ (line 23) | def __init__(self, name):
    method absolute_name (line 26) | def absolute_name(self) -> str:
    method sources (line 30) | def sources(self) -> List:
    method include_dirs (line 39) | def include_dirs(self) -> List:
    method define_macros (line 48) | def define_macros(self) -> List:
    method cxx_args (line 52) | def cxx_args(self) -> List:
    method nvcc_args (line 57) | def nvcc_args(self) -> List:
    method build_op (line 68) | def build_op(self):
    method load (line 80) | def load(self, verbose=True):
    method jit_load (line 89) | def jit_load(self, verbose=True):
    method clear_dynamic_library (line 114) | def clear_dynamic_library(self):
  class FastCOCOEvalOp (line 120) | class FastCOCOEvalOp(JitOp):
    method __init__ (line 122) | def __init__(self, name="fast_cocoeval"):
    method absolute_name (line 125) | def absolute_name(self):
    method sources (line 128) | def sources(self):
    method include_dirs (line 137) | def include_dirs(self):

FILE: yolox/models/build.py
  function create_yolox_model (line 32) | def create_yolox_model(name: str, pretrained: bool = True, num_classes: ...
  function yolox_nano (line 82) | def yolox_nano(pretrained: bool = True, num_classes: int = 80, device: s...
  function yolox_tiny (line 86) | def yolox_tiny(pretrained: bool = True, num_classes: int = 80, device: s...
  function yolox_s (line 90) | def yolox_s(pretrained: bool = True, num_classes: int = 80, device: str ...
  function yolox_m (line 94) | def yolox_m(pretrained: bool = True, num_classes: int = 80, device: str ...
  function yolox_l (line 98) | def yolox_l(pretrained: bool = True, num_classes: int = 80, device: str ...
  function yolox_x (line 102) | def yolox_x(pretrained: bool = True, num_classes: int = 80, device: str ...
  function yolov3 (line 106) | def yolov3(pretrained: bool = True, num_classes: int = 80, device: str =...
  function yolox_custom (line 110) | def yolox_custom(ckpt_path: str = None, exp_path: str = None, device: st...

FILE: yolox/models/darknet.py
  class Darknet (line 10) | class Darknet(nn.Module):
    method __init__ (line 14) | def __init__(
    method make_group_layer (line 59) | def make_group_layer(self, in_channels: int, num_blocks: int, stride: ...
    method make_spp_block (line 66) | def make_spp_block(self, filters_list, in_filters):
    method forward (line 82) | def forward(self, x):
  class CSPDarknet (line 97) | class CSPDarknet(nn.Module):
    method __init__ (line 98) | def __init__(
    method forward (line 167) | def forward(self, x):

FILE: yolox/models/losses.py
  class IOUloss (line 9) | class IOUloss(nn.Module):
    method __init__ (line 10) | def __init__(self, reduction="none", loss_type="iou"):
    method forward (line 15) | def forward(self, pred, target):

FILE: yolox/models/network_blocks.py
  class SiLU (line 9) | class SiLU(nn.Module):
    method forward (line 13) | def forward(x):
  function get_activation (line 17) | def get_activation(name="silu", inplace=True):
  class BaseConv (line 29) | class BaseConv(nn.Module):
    method __init__ (line 32) | def __init__(
    method forward (line 50) | def forward(self, x):
    method fuseforward (line 53) | def fuseforward(self, x):
  class DWConv (line 57) | class DWConv(nn.Module):
    method __init__ (line 60) | def __init__(self, in_channels, out_channels, ksize, stride=1, act="si...
    method forward (line 74) | def forward(self, x):
  class Bottleneck (line 79) | class Bottleneck(nn.Module):
    method __init__ (line 81) | def __init__(
    method forward (line 97) | def forward(self, x):
  class ResLayer (line 104) | class ResLayer(nn.Module):
    method __init__ (line 107) | def __init__(self, in_channels: int):
    method forward (line 117) | def forward(self, x):
  class SPPBottleneck (line 122) | class SPPBottleneck(nn.Module):
    method __init__ (line 125) | def __init__(
    method forward (line 140) | def forward(self, x):
  class CSPLayer (line 147) | class CSPLayer(nn.Module):
    method __init__ (line 150) | def __init__(
    method forward (line 180) | def forward(self, x):
  class Focus (line 188) | class Focus(nn.Module):
    method __init__ (line 191) | def __init__(self, in_channels, out_channels, ksize=1, stride=1, act="...
    method forward (line 195) | def forward(self, x):

FILE: yolox/models/yolo_fpn.py
  class YOLOFPN (line 12) | class YOLOFPN(nn.Module):
    method __init__ (line 17) | def __init__(
    method _make_cbl (line 38) | def _make_cbl(self, _in, _out, ks):
    method _make_embedding (line 41) | def _make_embedding(self, filters_list, in_filters):
    method load_pretrained_model (line 53) | def load_pretrained_model(self, filename="./weights/darknet53.mix.pth"):
    method forward (line 59) | def forward(self, inputs):

FILE: yolox/models/yolo_head.py
  class YOLOXHead (line 18) | class YOLOXHead(nn.Module):
    method __init__ (line 19) | def __init__(
    method initialize_biases (line 131) | def initialize_biases(self, prior_prob):
    method forward (line 142) | def forward(self, xin, labels=None, imgs=None):
    method get_output_and_grid (line 215) | def get_output_and_grid(self, output, k, stride, dtype):
    method decode_outputs (line 235) | def decode_outputs(self, outputs, dtype):
    method get_losses (line 255) | def get_losses(
    method get_l1_target (line 415) | def get_l1_target(self, l1_target, gt, stride, x_shifts, y_shifts, eps...
    method get_assignments (line 423) | def get_assignments(
    method get_geometry_constraint (line 513) | def get_geometry_constraint(
    method simota_matching (line 544) | def simota_matching(self, cost, pair_wise_ious, gt_classes, num_gt, fg...
    method visualize_assign_result (line 578) | def visualize_assign_result(self, xin, labels=None, imgs=None, save_pr...

FILE: yolox/models/yolo_pafpn.py
  class YOLOPAFPN (line 12) | class YOLOPAFPN(nn.Module):
    method __init__ (line 17) | def __init__(
    method forward (line 83) | def forward(self, input):

FILE: yolox/models/yolox.py
  class YOLOX (line 11) | class YOLOX(nn.Module):
    method __init__ (line 18) | def __init__(self, backbone=None, head=None):
    method forward (line 28) | def forward(self, x, targets=None):
    method visualize (line 50) | def visualize(self, x, targets, save_prefix="assign_vis_"):

FILE: yolox/tools/__init__.py
  class _PathFinder (line 16) | class _PathFinder(abc.MetaPathFinder):
    method find_spec (line 18) | def find_spec(self, name, path, target=None):

FILE: yolox/utils/allreduce_norm.py
  function get_async_norm_states (line 32) | def get_async_norm_states(module):
  function pyobj2tensor (line 41) | def pyobj2tensor(pyobj, device="cuda"):
  function tensor2pyobj (line 47) | def tensor2pyobj(tensor):
  function _get_reduce_op (line 52) | def _get_reduce_op(op_name):
  function all_reduce (line 59) | def all_reduce(py_dict, op="sum", group=None):
  function all_reduce_norm (line 97) | def all_reduce_norm(module):

FILE: yolox/utils/boxes.py
  function filter_box (line 21) | def filter_box(output, scale_range):
  function postprocess (line 32) | def postprocess(prediction, num_classes, conf_thre=0.7, nms_thre=0.45, c...
  function bboxes_iou (line 79) | def bboxes_iou(bboxes_a, bboxes_b, xyxy=True):
  function matrix_iou (line 105) | def matrix_iou(a, b):
  function adjust_box_anns (line 118) | def adjust_box_anns(bbox, scale_ratio, padw, padh, w_max, h_max):
  function xyxy2xywh (line 124) | def xyxy2xywh(bboxes):
  function xyxy2cxcywh (line 130) | def xyxy2cxcywh(bboxes):
  function cxcywh2xyxy (line 138) | def cxcywh2xyxy(bboxes):

FILE: yolox/utils/checkpoint.py
  function load_ckpt (line 11) | def load_ckpt(model, ckpt):
  function save_checkpoint (line 36) | def save_checkpoint(state, is_best, save_dir, model_name=""):

FILE: yolox/utils/compat.py
  function meshgrid (line 11) | def meshgrid(*tensors):

FILE: yolox/utils/demo_utils.py
  function random_color (line 15) | def random_color():
  function visualize_assign (line 19) | def visualize_assign(img, boxes, coords, match_results, save_name=None) ...
  function mkdir (line 52) | def mkdir(path):
  function nms (line 57) | def nms(boxes, scores, nms_thr):
  function multiclass_nms (line 87) | def multiclass_nms(boxes, scores, nms_thr, score_thr, class_agnostic=True):
  function multiclass_nms_class_aware (line 96) | def multiclass_nms_class_aware(boxes, scores, nms_thr, score_thr):
  function multiclass_nms_class_agnostic (line 120) | def multiclass_nms_class_agnostic(boxes, scores, nms_thr, score_thr):
  function demo_postprocess (line 139) | def demo_postprocess(outputs, img_size, p6=False):

FILE: yolox/utils/dist.py
  function get_num_devices (line 41) | def get_num_devices():
  function wait_for_the_master (line 52) | def wait_for_the_master(local_rank: int = None):
  function synchronize (line 75) | def synchronize():
  function get_world_size (line 89) | def get_world_size() -> int:
  function get_rank (line 97) | def get_rank() -> int:
  function get_local_rank (line 105) | def get_local_rank() -> int:
  function get_local_size (line 120) | def get_local_size() -> int:
  function is_main_process (line 132) | def is_main_process() -> bool:
  function _get_global_gloo_group (line 137) | def _get_global_gloo_group():
  function _serialize_to_tensor (line 148) | def _serialize_to_tensor(data, group):
  function _pad_to_largest_tensor (line 165) | def _pad_to_largest_tensor(tensor, group):
  function all_gather (line 195) | def all_gather(data, group=None):
  function gather (line 233) | def gather(data, dst=0, group=None):
  function shared_random_seed (line 277) | def shared_random_seed():
  function time_synchronized (line 290) | def time_synchronized():

FILE: yolox/utils/ema.py
  function is_parallel (line 13) | def is_parallel(model):
  class ModelEMA (line 22) | class ModelEMA:
    method __init__ (line 33) | def __init__(self, model, decay=0.9999, updates=0):
    method update (line 48) | def update(self, model):

FILE: yolox/utils/logger.py
  function get_caller_name (line 16) | def get_caller_name(depth=0):
  class StreamToLoguru (line 33) | class StreamToLoguru:
    method __init__ (line 38) | def __init__(self, level="INFO", caller_names=("apex", "pycocotools")):
    method write (line 49) | def write(self, buf):
    method flush (line 59) | def flush(self):
    method isatty (line 63) | def isatty(self):
    method fileno (line 71) | def fileno(self):
  function redirect_sys_output (line 76) | def redirect_sys_output(log_level="INFO"):
  function setup_logger (line 82) | def setup_logger(save_dir, distributed_rank=0, filename="log.txt", mode=...
  class WandbLogger (line 117) | class WandbLogger(object):
    method __init__ (line 129) | def __init__(self,
    method run (line 222) | def run(self):
    method _log_validation_set (line 236) | def _log_validation_set(self, val_dataset):
    method _convert_prediction_format (line 266) | def _convert_prediction_format(self, predictions):
    method log_metrics (line 303) | def log_metrics(self, metrics, step=None):
    method log_images (line 320) | def log_images(self, predictions):
    method save_checkpoint (line 391) | def save_checkpoint(self, save_dir, model_name, is_best, metadata=None):
    method finish (line 426) | def finish(self):
    method initialize_wandb_logger (line 430) | def initialize_wandb_logger(cls, args, exp, val_dataset):

FILE: yolox/utils/lr_scheduler.py
  class LRScheduler (line 9) | class LRScheduler:
    method __init__ (line 10) | def __init__(self, name, lr, iters_per_epoch, total_epochs, **kwargs):
    method update_lr (line 33) | def update_lr(self, iters):
    method _get_lr_func (line 36) | def _get_lr_func(self, name):
  function cos_lr (line 97) | def cos_lr(lr, total_iters, iters):
  function warm_cos_lr (line 103) | def warm_cos_lr(lr, total_iters, warmup_total_iters, warmup_lr_start, it...
  function yolox_warm_cos_lr (line 121) | def yolox_warm_cos_lr(
  function yolox_semi_warm_cos_lr (line 151) | def yolox_semi_warm_cos_lr(
  function multistep_lr (line 201) | def multistep_lr(lr, milestones, gamma, iters):

FILE: yolox/utils/metric.py
  function get_total_and_free_memory_in_Mb (line 24) | def get_total_and_free_memory_in_Mb(cuda_device):
  function occupy_mem (line 36) | def occupy_mem(cuda_device, mem_ratio=0.9):
  function gpu_mem_usage (line 48) | def gpu_mem_usage():
  function mem_usage (line 56) | def mem_usage():
  class AverageMeter (line 65) | class AverageMeter:
    method __init__ (line 70) | def __init__(self, window_size=50):
    method update (line 75) | def update(self, value):
    method median (line 81) | def median(self):
    method avg (line 86) | def avg(self):
    method global_avg (line 92) | def global_avg(self):
    method latest (line 96) | def latest(self):
    method total (line 100) | def total(self):
    method reset (line 103) | def reset(self):
    method clear (line 108) | def clear(self):
  class MeterBuffer (line 112) | class MeterBuffer(defaultdict):
    method __init__ (line 115) | def __init__(self, window_size=20):
    method reset (line 119) | def reset(self):
    method get_filtered_meter (line 123) | def get_filtered_meter(self, filter_key="time"):
    method update (line 126) | def update(self, values=None, **kwargs):
    method clear_meters (line 135) | def clear_meters(self):

FILE: yolox/utils/mlflow_logger.py
  class MlflowLogger (line 29) | class MlflowLogger:
    method __init__ (line 33) | def __init__(self):
    method is_required_library_available (line 60) | def is_required_library_available(self):
    method flatten_dict (line 73) | def flatten_dict(self, d: MutableMapping, parent_key: str = "", delimi...
    method setup (line 97) | def setup(self, args, exp):
    method log_params_mlflow (line 217) | def log_params_mlflow(self, params_dict):
    method convert_exp_todict (line 256) | def convert_exp_todict(self, exp):
    method on_log (line 274) | def on_log(self, args, exp, step, logs):
    method on_train_end (line 309) | def on_train_end(self, args, file_name, metadata):
    method save_log_file (line 334) | def save_log_file(self, args, file_name):
    method save_checkpoints (line 349) | def save_checkpoints(self, args, exp, file_name, epoch, metadata, upda...
    method mlflow_save_pyfunc_model (line 382) | def mlflow_save_pyfunc_model(self, metadata, artifact_path, mlflow_out...
    method __del__ (line 407) | def __del__(self):

FILE: yolox/utils/model_utils.py
  function get_model_info (line 22) | def get_model_info(model: nn.Module, tsize: Sequence[int]) -> str:
  function fuse_conv_and_bn (line 35) | def fuse_conv_and_bn(conv: nn.Conv2d, bn: nn.BatchNorm2d) -> nn.Conv2d:
  function fuse_model (line 80) | def fuse_model(model: nn.Module) -> nn.Module:
  function replace_module (line 99) | def replace_module(module, replaced_module_type, new_module_type, replac...
  function freeze_module (line 131) | def freeze_module(module: nn.Module, name=None) -> nn.Module:
  function adjust_status (line 160) | def adjust_status(module: nn.Module, training: bool = False) -> nn.Module:

FILE: yolox/utils/setup_env.py
  function configure_nccl (line 16) | def configure_nccl():
  function configure_omp (line 28) | def configure_omp(num_threads=1):
  function configure_module (line 51) | def configure_module(ulimit_value=8192):

FILE: yolox/utils/visualize.py
  function vis (line 11) | def vis(img, boxes, scores, cls_ids, conf=0.5, class_names=None):

Download .json

Condensed preview — 154 files, each showing path, character count, and a content snippet. Download the .json file or copy for the full structured content (661K chars).

[
  {
    "path": ".github/workflows/ci.yaml",
    "chars": 1274,
    "preview": "# This is a basic workflow to help you get started with Actions\n\nname: CI\n\n# Controls when the action will run. Triggers"
  },
  {
    "path": ".github/workflows/format_check.sh",
    "chars": 313,
    "preview": "#!/bin/bash -e\n\nset -e\n\nexport PYTHONPATH=$PWD:$PYTHONPATH\n\nflake8 yolox exps tools || flake8_ret=$?\nif [ \"$flake8_ret\" "
  },
  {
    "path": ".gitignore",
    "chars": 3235,
    "preview": "### Linux ###\n*~\n\n# user experiments directory\nYOLOX_outputs/\ndatasets/\n# do not ignore datasets under yolox/data\n!*yolo"
  },
  {
    "path": ".pre-commit-config.yaml",
    "chars": 1118,
    "preview": "repos:\n  - repo: https://github.com/pycqa/flake8\n    rev: 3.8.3\n    hooks:\n      - id: flake8\n  - repo: https://github.c"
  },
  {
    "path": ".readthedocs.yaml",
    "chars": 537,
    "preview": "# .readthedocs.yaml\n# Read the Docs configuration file\n# See https://docs.readthedocs.io/en/stable/config-file/v2.html f"
  },
  {
    "path": "LICENSE",
    "chars": 11371,
    "preview": "                                 Apache License\n                           Version 2.0, January 2004\n                   "
  },
  {
    "path": "MANIFEST.in",
    "chars": 75,
    "preview": "include requirements.txt\nrecursive-include yolox *.cpp *.h *.cu *.cuh *.cc\n"
  },
  {
    "path": "README.md",
    "chars": 13038,
    "preview": "<div align=\"center\"><img src=\"assets/logo.png\" width=\"350\"></div>\n<img src=\"assets/demo.png\" >\n\n## Introduction\nYOLOX is"
  },
  {
    "path": "SECURITY.md",
    "chars": 1478,
    "preview": "# Security Policy\n\n## Reporting a Vulnerability\n\n### Types of Security Issues\nWe actively monitor:  \n- Code vulnerabilit"
  },
  {
    "path": "demo/MegEngine/cpp/README.md",
    "chars": 6369,
    "preview": "# YOLOX-CPP-MegEngine\n\nCpp file compile of YOLOX object detection base on [MegEngine](https://github.com/MegEngine/MegEn"
  },
  {
    "path": "demo/MegEngine/cpp/build.sh",
    "chars": 2514,
    "preview": "#!/usr/bin/env bash\nset -e\n\nif [ -z $CXX ];then\n    echo \"please export you c++ toolchain to CXX\"\n    echo \"for example:"
  },
  {
    "path": "demo/MegEngine/cpp/yolox.cpp",
    "chars": 15952,
    "preview": "// Copyright (C) 2018-2021 Intel Corporation\n// SPDX-License-Identifier: Apache-2.0\n\n#include \"megbrain/gopt/inference.h"
  },
  {
    "path": "demo/MegEngine/python/README.md",
    "chars": 868,
    "preview": "# YOLOX-Python-MegEngine\n\nPython version of YOLOX object detection base on [MegEngine](https://github.com/MegEngine/MegE"
  },
  {
    "path": "demo/MegEngine/python/build.py",
    "chars": 1506,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n\nimport megengine as mge\nimport megengine.module as M\n\nfrom models.yolo_fp"
  },
  {
    "path": "demo/MegEngine/python/convert_weights.py",
    "chars": 1661,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\nimport argparse\nfrom collections import OrderedDict\n\nimport megengine as m"
  },
  {
    "path": "demo/MegEngine/python/demo.py",
    "chars": 8122,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii, Inc. and its affiliates.\n\nimport argparse\nimport o"
  },
  {
    "path": "demo/MegEngine/python/dump.py",
    "chars": 1269,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii, Inc. and its affiliates.\n\nimport argparse\n\nimport "
  },
  {
    "path": "demo/MegEngine/python/models/__init__.py",
    "chars": 259,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii Inc. All rights reserved.\n\nfrom .darknet import CSP"
  },
  {
    "path": "demo/MegEngine/python/models/darknet.py",
    "chars": 5684,
    "preview": "#!/usr/bin/env python3\n# -*- encoding: utf-8 -*-\n# Copyright (c) Megvii Inc. All rights reserved.\n\nimport megengine.modu"
  },
  {
    "path": "demo/MegEngine/python/models/network_blocks.py",
    "chars": 5911,
    "preview": "#!/usr/bin/env python3\n# -*- encoding: utf-8 -*-\n# Copyright (c) Megvii Inc. All rights reserved.\n\nimport megengine.func"
  },
  {
    "path": "demo/MegEngine/python/models/yolo_fpn.py",
    "chars": 2220,
    "preview": "#!/usr/bin/env python3\n# -*- encoding: utf-8 -*-\n# Copyright (c) Megvii Inc. All rights reserved.\n\nimport megengine.func"
  },
  {
    "path": "demo/MegEngine/python/models/yolo_head.py",
    "chars": 6507,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii Inc. All rights reserved.\n\nimport megengine.functio"
  },
  {
    "path": "demo/MegEngine/python/models/yolo_pafpn.py",
    "chars": 3521,
    "preview": "#!/usr/bin/env python3\n# -*- encoding: utf-8 -*-\n# Copyright (c) Megvii Inc. All rights reserved.\n\nimport megengine.modu"
  },
  {
    "path": "demo/MegEngine/python/models/yolox.py",
    "chars": 917,
    "preview": "#!/usr/bin/env python3\n# -*- encoding: utf-8 -*-\n# Copyright (c) Megvii Inc. All rights reserved.\n\nimport megengine.modu"
  },
  {
    "path": "demo/ONNXRuntime/README.md",
    "chars": 3177,
    "preview": "## YOLOX-ONNXRuntime in Python\n\nThis doc introduces how to convert your pytorch model into onnx, and how to run an onnxr"
  },
  {
    "path": "demo/ONNXRuntime/onnx_inference.py",
    "chars": 2536,
    "preview": "#!/usr/bin/env python3\n# Copyright (c) Megvii, Inc. and its affiliates.\n\nimport argparse\nimport os\n\nimport cv2\nimport nu"
  },
  {
    "path": "demo/OpenVINO/README.md",
    "chars": 68,
    "preview": "## YOLOX for OpenVINO\n\n* [C++ Demo](./cpp)\n* [Python Demo](./python)"
  },
  {
    "path": "demo/OpenVINO/cpp/CMakeLists.txt",
    "chars": 483,
    "preview": "cmake_minimum_required(VERSION 3.4.1)\nset(CMAKE_CXX_STANDARD 14)\n\nproject(yolox_openvino_demo)\n\nfind_package(OpenCV REQU"
  },
  {
    "path": "demo/OpenVINO/cpp/README.md",
    "chars": 3259,
    "preview": "# YOLOX-OpenVINO in C++\n\nThis tutorial includes a C++ demo for OpenVINO, as well as some converted models.\n\n### Download"
  },
  {
    "path": "demo/OpenVINO/cpp/yolox_openvino.cpp",
    "chars": 18794,
    "preview": "// Copyright (C) 2018-2021 Intel Corporation\n// SPDX-License-Identifier: Apache-2.0\n//\n\n#include <iterator>\n#include <me"
  },
  {
    "path": "demo/OpenVINO/python/README.md",
    "chars": 3248,
    "preview": "# YOLOX-OpenVINO in Python\n\nThis tutorial includes a Python demo for OpenVINO, as well as some converted models.\n\n### Do"
  },
  {
    "path": "demo/OpenVINO/python/openvino_inference.py",
    "chars": 5684,
    "preview": "#!/usr/bin/env python3\n# -*- coding: utf-8 -*-\n# Copyright (C) 2018-2021 Intel Corporation\n# SPDX-License-Identifier: Ap"
  },
  {
    "path": "demo/TensorRT/cpp/CMakeLists.txt",
    "chars": 1108,
    "preview": "cmake_minimum_required(VERSION 2.6)\n\nproject(yolox)\n\nadd_definitions(-std=c++11)\n\noption(CUDA_USE_STATIC_CUDA_RUNTIME OF"
  },
  {
    "path": "demo/TensorRT/cpp/README.md",
    "chars": 1610,
    "preview": "# YOLOX-TensorRT in C++\n\nAs YOLOX models are easy to convert to tensorrt using [torch2trt gitrepo](https://github.com/NV"
  },
  {
    "path": "demo/TensorRT/cpp/logging.h",
    "chars": 16559,
    "preview": "/*\n * Copyright (c) 2019, NVIDIA CORPORATION. All rights reserved.\n *\n * Licensed under the Apache License, Version 2.0 "
  },
  {
    "path": "demo/TensorRT/cpp/yolox.cpp",
    "chars": 17099,
    "preview": "#include <fstream>\n#include <iostream>\n#include <sstream>\n#include <numeric>\n#include <chrono>\n#include <vector>\n#includ"
  },
  {
    "path": "demo/TensorRT/python/README.md",
    "chars": 1511,
    "preview": "# YOLOX-TensorRT in Python\n\nThis tutorial includes a Python demo for TensorRT.\n\n## Install TensorRT Toolkit\n\nPlease foll"
  },
  {
    "path": "demo/ncnn/README.md",
    "chars": 251,
    "preview": "# YOLOX-ncnn\n\nCompile files of YOLOX object detection base on [ncnn](https://github.com/Tencent/ncnn).  \nYOLOX is includ"
  },
  {
    "path": "demo/ncnn/android/README.md",
    "chars": 1127,
    "preview": "# YOLOX-Android-ncnn\n\nAndoird app of YOLOX object detection base on [ncnn](https://github.com/Tencent/ncnn)\n\n\n## Tutoria"
  },
  {
    "path": "demo/ncnn/android/app/build.gradle",
    "chars": 496,
    "preview": "apply plugin: 'com.android.application'\n\nandroid {\n    compileSdkVersion 24\n    buildToolsVersion \"29.0.2\"\n\n    defaultC"
  },
  {
    "path": "demo/ncnn/android/app/src/main/AndroidManifest.xml",
    "chars": 613,
    "preview": "<?xml version=\"1.0\" encoding=\"utf-8\"?>\n<manifest xmlns:android=\"http://schemas.android.com/apk/res/android\"\n      packag"
  },
  {
    "path": "demo/ncnn/android/app/src/main/assets/yolox.param",
    "chars": 17475,
    "preview": "7767517\n220 250\nInput                    images                   0 1 images\nYoloV5Focus              focus             "
  },
  {
    "path": "demo/ncnn/android/app/src/main/java/com/megvii/yoloXncnn/MainActivity.java",
    "chars": 8188,
    "preview": "// Some code in this file is based on:\n// https://github.com/nihui/ncnn-android-yolov5/blob/master/app/src/main/java/com"
  },
  {
    "path": "demo/ncnn/android/app/src/main/java/com/megvii/yoloXncnn/YOLOXncnn.java",
    "chars": 566,
    "preview": "// Copyright (C) Megvii, Inc. and its affiliates. All rights reserved.\n\npackage com.megvii.yoloXncnn;\n\nimport android.co"
  },
  {
    "path": "demo/ncnn/android/app/src/main/java/com/megvii/yoloXncnn/yoloXncnn.java",
    "chars": 566,
    "preview": "// Copyright (C) Megvii, Inc. and its affiliates. All rights reserved.\n\npackage com.megvii.yoloXncnn;\n\nimport android.co"
  },
  {
    "path": "demo/ncnn/android/app/src/main/jni/CMakeLists.txt",
    "chars": 290,
    "preview": "project(yoloXncnn)\n\ncmake_minimum_required(VERSION 3.4.1)\n\nset(ncnn_DIR ${CMAKE_SOURCE_DIR}/ncnn-20210525-android-vulkan"
  },
  {
    "path": "demo/ncnn/android/app/src/main/jni/yoloXncnn_jni.cpp",
    "chars": 14041,
    "preview": "// Some code in this file is based on:\n// https://github.com/nihui/ncnn-android-yolov5/blob/master/app/src/main/jni/yolo"
  },
  {
    "path": "demo/ncnn/android/app/src/main/res/layout/main.xml",
    "chars": 1116,
    "preview": "<?xml version=\"1.0\" encoding=\"utf-8\"?>\n<LinearLayout xmlns:android=\"http://schemas.android.com/apk/res/android\"\n    andr"
  },
  {
    "path": "demo/ncnn/android/app/src/main/res/values/strings.xml",
    "chars": 111,
    "preview": "<?xml version=\"1.0\" encoding=\"utf-8\"?>\n<resources>\n    <string name=\"app_name\">yoloXncnn</string>\n</resources>\n"
  },
  {
    "path": "demo/ncnn/android/build.gradle",
    "chars": 335,
    "preview": "// Top-level build file where you can add configuration options common to all sub-projects/modules.\nbuildscript {\n    re"
  },
  {
    "path": "demo/ncnn/android/gradle/wrapper/gradle-wrapper.properties",
    "chars": 232,
    "preview": "#Sun Aug 25 10:34:48 CST 2019\ndistributionBase=GRADLE_USER_HOME\ndistributionPath=wrapper/dists\nzipStoreBase=GRADLE_USER_"
  },
  {
    "path": "demo/ncnn/android/gradlew",
    "chars": 5296,
    "preview": "#!/usr/bin/env sh\n\n##############################################################################\n##\n##  Gradle start up"
  },
  {
    "path": "demo/ncnn/android/gradlew.bat",
    "chars": 2176,
    "preview": "@if \"%DEBUG%\" == \"\" @echo off\n@rem ##########################################################################\n@rem\n@rem "
  },
  {
    "path": "demo/ncnn/android/settings.gradle",
    "chars": 15,
    "preview": "include ':app'\n"
  },
  {
    "path": "demo/ncnn/cpp/README.md",
    "chars": 4571,
    "preview": "# YOLOX-CPP-ncnn\n\nCpp file compile of YOLOX object detection base on [ncnn](https://github.com/Tencent/ncnn).  \n\n## Tuto"
  },
  {
    "path": "demo/ncnn/cpp/yolox.cpp",
    "chars": 12826,
    "preview": "// This file is wirtten base on the following file:\n// https://github.com/Tencent/ncnn/blob/master/examples/yolov5.cpp\n/"
  },
  {
    "path": "demo/nebullvm/README.md",
    "chars": 5027,
    "preview": "# **Accelerate YOLOX inference with nebullvm in Python**\n\nThis document shows how to accelerate YOLOX inference time wit"
  },
  {
    "path": "demo/nebullvm/nebullvm_optimization.py",
    "chars": 1450,
    "preview": "import torch\nimport time\nfrom nebullvm.api.functions import optimize_model # Install DL compilers\nfrom yolox.exp import "
  },
  {
    "path": "docs/.gitignore",
    "chars": 6,
    "preview": "_build"
  },
  {
    "path": "docs/Makefile",
    "chars": 629,
    "preview": "# Minimal makefile for Sphinx documentation\n# Copyright (c) Facebook, Inc. and its affiliates.\n\n# You can set these vari"
  },
  {
    "path": "docs/_static/css/custom.css",
    "chars": 556,
    "preview": "/*\n * Copyright (c) Facebook, Inc. and its affiliates.\n * some extra css to make markdown look similar between github/sp"
  },
  {
    "path": "docs/assignment_visualization.md",
    "chars": 1531,
    "preview": "# Visualize label assignment\n\nThis tutorial explains how to visualize your label asssignment result when training with Y"
  },
  {
    "path": "docs/cache.md",
    "chars": 4931,
    "preview": "# Cache Custom Data\n\nThe caching feature is specifically tailored for users with ample memory resources. However, we sti"
  },
  {
    "path": "docs/conf.py",
    "chars": 12831,
    "preview": "# -*- coding: utf-8 -*-\n# Code are based on\n# https://github.com/facebookresearch/detectron2/blob/master/docs/conf.py\n# "
  },
  {
    "path": "docs/freeze_module.md",
    "chars": 1106,
    "preview": "# Freeze module\n\nThis page guide users to freeze module in YOLOX.  \nExp controls everything in YOLOX, so let's start fro"
  },
  {
    "path": "docs/index.rst",
    "chars": 547,
    "preview": "\nWelcome to YOLOX's documentation!\n======================================\n\n.. image:: ../assets/logo.png\n\n.. toctree::\n "
  },
  {
    "path": "docs/manipulate_training_image_size.md",
    "chars": 1798,
    "preview": "# Manipulating Your Training Image Size\n\nThis tutorial explains how to control your image size when training on your own"
  },
  {
    "path": "docs/mlflow_integration.md",
    "chars": 3159,
    "preview": "## MLFlow Integration\nYOLOX now supports MLFlow integration. MLFlow is an open-source platform for managing the end-to-e"
  },
  {
    "path": "docs/model_zoo.md",
    "chars": 5064,
    "preview": "# Model Zoo\n\n## Standard Models.\n\n|Model |size |mAP<sup>val<br>0.5:0.95 |mAP<sup>test<br>0.5:0.95 | Speed V100<br>(ms) |"
  },
  {
    "path": "docs/quick_run.md",
    "chars": 3919,
    "preview": "\n# Get Started\n\n## 1.Installation\n\nStep1. Install YOLOX.\n```shell\ngit clone git@github.com:Megvii-BaseDetection/YOLOX.gi"
  },
  {
    "path": "docs/requirements-doc.txt",
    "chars": 230,
    "preview": "docutils==0.16\n# https://github.com/sphinx-doc/sphinx/commit/7acd3ada3f38076af7b2b5c9f3b60bb9c2587a3d\nsphinx==3.2.0\nreco"
  },
  {
    "path": "docs/train_custom_data.md",
    "chars": 7138,
    "preview": "# Train Custom Data\n\nThis page explains how to train your own custom data with YOLOX.\n\nWe take an example of fine-tuning"
  },
  {
    "path": "docs/updates_note.md",
    "chars": 1803,
    "preview": "\n# Updates notes\n\n## 【2021/08/19】\n\n* Support image caching for faster training, which requires large system RAM. \n* Remo"
  },
  {
    "path": "exps/default/__init__.py",
    "chars": 95,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii, Inc. and its affiliates.\n"
  },
  {
    "path": "exps/default/yolov3.py",
    "chars": 998,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii, Inc. and its affiliates.\n\nimport os\n\nimport torch."
  },
  {
    "path": "exps/default/yolox_l.py",
    "chars": 355,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii, Inc. and its affiliates.\n\nimport os\n\nfrom yolox.ex"
  },
  {
    "path": "exps/default/yolox_m.py",
    "chars": 357,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii, Inc. and its affiliates.\n\nimport os\n\nfrom yolox.ex"
  },
  {
    "path": "exps/default/yolox_nano.py",
    "chars": 1517,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii, Inc. and its affiliates.\n\nimport os\n\nimport torch."
  },
  {
    "path": "exps/default/yolox_s.py",
    "chars": 357,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii, Inc. and its affiliates.\n\nimport os\n\nfrom yolox.ex"
  },
  {
    "path": "exps/default/yolox_tiny.py",
    "chars": 540,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii, Inc. and its affiliates.\n\nimport os\n\nfrom yolox.ex"
  },
  {
    "path": "exps/default/yolox_x.py",
    "chars": 357,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii, Inc. and its affiliates.\n\nimport os\n\nfrom yolox.ex"
  },
  {
    "path": "exps/example/custom/nano.py",
    "chars": 1579,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii, Inc. and its affiliates.\n\nimport os\n\nimport torch."
  },
  {
    "path": "exps/example/custom/yolox_s.py",
    "chars": 665,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii, Inc. and its affiliates.\nimport os\n\nfrom yolox.exp"
  },
  {
    "path": "exps/example/yolox_voc/yolox_voc_s.py",
    "chars": 1966,
    "preview": "# encoding: utf-8\nimport os\n\nfrom yolox.data import get_yolox_datadir\nfrom yolox.exp import Exp as MyExp\n\n\nclass Exp(MyE"
  },
  {
    "path": "hubconf.py",
    "chars": 498,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n\n\"\"\"\nUsage example:\n    import torch\n    model = torch.hub.load(\"Megvii-Ba"
  },
  {
    "path": "requirements.txt",
    "chars": 277,
    "preview": "# TODO: Update with exact module version\nnumpy\ntorch>=1.7\nopencv_python\nloguru\ntqdm\ntorchvision\nthop\nninja\ntabulate\npsut"
  },
  {
    "path": "setup.cfg",
    "chars": 618,
    "preview": "[isort]\nline_length = 100\nmulti_line_output = 3\nbalanced_wrapping = True\nknown_standard_library = setuptools\nknown_third"
  },
  {
    "path": "setup.py",
    "chars": 2710,
    "preview": "#!/usr/bin/env python\n# Copyright (c) Megvii, Inc. and its affiliates. All Rights Reserved\n\nimport re\nimport setuptools\n"
  },
  {
    "path": "tests/__init__.py",
    "chars": 46,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n"
  },
  {
    "path": "tests/utils/test_model_utils.py",
    "chars": 3499,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii, Inc. and its affiliates.\n\nimport unittest\n\nimport "
  },
  {
    "path": "tools/__init__.py",
    "chars": 95,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii, Inc. and its affiliates.\n"
  },
  {
    "path": "tools/demo.py",
    "chars": 10175,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii, Inc. and its affiliates.\n\nimport argparse\nimport o"
  },
  {
    "path": "tools/eval.py",
    "chars": 6477,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii, Inc. and its affiliates.\n\nimport argparse\nimport o"
  },
  {
    "path": "tools/export_onnx.py",
    "chars": 3638,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii, Inc. and its affiliates.\n\nimport argparse\nimport o"
  },
  {
    "path": "tools/export_torchscript.py",
    "chars": 2266,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii, Inc. and its affiliates.\n\nimport argparse\nimport o"
  },
  {
    "path": "tools/train.py",
    "chars": 4182,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii, Inc. and its affiliates.\n\nimport argparse\nimport r"
  },
  {
    "path": "tools/trt.py",
    "chars": 2513,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii, Inc. and its affiliates.\n\nimport argparse\nimport o"
  },
  {
    "path": "tools/visualize_assign.py",
    "chars": 2700,
    "preview": "#!/usr/bin/env python3\n# Copyright (c) Megvii, Inc. and its affiliates.\n\nimport os\nimport sys\nimport random\nimport time\n"
  },
  {
    "path": "yolox/__init__.py",
    "chars": 69,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n\n__version__ = \"0.3.0\"\n"
  },
  {
    "path": "yolox/core/__init__.py",
    "chars": 152,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii, Inc. and its affiliates.\n\nfrom .launch import laun"
  },
  {
    "path": "yolox/core/launch.py",
    "chars": 4387,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Code are based on\n# https://github.com/facebookresearch/detectron2/blob/"
  },
  {
    "path": "yolox/core/trainer.py",
    "chars": 15628,
    "preview": "#!/usr/bin/env python3\n# Copyright (c) Megvii, Inc. and its affiliates.\n\nimport datetime\nimport os\nimport time\nfrom logu"
  },
  {
    "path": "yolox/data/__init__.py",
    "chars": 354,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii, Inc. and its affiliates.\n\nfrom .data_augment impor"
  },
  {
    "path": "yolox/data/data_augment.py",
    "chars": 7360,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii, Inc. and its affiliates.\n\"\"\"\nData augmentation fun"
  },
  {
    "path": "yolox/data/data_prefetcher.py",
    "chars": 1649,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii, Inc. and its affiliates.\n\nimport torch\n\n\nclass Dat"
  },
  {
    "path": "yolox/data/dataloading.py",
    "chars": 3671,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii, Inc. and its affiliates.\n\nimport os\nimport random\n"
  },
  {
    "path": "yolox/data/datasets/__init__.py",
    "chars": 325,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii, Inc. and its affiliates.\n\nfrom .coco import COCODa"
  },
  {
    "path": "yolox/data/datasets/coco.py",
    "chars": 6363,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii, Inc. and its affiliates.\nimport copy\nimport os\n\nim"
  },
  {
    "path": "yolox/data/datasets/coco_classes.py",
    "chars": 1296,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii, Inc. and its affiliates.\n\nCOCO_CLASSES = (\n    \"pe"
  },
  {
    "path": "yolox/data/datasets/datasets_wrapper.py",
    "chars": 10878,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii, Inc. and its affiliates.\n\nimport bisect\nimport cop"
  },
  {
    "path": "yolox/data/datasets/mosaicdetection.py",
    "chars": 9573,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii, Inc. and its affiliates.\n\nimport random\n\nimport cv"
  },
  {
    "path": "yolox/data/datasets/voc.py",
    "chars": 11946,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Code are based on\n# https://github.com/fmassa/vision/blob/voc_dataset/to"
  },
  {
    "path": "yolox/data/datasets/voc_classes.py",
    "chars": 442,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii, Inc. and its affiliates.\n\n# VOC_CLASSES = ( '__bac"
  },
  {
    "path": "yolox/data/samplers.py",
    "chars": 2854,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii, Inc. and its affiliates.\n\nimport itertools\nfrom ty"
  },
  {
    "path": "yolox/evaluators/__init__.py",
    "chars": 178,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii, Inc. and its affiliates.\n\nfrom .coco_evaluator imp"
  },
  {
    "path": "yolox/evaluators/coco_evaluator.py",
    "chars": 11467,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii, Inc. and its affiliates.\n\nimport contextlib\nimport"
  },
  {
    "path": "yolox/evaluators/voc_eval.py",
    "chars": 5631,
    "preview": "#!/usr/bin/env python3\n# Code are based on\n# https://github.com/rbgirshick/py-faster-rcnn/blob/master/lib/datasets/voc_e"
  },
  {
    "path": "yolox/evaluators/voc_evaluator.py",
    "chars": 6564,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii, Inc. and its affiliates.\n\nimport sys\nimport tempfi"
  },
  {
    "path": "yolox/exp/__init__.py",
    "chars": 175,
    "preview": "#!/usr/bin/env python3\n# Copyright (c) Megvii Inc. All rights reserved.\n\nfrom .base_exp import BaseExp\nfrom .build impor"
  },
  {
    "path": "yolox/exp/base_exp.py",
    "chars": 2600,
    "preview": "#!/usr/bin/env python3\n# Copyright (c) Megvii Inc. All rights reserved.\n\nimport ast\nimport pprint\nfrom abc import ABCMet"
  },
  {
    "path": "yolox/exp/build.py",
    "chars": 1237,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii Inc. All rights reserved.\n\nimport importlib\nimport "
  },
  {
    "path": "yolox/exp/default/__init__.py",
    "chars": 990,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii Inc. All rights reserved.\n\n# This file is used for "
  },
  {
    "path": "yolox/exp/yolox_base.py",
    "chars": 13611,
    "preview": "#!/usr/bin/env python3\n# Copyright (c) Megvii Inc. All rights reserved.\n\nimport os\nimport random\n\nimport torch\nimport to"
  },
  {
    "path": "yolox/layers/__init__.py",
    "chars": 385,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii Inc. All rights reserved.\n\n# import torch first to "
  },
  {
    "path": "yolox/layers/cocoeval/cocoeval.cpp",
    "chars": 20824,
    "preview": "// Copyright (c) Facebook, Inc. and its affiliates. All Rights Reserved\n#include \"cocoeval.h\"\n#include <time.h>\n#include"
  },
  {
    "path": "yolox/layers/cocoeval/cocoeval.h",
    "chars": 3978,
    "preview": "// Copyright (c) Facebook, Inc. and its affiliates. All Rights Reserved\n#pragma once\n\n#include <pybind11/numpy.h>\n#inclu"
  },
  {
    "path": "yolox/layers/fast_coco_eval_api.py",
    "chars": 5795,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# This file comes from\n# https://github.com/facebookresearch/detectron2/bl"
  },
  {
    "path": "yolox/layers/jit_ops.py",
    "chars": 4462,
    "preview": "#!/usr/bin/env python3\n# Copyright (c) Megvii, Inc. and its affiliates. All Rights Reserved\n\nimport glob\nimport importli"
  },
  {
    "path": "yolox/models/__init__.py",
    "chars": 308,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii Inc. All rights reserved.\n\nfrom .build import *\nfro"
  },
  {
    "path": "yolox/models/build.py",
    "chars": 4266,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n\nimport torch\nfrom torch import nn\nfrom torch.hub import load_state_dict_f"
  },
  {
    "path": "yolox/models/darknet.py",
    "chars": 6019,
    "preview": "#!/usr/bin/env python\n# -*- encoding: utf-8 -*-\n# Copyright (c) Megvii Inc. All rights reserved.\n\nfrom torch import nn\n\n"
  },
  {
    "path": "yolox/models/losses.py",
    "chars": 1677,
    "preview": "#!/usr/bin/env python\n# -*- encoding: utf-8 -*-\n# Copyright (c) Megvii Inc. All rights reserved.\n\nimport torch\nimport to"
  },
  {
    "path": "yolox/models/network_blocks.py",
    "chars": 6092,
    "preview": "#!/usr/bin/env python\n# -*- encoding: utf-8 -*-\n# Copyright (c) Megvii Inc. All rights reserved.\n\nimport torch\nimport to"
  },
  {
    "path": "yolox/models/yolo_fpn.py",
    "chars": 2476,
    "preview": "#!/usr/bin/env python\n# -*- encoding: utf-8 -*-\n# Copyright (c) Megvii Inc. All rights reserved.\n\nimport torch\nimport to"
  },
  {
    "path": "yolox/models/yolo_head.py",
    "chars": 23941,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii Inc. All rights reserved.\n\nimport math\nfrom loguru "
  },
  {
    "path": "yolox/models/yolo_pafpn.py",
    "chars": 3530,
    "preview": "#!/usr/bin/env python\n# -*- encoding: utf-8 -*-\n# Copyright (c) Megvii Inc. All rights reserved.\n\nimport torch\nimport to"
  },
  {
    "path": "yolox/models/yolox.py",
    "chars": 1542,
    "preview": "#!/usr/bin/env python\n# -*- encoding: utf-8 -*-\n# Copyright (c) Megvii Inc. All rights reserved.\n\nimport torch.nn as nn\n"
  },
  {
    "path": "yolox/tools/__init__.py",
    "chars": 966,
    "preview": "#!/usr/bin/env python3\n# Copyright (c) Megvii Inc. All rights reserved.\n\n# This file is used for package installation. S"
  },
  {
    "path": "yolox/utils/__init__.py",
    "chars": 492,
    "preview": "#!/usr/bin/env python3\n# Copyright (c) Megvii Inc. All rights reserved.\n\nfrom .allreduce_norm import *\nfrom .boxes impor"
  },
  {
    "path": "yolox/utils/allreduce_norm.py",
    "chars": 2835,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii Inc. All rights reserved.\n\nimport pickle\nfrom colle"
  },
  {
    "path": "yolox/utils/boxes.py",
    "chars": 4712,
    "preview": "#!/usr/bin/env python3\n# Copyright (c) Megvii Inc. All rights reserved.\n\nimport numpy as np\n\nimport torch\nimport torchvi"
  },
  {
    "path": "yolox/utils/checkpoint.py",
    "chars": 1312,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii Inc. All rights reserved.\nimport os\nimport shutil\nf"
  },
  {
    "path": "yolox/utils/compat.py",
    "chars": 310,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n\nimport torch\n\n_TORCH_VER = [int(x) for x in torch.__version__.split(\".\")["
  },
  {
    "path": "yolox/utils/demo_utils.py",
    "chars": 5101,
    "preview": "#!/usr/bin/env python3\n# Copyright (c) Megvii Inc. All rights reserved.\n\nimport os\nimport random\n\nimport cv2\nimport nump"
  },
  {
    "path": "yolox/utils/dist.py",
    "chars": 8062,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# This file mainly comes from\n# https://github.com/facebookresearch/detect"
  },
  {
    "path": "yolox/utils/ema.py",
    "chars": 2073,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii Inc. All rights reserved.\nimport math\nfrom copy imp"
  },
  {
    "path": "yolox/utils/logger.py",
    "chars": 14636,
    "preview": "#!/usr/bin/env python3\n# Copyright (c) Megvii Inc. All rights reserved.\n\nimport inspect\nimport os\nimport sys\nfrom collec"
  },
  {
    "path": "yolox/utils/lr_scheduler.py",
    "chars": 6551,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii Inc. All rights reserved.\n\nimport math\nfrom functoo"
  },
  {
    "path": "yolox/utils/metric.py",
    "chars": 3456,
    "preview": "#!/usr/bin/env python3\n# -*- coding: utf-8 -*-\n# Copyright (c) Megvii Inc. All rights reserved.\nimport functools\nimport "
  },
  {
    "path": "yolox/utils/mlflow_logger.py",
    "chars": 19542,
    "preview": "#!/usr/bin/env python3\n# Copyright (c) Megvii Inc. All rights reserved.\n# Please read docs/mlflow_integration.md for mor"
  },
  {
    "path": "yolox/utils/model_utils.py",
    "chars": 5614,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii Inc. All rights reserved.\n\nimport contextlib\nfrom c"
  },
  {
    "path": "yolox/utils/setup_env.py",
    "chars": 2675,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii Inc. All rights reserved.\n\nimport os\nimport subproc"
  },
  {
    "path": "yolox/utils/visualize.py",
    "chars": 3599,
    "preview": "#!/usr/bin/env python3\n# -*- coding:utf-8 -*-\n# Copyright (c) Megvii Inc. All rights reserved.\n\nimport cv2\nimport numpy "
  }
]

// ... and 1 more files (download for full content)

About this extraction

This page contains the full source code of the Megvii-BaseDetection/YOLOX GitHub repository, extracted and formatted as plain text for AI agents and large language models (LLMs). The extraction includes 154 files (613.3 KB), approximately 169.4k tokens, and a symbol index with 601 extracted functions, classes, methods, constants, and types. Use this with OpenClaw, Claude, ChatGPT, Cursor, Windsurf, or any other AI tool that accepts text input. You can copy the full output to your clipboard or download it as a .txt file.

Extracted by GitExtract — free GitHub repo to text converter for AI. Built by Nikandr Surkov.

Extract another repo