Tap to Trade in Gate Square, Win up to 50 GT & Merch!
Click the trading widget in Gate Square content, complete a transaction, and take home 50 GT, Position Experience Vouchers, or exclusive Spring Festival merchandise.
Click the registration link to join
https://www.gate.com/questionnaire/7401
Enter Gate Square daily and click any trading pair or trading card within the content to complete a transaction. The top 10 users by trading volume will win GT, Gate merchandise boxes, position experience vouchers, and more.
The top prize: 50 GT.

You don’t even need a professional department, anyone can do tracking on an amateur drone, because current object detection and image recognition technology is just too powerful.
The ubiquitous “eyes”
The reality is that our current world is full of cameras.
Among them, many companies have deployed very covert strategies to track consumers, all done through AI and visual recognition on video feeds.
For example, in Wal-Mart’s smart retail laboratory, IRL sensors and cameras allow staff to know everything in the store.
“Do you want to know how many people pass by a certain place in a certain period of time? What is their age and income status, and how many of them can become potential customers?”
Of course, the “crowd insight” service will emphasize that the data is anonymous, and the way the receipt is collected does not reveal personal privacy.
“I work on the backend security camera system at the stadium, and what we release to the public is only 1/3 of the actual data.”
** Pros and cons? **
In this regard, AI consulting expert Diego San Esteban shared his views:
In addition, AI can also provide objective performance data to avoid human bias in evaluation.
There are also many shortcomings. The most criticized one is the violation of employees’ privacy rights, and it will also create an atmosphere of distrust in the enterprise, which will affect morale and job satisfaction.
AI also cannot adequately understand the context in which work is being done, and lacks human empathy.
And, it is likely to make mistakes, subject to the inherent bias of the training data, which is extremely unfair to employees.
Target detection algorithm
In fact, behind this controversial incident is a very common AI technology-target detection.
These labels will contain the appropriate class for each object, such as “person”, and a “bounding box”, the rectangular area that completely encloses the object.
Object detection is a critical task for humans: when entering a new room or scene, our first instinct is to visually assess the objects and people in it and then understand them.
Similar to humans, object detection plays a vital role in enabling computers to understand and interact with the visual world, and has been widely used in many industries:
Object detection models can help improve workplace safety and security. For example, they can detect the presence of suspicious individuals or vehicles in sensitive areas. Even more creatively, it can ensure workers use personal protective equipment (PPE) such as gloves, helmets or masks.
- social media:
Object detection models can help identify the presence of a particular brand, product, logo, or person in digital media. Advertisers can use this information to gather data and show users more relevant ads. It also helps automate the process of detecting and flagging inappropriate or banned content.
- QC:
Object detection models enable automated review of visual data. Computers and cameras can analyze data in real time, automatically detecting and processing visual information and understanding its significance, reducing human intervention in tasks that require constant visual review. This is especially useful in manufacturing production quality control. Not only does it improve efficiency, it can also detect production anomalies that the human eye might miss, preventing potential production disruptions or product recalls.
Achieved 66 AP for the first time, the strongest SOTA algorithm dominates the list
Currently, in terms of the performance of the target detection algorithm, “DETRs with Collaborative Hybrid Assignments Training” from the domestic team dominates COCO with a score of 66 AP. This work has been accepted by ICCV 2023.
By training multiple parallel auxiliary heads (supervised by one-to-many label assignment, such as ATSS and Faster RCNN), the new Co-DETR can easily boost the learning ability of encoder in end-to-end detector.
By extracting positive coordinates from these auxiliary heads for additional customized positive queries, Co-DETR can also improve the training efficiency of positive samples in the decoder.
Moreover, these auxiliary heads are discarded during inference, so the method does not introduce additional parameters and computational cost to the original detector, and also does not require manual non-maximum suppression (NMS).
project address:
- Encoder optimization:
The training scheme can easily boost the learning ability of the encoder in the end-to-end detector by training multiple parallel auxiliary heads supervised by one-to-many label assignment.
- Codec optimization:
Attention learning in the decoder is improved by extracting positive coordinates from these auxiliary heads for additional custom positive queries.
- SOTA performance:
Co-DETR equipped with ViT-L (304M parameters) is the first model to achieve 66.0% AP on COCO test-dev.
With the support of ViT-L backbone network, Co-DETR achieves 66.0% AP on COCO test-dev and 67.9% AP on LVIS validation set.
In addition, Co-DETR also achieves better performance with a smaller model size than previous methods.