近期新增论文

跟踪 arXiv 与关注会议/期刊的新论文，按发布时间浏览和检索。

近期新增论文

汇总 arXiv 与关注会议/期刊的新论文，检索时按相关性优先排序。

TOSEM Recent ArticlesArchitecture and DesignACM Transactions on Software Engineering and MethodologybenchmarkapproachDesign and architecture modeling and analysisACM TOSEMACM Transactions on Software Engineering and MethodologyComputer scienceExploit

AutoScope: Code Knowledge Enhanced Span-level Sampling for Distributed Tracing

Yulun Wu, Guangba Yu, Zhihan Jiang, Yichen Li, Michael R. Lyu

2026/07/24 08:00已过 2 天

Distributed tracing is essential for observing microservices but incurs prohibitive storage costs. Existing solutions primarily rely on trace-level sampling, which indiscriminately discards entire traces based on probability or tail latency. This coarse-grained approach often loses critical off-the-path anomalies and fails to retain normal baselines required for comparative diagnosis. To address these limitations, we introduce the concept of span-level sampling, a fine-grained paradigm that retains only essential spans rather than full traces. However, we identify a critical challenge in this new approach where naive span retention leads to structural ambiguity, as the removal of intermediate spans disrupts the graph topology and renders the remaining data useless for analysis. To resolve this, we propose AutoScope , a novel framework that leverages static code knowledge to enable structurally safe span-level sampling. AutoScope constructs a Bridged Call Site Control Flow Graph (CSCFG) to map runtime spans to static execution logic. By identifying Dominant Span Sets (DSS), it exploits the “Sample One, Infer All” property to enable aggressive compression while preserving the structural skeleton of every request. Extensive evaluations on two benchmark microservice systems demonstrate that AutoScope achieves an 80.2% storage reduction while maintaining a state-of-the-art faulty span coverage of 98.1%. Furthermore, the reconstructed traces enhance downstream Root Cause Analysis (RCA) tasks, improving the Mean Reciprocal Rank (MRR) by an average of 8.3%.

论文页

arXiv cs.CRDependability and SecurityarXivbenchmarkapproachFormal methods and model checkingcs.CRarXiv cs.CR

Where You Tap Matters: A Probe-and-Model Benchmark for Open-Set RF Fingerprinting

Gabriele Oligeri, Savio Sciancalepore, Ingrid Huso, Fatima Al-Mousawi

2026/07/24 01:48已过 3 天

Radio Frequency Fingerprint Identification (RFFI) enables transmitter identification at the physical layer by learning device-specific impairments from received signals, yet the literature is inconsistent about where in the receiver chain those samples should be collected. Since distinct transformations are applied to the signal by the different receiver operations, i.e., carrier recovery, gain normalization, pulse shaping, and timing recovery, they can either tighten within-transmitter variability or suppress the features RFFI requires for classification. We present a systematic real-world evaluation of open-set, reconstruction-error RFFI using data collected at five probe points along a standard BPSK receiver chain. Our results show that RFFI is strongly probe-dependent: timing recovery and, to a lesser extent, carrier recovery enable low false-acceptance operation with limited in-distribution-out-of-distribution overlap, whereas other stages often require a false-acceptance ratio above 0.1 to achieve a true-acceptance ratio of 0.9. To test the validity of our findings across model selection, we benchmark several LLM-designed autoencoders using a controlled pipeline that holds preprocessing and MSE scoring fixed. These architectures confirm that RFFI is probe-dependent. Moreover, they do not outperform the baseline at the chosen operating point and typically increase training time. Overall, probe selection dominates reconstruction-based open-set RFFI performance, more than the autoencoder complexity.

AutoScope: Code Knowledge Enhanced Span-level Sampling for Distributed Tracing

Where You Tap Matters: A Probe-and-Model Benchmark for Open-Set RF Fingerprinting

Unconditional Unclonable Encryption

From Resource Flow to Executable Tests: Petri-Net-Guided LLM Test Generation for Concurrent Stateful Rust APIs

Euclid-MCP: A Model Context Protocol Server for Deterministic Logical Reasoning via Prolog

Themis Consensus Extension v1: MEV Mitigation by Randomized Delayed Execution and Intent-Hiding Transactions in Application-Specific Blockchains

Toward Federated Cognitive Digital Twins over the Edge-to-Cloud Continuum

Teaching Business Process Modeling to Leverage Soft Skills of Computing Students

Toward cryptographically verifiable authorization for autonomous AI agents: A security hypothesis, preliminary formal model, and proof-of-concept implementation

Information is all you need: Requirements Engineering Quality Reframed

AI Assistants Overassist

An LLM-Driven Workflow for Automated Process Control Strategy Generation and Tuning from Dynamic Process Models

ICAE-Bench: Evaluating Coding Agents as Interactive Project Builders

What Bugs Do Prolog Students Write? An Empirical Taxonomy and Data-Driven Mutation Framework

Encoding Event-B Proof Rules in Prolog: An Interactive Sequent Prover for ProB

Advances in STV Margin Computation

Agree on the Model, Verify the Inference: GKR Protocols for HND-Based Transformer Inference

Improving Communication of Changes in Model-Based Engineering with Model-Independent Change Descriptions

Risk-Limiting Audits for Parliamentary Majorities

Maintenance Signals in AI-Assisted GitHub Repositories: Evidence from GenAI Adopters

HiMe: Real-Time Self-Hosted Personal Agent Platform for Health Insights with Wearable Devices

Weak Private Information Retrieval for Graph-based Storage

Delivery, Not Storage: Cue-Anchored Working Memory as a Harness Property for Coding Agents

Transformer-Assisted LLM-Based Source Code Summarisation: to Enable More Secure Software Development

The Consensus Number of Untraceable Cryptocurrencies

Tencent WorkBuddy Bench: A Multi-Domain Coding-Agent Benchmark with Contamination-Resistant Task Construction

Multi-turn RL with Structural and Performance Aware Rewards for CUDA Kernel Generation

Which Model Is Actually Serving You? IRIS: Budgeted Black-Box Auditing of Model Substitution and Routing Dilution in LLM Gateways

Beyond Heavy Log Curation: Perplexity-Based APT Detection via Unsupervised, Context-Augmented Language Models

Profiling Lightweight Large Language Models

Classical Acceptance Is Not Hybrid Authentication: Measuring X.509 Verifier Semantics in Post-Quantum Migration

IssueTrojanBench: Benchmarking AI Coding Agents Against Malicious Issue Requests

GPE: Evaluating Robust Evidence Aggregation for Fact Verification under Controllable GEO-Style Poisoning

Edit-Neighboring Data Streams and Privacy under Continual Observation

Leaky Language Models: Stealing Architecture and Inference Optimizations via Per-Token Timing

Security Vulnerability Patterns in AI-Generated Code: A Cross-Model Comparative Study

Evaluating Large Language Models for Symbolic Security Protocol Analysis

NVIDIA-labs OO Agents: Native Python Object-Oriented Agents

Buzz to Boom: Detecting Message Progression Vulnerabilities in Electron Applications via Segmented Directed Fuzzing

Learning to Detect UI Principle Violations via Reinforcement Learning

Enhancing Attack Detection Capabilities in BACnet/IP Networks Using Machine-Learning Models

WaveformQA: Benchmarking LLM Temporal Reasoning on Digital Waveforms

Demonstrating GenDB: Instance-Optimized and Customized Query Processing Code Generation via LLM Agents

Pure-DP Statistical Query Release at the Conjectured Square-Root Rate

The ICSE 2026 Shadow PC: Training the Next Generation of Reviewers Through Deliberate Practice

Multi-Source and Cross-Scenario Strategy-Guided Code Optimization

Constant-time decoding of Gabidulin codes and their generalizations with application to RQC

Don't Trust the Label: License Laundering in AI Supply Chains

Chained Attacks on Drone-Based Federated Learning: From Network Disruption to Device Impersonation

The Ethics of Autonomous AI Agents for Offensive Security

Small, Free, and Effective: Orchestrating Open-Weight Small Language Models to Outperform Single LLM for Malware Analysis

Multiparty Session Types for GDPR Purpose Compliance

How Developers Use Relation Chains in Gerrit-Based Review Ecosystems: An Empirical Study Across Three Open-Source Ecosystems

ISAC-Assisted Channel Knowledge Map Generation for Physical Layer Authentication

Multi-stage Dynamic Selection for Cross-Project Defect Prediction

PRO-LONG: Programmatic Memory Enables Long-Horizon Reasoning

Solar Open 2 Technical Report

Geometric Configurations of Perturbed Jailbreak Prompts

SequenceFI: Non-intrusive Temporal Fault Injection for Microservice Systems

Test Case Prioritization for DNNs via Neural Collapse Instability

Deepfake News Detection: A Multimodal Framework Integrating LipNet, DeepSpeech and ResNET for Enhanced Audio-Visual Analysis

Taming the Security-Energy Paradox: A Green AI Approach to Optimized Android Malware Detection

Towards Reliable C-to-Rust Translation with Rule-Guided Reasoning and Reinforcement Learning

HijackKV: New Threat in Position-Independent KV Cache Reuse

JANUS: Foreseeing Latent Risk for Long-Horizon Agent Safety

Defense Against LLM Backdoors using Critical Neuron Isolation Pruning

Adversarial Frontiers: Minimum-Norm Attack Ensembles for Robustness Evaluation

Beyond Fail-to-Pass: Iterative Hardening of Co-Generated Bug Reproduction Tests and Fixes

Know Your Agent: Reconnaissance-Driven Pentesting of AI Agents

DARWIN: Evolving Jailbreak Adversary and Guardrail for LLM Safety Evaluation and Protection

Towards Automated Formal Verification of zkEVMs Using LLM-Guided Constraint Synthesis

An Automated Framework for Extracting Reachable Attack Chains from Cyber Threat Intelligence Reports

Bridging Behavior and Implementation: Automated Java Glue Code Generation for Behavior-Driven Development

AuthProbe: Specification-Driven, Multi-Identity Detection of Broken Object-Level Authorization in Recruitment API

GhostPrompt: Cross-Image Adversarial Prompt for Vision-Language Models

Context Matters: Improving the Practical Reliability of LLM-Based Unit Test Generation

FedLSG: LLM-Enhanced Semantic Calibration for Federated Graph Backdoor Defense

PerfAgent: Profiler-Guided Iterative Refinement for Repository-Level Code Optimization

Human Attention During Localization of Memory Bugs in C Programs

Understanding Developer Pain Points in Federated Learning: Insights from Stack Overflow and GitHub