Bi-Weekly AI Research Roundup

Latest research summaries in ML, Robotics, CV, NLP and AI

State of AI

Sep 13, 2024

∙ Paid

ChatGPT's Potential in Cryptography Misuse Detection: A Comparative Analysis with Static Analysis Tools
EyeCLIP: A visual-language foundation model for multi-modal ophthalmic image analysis
Generalization of Graph Neural Networks is Robust to Model Mismatch
LLaMA-Omni: Seamless Speech Interaction with Large Language Models
Human Perception of LLM-generated Text Content in Social Media Environments
Demo: SGCode: A Flexible Prompt-Optimizing System for Secure Generation of Code
VLMEvalKit: An Open-Source Toolkit for Evaluating Large Multi-Modality Models
Approximation and generalization properties of the random projection classification method
Synthetic continued pretraining
Awaking the Slides: A Tuning-free and Knowledge-regulated AI Tutoring System via Language Model Coordination
AI-accelerated discovery of high critical temperature superconductors
DreamHOI: Subject-Driven Generation of 3D Human-Object Interactions with Diffusion Priors
Derivative-Free Guidance in Continuous and Discrete Diffusion Models with Soft Value-Based Decoding
Source2Synth: Synthetic Data Generation and Curation Grounded in Real Data Sources
OmniQuery: Contextually Augmenting Captured Multimodal Memory to Enable Personal Question Answering

ChatGPT's Potential in Cryptography Misuse Detection: A Comparative Analysis with Static Analysis Tools

Authors: Ehsan Firouzi, Mohammad Ghafari, Mike Ebrahimi

Source and references: https://arxiv.org/abs/2409.06561v1

Introduction

This paper investigates the potential of ChatGPT, a large language model, in detecting cryptography misuses in the Java Cryptography Architecture (JCA).

Key Points

The paper evaluates ChatGPT's performance in detecting a variety of cryptography misuses and compares it to the state-of-the-art static analysis tool, CryptoGuard.
The authors use the CryptoAPI-Bench benchmark to assess ChatGPT's capabilities and apply prompt engineering techniques to enhance its detection abilities.
The authors also evaluate the generalizability of their findings using a different benchmark, CAMBench.

Methodology

The researchers followed a four-step process to assess ChatGPT's JCA misuse detection capabilities. This included compiling security violation rules, analyzing gaps in the CryptoAPI-Bench, evaluating test cases with ChatGPT, and assessing additional code snippets. To improve ChatGPT's performance, the authors applied prompt engineering techniques based on insights from recent research.

Results and Findings

The initial results showed that ChatGPT achieved an average F-measure of 86% across 12 misuse categories, outperforming CryptoGuard in 5 categories. After prompt engineering, ChatGPT's average F-measure improved to 94.6%, outperforming CryptoGuard in 10 categories. The authors also confirmed the generalizability of their optimized prompts using the CAMBench.

Implications and Conclusions

The study highlights ChatGPT's promising potential for detecting cryptography misuse, with the ability to outperform the state-of-the-art static analysis tool, CryptoGuard, through the use of prompt engineering. This suggests that large language models like ChatGPT can be leveraged to enhance security practices in software development.

EyeCLIP: A visual-language foundation model for multi-modal ophthalmic image analysis

Authors: Danli Shi, Weiyi Zhang, Jiancheng Yang, Siyu Huang, Xiaolan Chen, Mayinuer Yusufu, Kai Jin, Shan Lin, Shunming Liu, Qing Zhang, Mingguang He

Source and references: https://arxiv.org/abs/2409.06644v1

Introduction

This research paper presents EyeCLIP, a visual-language foundation model developed for multi-modal ophthalmic image analysis. EyeCLIP aims to leverage real-world multi-examination data and language information to enhance the analysis and diagnosis of ophthalmic and systemic diseases.

Get 7 day free trial

Keep reading with a 7-day free trial

Subscribe to State of AI to keep reading this post and get 7 days of free access to the full post archives.

Bi-Weekly AI Research Roundup

Latest research summaries in ML, Robotics, CV, NLP and AI

Contents

ChatGPT's Potential in Cryptography Misuse Detection: A Comparative Analysis with Static Analysis Tools

Introduction

Key Points

Methodology

Results and Findings

Implications and Conclusions

EyeCLIP: A visual-language foundation model for multi-modal ophthalmic image analysis

Introduction

Keep reading with a 7-day free trial