RadSEM - Radiology Sentence-Level Evaluation Metric

RadSEM is a semantic evaluation metric for radiology reports that breaks down reports into atomic sentences, aligns them between generated and reference reports, and computes detailed scores based on anatomical and abnormality relationships.

Overview

RadSEM evaluates radiology reports through three main steps:

Step 1 (Report processing): Converts reports into atomic sentences following strict rules
Step 2 (Sentence matching): Aligns sentences between generated and reference reports with detailed relationship labels
Step 3 (Scoring): Computes weighted F1 scores for abnormal and normal findings

Project Structure

RadSEM/
├── l1_l5/                # L1–L5 evaluation data and filtered samples
├── step/
│   ├── step1.py          # Report rewriting into atomic sentences
│   ├── step2.py          # Sentence matching and tagging
│   └── step3.py          # Score calculation
├── run_radsem.py         # Main pipeline orchestrator
├── groundtruth.jsonl     # Reference reports
└── model_output.jsonl    # Generated reports to evaluate

Installation

API Configuration

The scripts use an API for LLM-based processing. Update the API endpoint and key in step/step1.py:

url = "http://your/API/base/url"
headers = {
    "Authorization": "YOUR_API_KEY",
    ...
}

Usage

Quick Start

Run the complete pipeline:

python run_radsem.py

This will:

Process model_output.jsonl through step1 → model_rewritten_res.jsonl
Process groundtruth.jsonl through step1 → gt_rewritten_res.jsonl
Align and tag both → tag.jsonl
Compute scores → score.jsonl

Input Format

Report Files (JSONL)

Each line should be a JSON object with:

{
  "name": "sample_0001",
  "Examined_Area": "CHEST",
  "Examined_Type": "CT",
  "English_Report": "Both lungs are clear..."
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RadSEM - Radiology Sentence-Level Evaluation Metric

Overview

Project Structure

Installation

API Configuration

Usage

Quick Start

Input Format

Report Files (JSONL)

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
.ipynb_checkpoints		.ipynb_checkpoints
l1_l5		l1_l5
step		step
README.md		README.md
groundtruth.jsonl		groundtruth.jsonl
model_output.jsonl		model_output.jsonl
run_radsem.py		run_radsem.py

Folders and files

Latest commit

History

Repository files navigation

RadSEM - Radiology Sentence-Level Evaluation Metric

Overview

Project Structure

Installation

API Configuration

Usage

Quick Start

Input Format

Report Files (JSONL)

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages