
[PyConUS 2024] Pandas + Dask DataFrame 2.0: A Leap Forward in Distributed Computing

At PyCon US 2024, Patrick Hoefler delivered an insightful presentation on the advancements in Dask DataFrame 2.0, particularly its enhanced integration with pandas and its performance compared to other big data tools like Spark, DuckDB, and Polars. As a maintainer of both pandas and Dask, Patrick, who works at Coiled, shared how recent improvements have transformed Dask into a robust and efficient solution for distributed computing, making it a compelling choice for handling large-scale datasets.

Enhanced String Handling with Arrow Integration

One of the most significant upgrades in Dask DataFrame 2.0 is its adoption of Apache Arrow for string handling, moving away from the less efficient NumPy object data type. Patrick highlighted that this shift has resulted in substantial performance gains. For instance, string operations are now two to three times faster in pandas, and in Dask, they can achieve up to tenfold improvements due to better multithreading capabilities. Additionally, memory usage has been drastically reduced—by approximately 60 to 70% in typical datasets—making Dask more suitable for memory-constrained environments. This enhancement ensures that users can process large datasets with string-heavy columns more efficiently, a critical factor in distributed workloads.
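
To make this concrete, here is a minimal sketch of how a user might opt into Arrow-backed strings today; the option and config names reflect current pandas 2.x and Dask releases, and the dataset path is hypothetical:

import dask
import dask.dataframe as dd
import pandas as pd

# Arrow-backed strings in plain pandas (2.x):
pd.set_option("mode.string_storage", "pyarrow")

# Recent Dask releases convert object-dtype strings to Arrow strings
# automatically; the behavior is controlled by this config flag:
dask.config.set({"dataframe.convert-string": True})

df = dd.read_parquet("s3://bucket/dataset/")  # hypothetical path
print(df.dtypes)  # string columns appear as string[pyarrow]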

Revolutionary Shuffle Algorithm

Patrick emphasized the complete overhaul of Dask’s shuffle algorithm, which is pivotal for distributed systems where data must be communicated across multiple workers. The previous task-based algorithm scaled poorly: its cost grew log-linearly (on the order of n log n) as dataset sizes increased. The new peer-to-peer (P2P) shuffle algorithm scales linearly, ensuring that doubling the dataset size only doubles the workload. This improvement not only boosts performance but also enhances reliability, allowing Dask to handle arbitrarily large datasets with constant memory usage by leveraging disk storage when necessary. Such advancements make Dask a more resilient choice for complex data processing tasks.
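
As a rough sketch, recent Dask versions select the P2P shuffle automatically on a distributed cluster; it can also be requested explicitly via configuration (the config key is from current releases, and the path and column name are assumptions):

import dask
import dask.dataframe as dd

dask.config.set({"dataframe.shuffle.method": "p2p"})

df = dd.read_parquet("s3://bucket/dataset/")  # hypothetical path
shuffled = df.shuffle(on="account_id")        # repartition by key across workers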

Query Planning: A Game-Changer

The introduction of a logical query planning layer marks a significant milestone for Dask. Historically, Dask executed operations as they were received, often leading to inefficient processing. The new query optimizer employs techniques like column projection and predicate pushdown, which significantly reduce unnecessary data reads and network transfers. For example, by identifying and applying filters and projections early in the query plan, Dask can minimize data movement, yielding performance improvements of up to 1000x in certain scenarios. This optimization makes Dask more intuitive and efficient, bringing it closer to established systems like Spark.
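
A small, hypothetical query illustrates what the optimizer can exploit; with query planning enabled (the default in recent releases), only the referenced columns are read and the filter is pushed into the Parquet scan:

import dask.dataframe as dd

df = dd.read_parquet("s3://bucket/trips/")  # hypothetical path and columns
result = (
    df[df["fare"] > 10]                     # predicate pushdown candidate
    .groupby("passenger_count")["fare"]     # column projection: only two
    .mean()                                 # columns are read from disk
    .compute()
)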

Benchmarking Against the Giants

Patrick presented comprehensive benchmarks using the TPC-H dataset to compare Dask’s performance against Spark, DuckDB, and Polars. At a 100 GB scale, DuckDB often outperformed others due to its single-node optimization, but Dask held its own. At larger scales (1 TB and 10 TB), Dask’s distributed nature gave it an edge, particularly when DuckDB struggled with memory constraints on complex queries. Against Spark, Dask showed remarkable progress, outperforming it in most queries at the 1 TB scale and maintaining competitiveness at 10 TB, despite some overhead issues that Patrick noted are being addressed. These results underscore Dask’s growing capability to handle enterprise-level data processing tasks.

Hashtags: #Dask #Pandas #BigData #DistributedComputing #PyConUS2024 #PatrickHoefler #Coiled #Spark #DuckDB #Polars

[DevoxxGR2024] Butcher Virtual Threads Like a Pro at Devoxx Greece 2024 by Piotr Przybyl

Piotr Przybyl, a Java Champion and developer advocate at Elastic, captivated audiences at Devoxx Greece 2024 with a dynamic exploration of Java 21’s virtual threads. Through vivid analogies, practical demos, and a touch of humor, Piotr demystified virtual threads, highlighting their potential and pitfalls. His talk, rich with real-world insights, offered developers a guide to leveraging this transformative feature while avoiding common missteps. As a seasoned advocate for technologies like Elasticsearch and Testcontainers, Piotr’s presentation was a masterclass in navigating modern Java concurrency.

Understanding Virtual Threads

Piotr began by contextualizing virtual threads within Java’s concurrency evolution. Introduced in Java 21 under Project Loom, virtual threads address the limitations of traditional platform threads, which are costly to create and limited in number. Unlike platform threads, virtual threads are lightweight, managed by a scheduler that mounts and unmounts them from carrier threads during I/O operations. This enables a thread-per-request model, scaling applications to handle millions of concurrent tasks. Piotr likened virtual threads to taxis in a busy city like Athens, efficiently transporting passengers (tasks) without occupying resources during idle periods.

However, virtual threads are not a universal solution. Piotr emphasized that they do not inherently speed up individual requests but improve scalability by handling more concurrent tasks. Their API remains familiar, aligning with existing thread practices, making adoption seamless for developers accustomed to Java’s threading model.
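
A minimal sketch of the thread-per-task model Piotr described, using the standard Java 21 API:

import java.util.concurrent.Executors;
import java.util.stream.IntStream;

public class VirtualThreadsDemo {
    public static void main(String[] args) {
        // One cheap virtual thread per task; try-with-resources waits
        // for all submitted tasks before closing the executor.
        try (var executor = Executors.newVirtualThreadPerTaskExecutor()) {
            IntStream.range(0, 100_000).forEach(i ->
                executor.submit(() -> {
                    Thread.sleep(1_000); // blocking unmounts the virtual
                    return i;            // thread from its carrier thread
                }));
        }
    }
}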

Common Pitfalls and Pinning

A central theme of Piotr’s talk was “pinning,” a performance issue in which a virtual thread remains tied to its carrier thread, negating the benefits. Pinning occurs during blocking I/O or native calls inside synchronized blocks, akin to keeping a taxi running during a lunch break. Piotr demonstrated this with a legacy Elasticsearch client, using Testcontainers and Toxiproxy to simulate slow network calls. By enabling tracing with -Djdk.tracePinnedThreads=full, he identified and resolved pinning issues, replacing synchronized methods with modern, non-blocking clients.

Piotr cautioned against misuses like thread pooling or reusing virtual threads, which disrupt their lightweight design. He advocated for careful monitoring using JFR events to ensure threads remain unpinned, ensuring optimal performance in production environments.
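
The pinning scenario is easy to reproduce. A minimal sketch follows (the lock and sleep stand in for the legacy client's synchronized I/O); running it with the tracing flag prints the offending stack trace:

// Run with: java -Djdk.tracePinnedThreads=full PinningDemo
public class PinningDemo {
    private static final Object LOCK = new Object();

    public static void main(String[] args) throws InterruptedException {
        Thread vt = Thread.ofVirtual().start(() -> {
            synchronized (LOCK) {                  // monitor held while blocking:
                try {
                    Thread.sleep(100);             // the virtual thread stays
                } catch (InterruptedException e) { // pinned to its carrier
                    Thread.currentThread().interrupt();
                }
            }
        });
        vt.join();
        // Replacing synchronized with a java.util.concurrent lock, or a
        // non-blocking client, removes the pinning.
    }
}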

Structured Concurrency and Scoped Values

Piotr explored structured concurrency, a preview feature in Java 21, designed to eliminate thread leaks and cancellation delays. By creating scopes that manage forks, developers can ensure tasks complete or fail together, simplifying error handling. He demonstrated a shutdown-on-failure scope, where a single task failure cancels all others, contrasting this with the complexity of managing interdependent futures.
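
In code, the shutdown-on-failure scope looks roughly like this (Java 21 preview API, so it needs --enable-preview; the fetch methods are placeholders):

import java.util.concurrent.StructuredTaskScope;

public class ScopeDemo {
    String loadPage() throws Exception {
        try (var scope = new StructuredTaskScope.ShutdownOnFailure()) {
            var user  = scope.fork(this::fetchUser);   // both subtasks run
            var order = scope.fork(this::fetchOrder);  // on virtual threads
            scope.join().throwIfFailed();  // one failure cancels the sibling
            return user.get() + " / " + order.get();
        }
    }

    String fetchUser()  { return "frodo"; }  // placeholder lookups
    String fetchOrder() { return "ring"; }
}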

Scoped Values, another preview feature, offer an immutable, one-way alternative to thread locals, preventing bugs like data leakage in thread pools. Piotr illustrated their use in maintaining request context, warning against mutability to preserve reliability. These features, he argued, complement virtual threads, fostering robust, maintainable concurrent applications.
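
A sketch of the request-context use case (also a Java 21 preview feature, run with --enable-preview):

public class ScopedValueDemo {
    // Immutable, one-way alternative to a mutable ThreadLocal:
    private static final ScopedValue<String> REQUEST_ID = ScopedValue.newInstance();

    public static void main(String[] args) {
        ScopedValue.where(REQUEST_ID, "req-42").run(ScopedValueDemo::handle);
    }

    static void handle() {
        // Visible here and in everything called within the scope, but it
        // cannot be reassigned, so nothing leaks across pooled threads.
        System.out.println("handling " + REQUEST_ID.get());
    }
}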

Practical Debugging and Best Practices

Through live coding, Piotr showcased how debugging with logging can inadvertently introduce I/O, unmounting virtual threads and degrading performance. He compared this to a concert where logging scatters tasks, reducing completion rates. To mitigate this, he recommended avoiding I/O in critical paths and using structured concurrency for monitoring.

Piotr’s best practices included using framework-specific annotations (e.g., Quarkus, Spring) to enable virtual threads and ensuring tasks are interruptible. He urged developers to test thoroughly, leveraging tools like Testcontainers to simulate real-world conditions. His blog post on testing unpinned threads provides further guidance for practitioners.

Conclusion

Piotr’s presentation was a clarion call to embrace virtual threads with enthusiasm and caution. By understanding their mechanics, avoiding pitfalls like pinning, and leveraging structured concurrency, developers can unlock unprecedented scalability. His engaging analogies and practical demos made complex concepts accessible, empowering attendees to modernize Java applications responsibly. As Java evolves, Piotr’s insights ensure developers remain equipped to navigate its concurrency landscape.


[DefCon32] Closing Ceremonies & Awards

As the echoes of innovation and collaboration fade from the halls of the Las Vegas Convention Center, the closing ceremonies of DEF CON 32 encapsulate the spirit of a community that thrives on engagement, resilience, and shared purpose. Hosted by Jeff Moss, known as Dark Tangent, alongside contributors like Mar Williams and representatives from various teams, the event reflects on achievements, honors trailblazers, and charts a course forward. Amid reflections on past giants and celebrations of current triumphs, the gathering underscores the hacker ethos: pushing boundaries while fostering inclusivity and growth.

Jeff opens with a tone of relief and gratitude, acknowledging the unforeseen venue shift that tested the community’s adaptability. What began as a potential setback transformed into a revitalized experience, with attendees praising the spacious layout that evoked the intimacy of earlier conventions. This backdrop sets the stage for a moment of solemnity, where participants pause to honor those who paved the way—mentors, innovators, and unsung heroes whose legacies endure in the collective memory.

The theme of “engage” permeates the proceedings, inspiring initiatives that extend the conference’s impact beyond its annual confines. Jeff highlights two new ventures aimed at channeling the community’s expertise toward societal good and personal advancement. These efforts embody a commitment to proactive involvement, bridging the gap between hacker ingenuity and real-world challenges.

Honoring the Past: A Moment of Reflection

In a poignant start, Jeff calls for silence to remember predecessors whose contributions form the foundation of today’s cybersecurity landscape. This ritual serves as a reminder that progress stems from accumulated wisdom, urging attendees to carry forward the ethos of giving back. The gesture resonates deeply, connecting generations and reinforcing the communal bonds that define DEF CON.

Transitioning to celebration, the ceremonies spotlight individuals and organizations embodying selfless dedication. Jeff presents the Uber Contributor Award to The Prophet, a figure whose decades-long involvement spans writing for 2600 magazine, educating newcomers, and organizing events like Telephreak Challenge and QueerCon. His journey from phreaker to multifaceted influencer exemplifies the transformative power of sustained engagement. The Prophet’s acceptance speech captures the magic of the community, where dreams materialize through collective effort.

Similarly, the Electronic Frontier Foundation (EFF) receives recognition for over two decades of advocacy, raising $130,000 this year alone to support speakers and defend digital rights. Their representative emphasizes EFF’s role in amplifying security research for global benefit, aligning with DEF CON’s mission to empower ethical hacking.

Embracing the Theme: Engagement in Action

The “engage” motif drives discussions on evolving the community’s role in an increasingly complex digital world. Jeff articulates how this concept prompted bold experiments, acknowledging the uncertainties but embracing potential failures as learning opportunities. This mindset reflects the hacker’s adaptability, turning challenges into catalysts for innovation.

Attendees share feedback on the new venue, noting reduced overcrowding and a more relaxed atmosphere reminiscent of DEF CON’s earlier editions. Such observations validate the rapid pivot from the previous location, a decision thrust upon organizers by an unexpected contract termination. Jeff recounts the whirlwind process with humor, crediting quick alliances and the community’s resilience for the seamless transition.

Spotlight on Creativity: The Badge Unveiled

Mar Williams takes the stage to demystify the DEF CON 32 badge, a testament to accessible design and collaborative artistry. Drawing from a concept rooted in inclusivity, Mar aimed to create something approachable for novices while offering depth for experts. Built in partnership with Raspberry Pi, the badge incorporates layers of interactivity, from loading custom ROMs to developing games via GB Studio.

Acknowledgments flow to the team: Bonnie Finley for 3D modeling and game art, Chris Maltby for plugins and development, Nutmeg for additional game work, Will Tuttle for narrative input, Ada Rose Cannon for character creation, Legion 303 for audio, and others like ICSN for manufacturing. Mar’s vision emphasizes community participation, with the badge’s game dedicating itself to players who engage and make an impact. Challenges like SOS signals and proximity interactions foster connections, while post-conference resources encourage ongoing tinkering.

Triumphs in Competition: Village and Challenge Winners

The ceremonies burst with energy as winners from myriad contests are announced, showcasing the breadth of skills within the community. From the AI Village Capture the Flag, where teams like AI Cyber Challenge victors demonstrate prowess in emerging tech, to the Aviation Village’s high-flying achievements, each victory highlights specialized expertise.

Notable accolades include the AppSec Village’s top performers in secure coding, the Biohacking Village’s innovative health hacks, and the Car Hacking Village’s vehicular exploits. The Cloud Village CTF crowns champions in scalable defenses, while the Crypto & Privacy Village recognizes cryptographic ingenuity. Diversity shines through in the ICS Village’s industrial control triumphs and the IoT Village’s device dissections.

Special mentions go to the Lockpick Village’s dexterity masters, the Misinformation Village’s truth-seekers, and the Packet Hacking Village’s network ninjas. The Password Cracking Contest and Physical Pentest Challenge celebrate brute force and subtle infiltration, respectively. The Policy Village engages in advocacy wins, and the Recon Village excels in intelligence gathering.

Celebrating Hands-On Innovation: More Contest Highlights

The Red Team Village’s strategic simulations yield victors in offensive operations, complemented by the RFID Village’s access control breakthroughs. Rogue Access Point contests reward wireless wizardry, while the Soldering Skills Village honors precise craftsmanship.

The Space Security Village pushes boundaries in orbital defenses, and the Tamper Evident Village masters detection of intrusions. Telecom and Telephreak challenges revive analog artistry, with the Vishing Competition testing social engineering finesse. The Voting Village exposes electoral vulnerabilities, and the WiFi Village dominates spectrum battles.

Wireless CTF and Wordle Hacking round out the roster, each contributing to a tapestry of technical mastery and creative problem-solving.

Organizational Gratitude: Behind-the-Scenes Heroes

Jeff extends heartfelt thanks to departments, goons, and volunteers who orchestrated the event amid upheaval. Retiring goons like GMark, Noise, Ira, Estang, Gataca, Duna, The Samorphix, Brick, Wham, and Casper receive nods for their service, earning lifetime attendance. New “noons” are welcomed, injecting fresh energy.

Gold badge holders, signifying a decade of dedication, are celebrated for their enduring commitment. This segment underscores the human element sustaining DEF CON’s scale and vibrancy.

Looking Ahead: Community and Continuity

Social channels keep the conversation alive year-round, from Discord movie nights to YouTube archives and Instagram updates. The DEF CON Social Mastodon server offers a moderated space adhering to the code of conduct, providing a haven amid social media fragmentation.

A lighthearted anecdote from Jeff about the badge’s “dark chocolate” Easter egg illustrates serendipitous joy, where proximity triggers whimsical interactions. Such moments encapsulate the conference’s blend of seriousness and play.

Finally, anticipation builds for DEF CON 33, slated for August 7-10 at the same venue. Jeff reflects on the positive reception, affirming the space’s role in reducing FOMO and enhancing connections. With content continually uploaded online, the community remains engaged, ready to disengage only until the next convergence.


[DefCon32] Counter Deception: Defending Yourself in a World Full of Lies

The digital age promised universal access to knowledge, yet it has evolved into a vast apparatus for misinformation. Tom Cross and Greg Conti examine this paradox, tracing deception’s roots from ancient stratagems to modern cyber threats. Drawing on military doctrines and infosec experiences, they articulate principles for crafting illusions and, crucially, for dismantling them. Their discourse empowers individuals to navigate an ecosystem where truth is obscured, fostering tools and mindsets to reclaim clarity.

Deception, at its essence, conceals reality to gain advantage, influencing decisions or inaction. Historical precedents abound: the Trojan Horse’s cunning infiltration, Civil War Quaker guns mimicking artillery, or the Persian Gulf War’s feigned amphibious assault diverting attention from a land offensive. In contemporary conflicts, like Russia’s invasion of Ukraine, fabricated narratives such as the “Ghost of Kyiv” bolster morale while masking intentions. These tactics transcend eras, targeting not only laypersons but experts, code, and emerging AI systems.

In cybersecurity, falsehoods manifest at every layer: spoofed signals in the electromagnetic spectrum, false flags in malware attribution, or fabricated personas for network access and influence propagation. Humans fall prey through phishing, typo-squatting, or mimicry, while specialists encounter deceptive metadata or rotating infrastructures. Malware detection evades scrutiny via polymorphism or fileless techniques, and AI succumbs to data poisoning or jailbreaks. Strategically, deception scales from tactical engagements to national objectives, concealing capabilities or projecting alternatives.

Maxims of Effective Deception

Military thinkers have distilled deception into enduring guidelines. Sun Tzu advocated knowing adversaries intimately while veiling one’s own plans, emphasizing preparation and adaptability. Von Clausewitz viewed war—and by extension, conflict—as enveloped in uncertainty, where illusions amplify fog. Modern doctrines, like those from the U.S. Joint Chiefs, outline six tenets: focus on key decision-makers, integration with operations, centralized control for consistency, timeliness to exploit windows, security to prevent leaks, and adaptability to evolving conditions.

These principles manifest in cyber realms. Attackers exploit cognitive biases—confirmation, anchoring, availability—embedding falsehoods in blind spots. Narratives craft compelling stories, leveraging emotions like fear or outrage to propagate. Coordination ensures unified messaging across channels, while adaptability counters defenses. In practice, state actors deploy bot networks for amplification, or cybercriminals use deepfakes for social engineering. Understanding these offensive strategies illuminates defensive countermeasures.

Inverting Principles for Countermeasures

Flipping offensive maxims yields defensive strategies. To counter focus, broaden information sources, triangulating across diverse perspectives to mitigate echo chambers. Against integration, scrutinize contexts: does a claim align with broader evidence? For centralized control, identify coordination patterns—sudden surges in similar messaging signal orchestration.

Timeliness demands vigilance during critical periods, like elections, where rushed judgments invite errors. Security’s inverse promotes transparency, fostering open verification. Adaptability encourages continuous learning, refining discernment amid shifting tactics.

Practically, countering biases involves self-awareness: question assumptions, seek disconfirming evidence. Triangulation cross-references claims against reliable outlets, fact-checkers, or archives. Detecting narratives entails pattern recognition—recurring themes, emotional triggers, or inconsistencies. Tools like reverse image searches or metadata analyzers expose fabrications.

Applying Counter Deception in Digital Ecosystems

The internet’s structure amplifies deceit, yet hackers’ ingenuity can reclaim agency. Social media, often ego-centric, distorts realities through algorithmic funhouse mirrors. Curating expert networks—via follows, endorsements—filters noise, prioritizing credible voices. Protocols for machine-readable endorsements, akin to LinkedIn but open, enable querying endorsed specialists on topics, surfacing informed commentary.

Innovative protocols like backlinks—envisioned by pioneers such as Vannevar Bush, Douglas Engelbart, and Ted Nelson—remain underexplored. These allow viewing inbound references, revealing critiques or extensions. Projects like Xanadu or Hyperscope hint at potentials: annotating documents with trusted overlays, highlighting recent edits for scrutiny. Content moderation challenges stymied widespread adoption, but coupling with decentralized systems like Mastodon offers paths forward.

Large language models (LLMs) present dual edges: prone to hallucinations, yet adept at structuring unstructured data. Dispassionate analysis could unearth omitted facts from narratives, or map expertise by parsing academic sites to link profiles. Defensive tools might flag biases or inconsistencies, augmenting human judgment per Engelbart’s augmentation ethos.

Scaling countermeasures involves education: embedding media literacy in curricula, emphasizing critical inquiry. Resources like Media Literacy Now provide K-12 frameworks, while frameworks like “48 Critical Thinking Questions” prompt probing—who benefits, where’s the origin? Hackers, adept at discerning falsehoods, can prototype tools—feed analyzers, narrative detectors—leveraging open protocols for innovation.

Ultimately, countering deception demands vigilance and creativity. By inverting offensive doctrines, individuals fortify perceptions, transforming the internet from a misinformation conduit into a truth-seeking engine.


[DefCon32] AMDSinkclose – Universal Ring -2 Privilege Escalation

In the intricate landscape of hardware security, vulnerabilities often lurk within architectural designs that have persisted for years. Enrique Nissim and Krzysztof Okupski, principal security consultants at IOActive, unravel a profound flaw in AMD processors, dubbed AMDSinkclose. Their exploration reveals how this issue enables attackers to escalate privileges to System Management Mode (SMM), granting unparalleled access to system resources. By dissecting the mechanics of SMM and the processor’s memory handling, they demonstrate exploitation paths that bypass traditional safeguards, affecting a vast array of devices from laptops to servers.

SMM represents one of the most potent execution environments in x86 architectures, offering unrestricted control over I/O devices and memory. It operates stealthily, invisible to operating systems, hypervisors, and security tools like antivirus or endpoint detection systems. During boot, firmware initializes hardware and loads SMM code into a protected memory region called SMRAM. At runtime, the OS can invoke SMM services for tasks such as power management or security checks via System Management Interrupts (SMIs). When an SMI triggers, the processor saves its state in SMRAM, executes the necessary operations, and resumes normal activity. This isolation makes SMM an attractive target for persistence mechanisms, including bootkits or firmware implants.

The duo’s prior research focused on vendor misconfigurations and software flaws in SMM components, yielding tools for vulnerability detection and several CVEs in 2023. However, AMDSinkclose shifts the lens to an inherent processor defect. Unlike Intel systems, where SMM-related Model-Specific Registers (MSRs) are accessible only within SMM, AMD allows ring-0 access to these registers. While an SMM lock bit prevents runtime tampering with key configurations, a critical oversight in the documentation exposes two fields—TClose and AClose—not covered by this lock. TClose, in particular, redirects data accesses in SMM to Memory-Mapped I/O (MMIO) instead of SMRAM, creating a pathway for manipulation.

Architectural Foundations and the Core Vulnerability

At the heart of SMM security lies the memory controller’s role in protecting SMRAM. Firmware configures registers like TSEG Base, TSEG Mask, and SMM Base to overlap and shield this region. The TSEG Mask includes fields for enabling protections, but the unlocked TClose bit allows ring-0 users to set it, altering behavior without violating the lock. When activated, instruction fetches in SMM remain directed to DRAM, but data accesses divert to MMIO. This split enables attackers to control execution by mapping malicious content into the MMIO space.

The feature originated around 2006 to allow SMM code to access I/O devices using SMRAM’s physical addresses, though no vendors appear to utilize it. Documentation warns against leaving TClose set upon SMM exit, as it could misdirect state saves to MMIO. Yet, from ring-0, setting this bit and triggering an SMI causes immediate system instability—freezes or hangs—due to erroneous data handling. This echoes the 2015 Memory Sinkhole attack by Christopher Domas, which remapped the APIC to overlap TSEG, but AMDSinkclose affects the entire TSEG region, amplifying the impact.
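
For readers who want to inspect these registers themselves, here is a minimal sketch using the Linux msr driver; the register addresses come from AMD's manuals and the bit layout is as described in the talk, but treat both as assumptions to verify against the APM:

import struct

MSR_SMM_ADDR = 0xC0010112   # TSEG base (per AMD APM; verify)
MSR_SMM_MASK = 0xC0010113   # TSEG mask plus valid/close control bits

def rdmsr(cpu: int, reg: int) -> int:
    """Read an MSR via /dev/cpu/N/msr (requires root and 'modprobe msr')."""
    with open(f"/dev/cpu/{cpu}/msr", "rb", buffering=0) as f:
        f.seek(reg)
        return struct.unpack("<Q", f.read(8))[0]

mask = rdmsr(0, MSR_SMM_MASK)
# Assumed bit layout: 0=AValid, 1=TValid, 2=AClose, 3=TClose, the two
# "close" fields being the ones left outside the SMM lock.
print(f"TValid={(mask >> 1) & 1}  TClose={(mask >> 3) & 1}")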

Brainstorming exploits, Enrique and Krzysztof considered remapping PCI devices to overlay SMRAM, but initial attempts failed due to hardware restrictions. Instead, they targeted the SMM entry point, a vendor-defined layout typically following EDK2 standards. This includes a core area for support code, per-core SMM bases with entry points at offset 0x8000, and save states at 0xFE00. By setting TClose and invoking an SMI, data reads from these offsets redirect to MMIO, allowing control if an attacker maps a suitable device there.

Exploitation Techniques and Multi-Core Challenges

Exploiting AMDSinkclose requires precise manipulation of the Global Descriptor Table (GDT) and Interrupt Descriptor Table (IDT) within SMM. Upon SMI entry, the processor operates in real mode, loading a GDT from the save state to transition to protected mode. By controlling data fetches via TClose, attackers can supply a malicious GDT, enabling arbitrary code execution. The challenge lies in aligning MMIO mappings with SMM offsets, as direct PCI remapping proved ineffective.

The solution involves leveraging the processor’s address wraparound behavior. In protected mode, addresses exceeding 4GB wrap around, but SMM’s real-mode entry point operates at a lower level where this wraparound can be exploited. By setting the SMM base to a high address like 0xFFFFFFF0, data accesses wrap to low MMIO regions (0x0 to 0xFFF), where integrated devices like the Local APIC reside. This allows overwriting the GDT with controlled content from the APIC’s registers.

Multi-core systems introduce complexity, as all cores enter SMM simultaneously during a broadcast SMI. The exploit must handle concurrent execution, ensuring only one core performs the malicious action while others halt safely. Disabling Simultaneous Multithreading (SMT) simplifies this, but wraparound enables targeting specific cores. Testing on Ryzen laptops confirmed reliability, with code injection succeeding across threads.

Impact on Firmware and Mitigation Strategies

The ramifications extend to firmware persistence. Once in SMM, attackers disable SPI flash protections like ROM Armor, enabling writes to non-volatile storage. Depending on configurations—such as Platform Secure Boot (PSB)—outcomes vary. Fully enabled protections limit writes to variables, potentially breaking Secure Boot by altering keys. Absent PSB, full firmware implants become feasible, resistant to OS reinstalls or updates, as malware can intercept and falsify flash operations.

Research on vendor configurations reveals widespread vulnerabilities: many systems lack ROM Armor or PSB, exposing them to implants. Even with protections, bootkits remain possible, executing before the OS loader. A fused disable of PSB leaves a platform perpetually vulnerable.

AMD’s microcode update addresses the issue, though coverage may vary across products. OEMs can patch SMM entry points to detect and halt on TClose activation, a check that can be integrated into EDK2 or Coreboot. Users might trap MSR accesses via hypervisors. The researchers reported the flaw in October 2023; it was assigned CVE-2023-31315, and an advisory was published recently. Exploit code is forthcoming, underscoring the need for deepened architectural scrutiny.


[DevoxxBE2024] A Kafka Producer’s Request: Or, There and Back Again by Danica Fine

Danica Fine, a developer advocate at Confluent, took Devoxx Belgium 2024 attendees on a captivating journey through the lifecycle of a Kafka producer’s request. Her talk demystified the complex process of getting data into Apache Kafka, often treated as a black box by developers. Using a Hobbit-themed example, Danica traced a producer.send() call from client to broker and back, detailing configurations and metrics that impact performance and reliability. By breaking down serialization, partitioning, batching, and broker-side processing, she equipped developers with tools to debug issues and optimize workflows, making Kafka less intimidating and more approachable.

Preparing the Journey: Serialization and Partitioning

Danica began with a simple schema for tracking Hobbit whereabouts, stored in a topic with six partitions and a replication factor of three. The first step in producing data is serialization, converting objects into bytes for brokers, controlled by key and value serializers. Misconfigurations here can lead to errors, so monitoring serialization metrics is crucial. Next, partitioning determines which partition receives the data. The default partitioner uses a key’s hash or sticky partitioning for keyless records to distribute data evenly. Configurations like partitioner.class, partitioner.ignore.keys, and partitioner.adaptive.partitioning.enable allow fine-tuning, with adaptive partitioning favoring faster brokers to avoid hot partitions, especially in high-throughput scenarios like financial services.
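
A minimal producer sketch makes the first two steps tangible; it uses the confluent-kafka Python client, whose configuration keys mirror the Java client's, with a hand-rolled JSON serializer and a hypothetical topic and schema:

import json
from confluent_kafka import Producer

producer = Producer({
    "bootstrap.servers": "localhost:9092",
    "partitioner": "murmur2_random",  # keyed records hash to a partition
})

record = {"hobbit": "Frodo", "location": "Rivendell"}  # assumed schema

producer.produce(
    topic="hobbit-whereabouts",                # hypothetical topic
    key="Frodo".encode("utf-8"),               # same key, same partition
    value=json.dumps(record).encode("utf-8"),  # serialization step
)
producer.flush()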

Batching for Efficiency

To optimize throughput, Kafka groups records into batches before sending them to brokers. Danica explained key configurations: batch.size (default 16KB) sets the maximum batch size, while linger.ms (default 0) controls how long to wait to fill a batch. Setting linger.ms above zero introduces latency but reduces broker load by sending fewer requests. buffer.memory (default 32MB) allocates space for batches, and misconfigurations can cause memory issues. Metrics like batch-size-avg, records-per-request-avg, and buffer-available-bytes help monitor batching efficiency, ensuring optimal throughput without overwhelming the client.

Sending the Request: Configurations and Metrics

Once batched, data is sent via a produce request over TCP, with configurations like max.request.size (default 1MB) limiting batch volume and acks determining how many replicas must acknowledge the write. Setting acks to “all” ensures high durability but increases latency, while acks=1 or 0 prioritizes speed. enable.idempotence and transactional.id prevent duplicates, with transactions ensuring consistency across sessions. Metrics like request-rate, requests-in-flight, and request-latency-avg provide visibility into request performance, helping developers identify bottlenecks or overloaded brokers.
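
Pulling the batching and request settings together, a hedged configuration sketch might look like this; the values are illustrative rather than recommendations:

from confluent_kafka import Producer

producer = Producer({
    "bootstrap.servers": "localhost:9092",
    "batch.size": 16_384,              # max bytes per batch (the default)
    "linger.ms": 20,                   # trade ~20 ms latency for fuller batches
    "acks": "all",                     # wait for all in-sync replicas
    "enable.idempotence": True,        # retries cannot create duplicates
    "statistics.interval.ms": 60_000,  # emit client metrics for monitoring
})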

Broker-Side Processing: From Socket to Disk

On the broker, requests enter the socket receive buffer, then are processed by network threads (default 3) and added to the request queue. IO threads (default 8) validate data with a cyclic redundancy check and write it to the page cache, later flushing to disk. Configurations like num.network.threads, num.io.threads, and queued.max.requests control thread and queue sizes, with metrics like network-processor-avg-idle-percent and request-handler-avg-idle-percent indicating thread utilization. Data is stored in a commit log with log, index, and snapshot files, supporting efficient retrieval and idempotency. The log.flush.rate and local-time-ms metrics ensure durable storage.

Replication and Response: Completing the Journey

Unfinished requests await replication in a “purgatory” data structure, with follower brokers fetching updates every 500ms (often faster). The remote-time-ms metric tracks replication duration, critical for acks=all. Once replicated, the broker builds a response, handled by network threads and queued in the response queue. Metrics like response-queue-time-ms and total-time-ms measure the full request lifecycle. Danica emphasized that understanding these stages empowers developers to collaborate with operators, tweaking configurations like default.replication.factor or topic-level settings to optimize performance.

Empowering Developers with Kafka Knowledge

Danica concluded by encouraging developers to move beyond treating Kafka as a black box. By mastering configurations and monitoring metrics, they can proactively address issues, from serialization errors to replication delays. Her talk highlighted resources like Confluent Developer for guides and courses on Kafka internals. This knowledge not only simplifies debugging but also fosters better collaboration with operators, ensuring robust, efficient data pipelines.


Running Docker Natively on WSL2 (Ubuntu 24.04) in Windows 11

For many developers, Docker Desktop has long been the default solution to run Docker on Windows. However, licensing changes and the desire for a leaner setup have pushed teams to look for alternatives. Fortunately, with the maturity of Windows Subsystem for Linux 2 (WSL2), it is now possible to run the full Docker Engine directly inside a Linux distribution such as Ubuntu 24.04, while still accessing containers seamlessly from both Linux and Windows.

In this guide, I’ll walk you through a clean, step-by-step setup for running Docker Engine inside WSL2 without Docker Desktop, explain how Windows and WSL2 communicate, and share best practices for maintaining a healthy development environment.


Why Run Docker Inside WSL2?

Running Docker natively inside WSL2 has several benefits:

  • No licensing issues – you avoid Docker Desktop’s commercial license requirements.
  • Lightweight – no heavy virtualization layer; containers run directly inside your WSL Linux distro.
  • Integrated networking – on Windows 11 with modern WSL versions,
    containers bound to localhost inside WSL are automatically reachable from Windows.
  • Familiar Linux workflow – you install and use Docker exactly as you would on a regular Ubuntu server.

Step 1 – Update Ubuntu

Open your Ubuntu 24.04 terminal and ensure your system is up to date:

sudo apt update && sudo apt upgrade -y

Step 2 – Install Docker Engine

Install Docker using the official Docker repository:

# Install prerequisites
sudo apt install -y ca-certificates curl gnupg lsb-release

# Add Docker’s GPG key
sudo install -m 0755 -d /etc/apt/keyrings
curl -fsSL https://download.docker.com/linux/ubuntu/gpg | \
  sudo gpg --dearmor -o /etc/apt/keyrings/docker.gpg

# Configure Docker repository
echo \
  "deb [arch=$(dpkg --print-architecture) signed-by=/etc/apt/keyrings/docker.gpg] \
  https://download.docker.com/linux/ubuntu $(lsb_release -cs) stable" | \
  sudo tee /etc/apt/sources.list.d/docker.list > /dev/null

# Install Docker Engine
sudo apt update
sudo apt install -y docker-ce docker-ce-cli containerd.io docker-buildx-plugin docker-compose-plugin
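
On current WSL2 builds, Ubuntu 24.04 enables systemd by default, so the Docker service should start automatically after installation. If docker ps later complains that the daemon is not running (typical of older WSL setups), enable and start it explicitly:

sudo systemctl enable --now docker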

Step 3 – Run Docker Without sudo

To avoid prefixing every command with sudo, add your user to the docker group:

sudo usermod -aG docker $USER

Restart your WSL terminal for the change to take effect, then verify:

docker --version
docker ps

Step 4 – Test Networking

One of the most common questions is:
“Will my containers be accessible from both Ubuntu and Windows?”
The answer is yes on modern Windows 11 with WSL2.
Let’s test it by running an Nginx container:

docker run -d -p 8080:80 --name webtest nginx
  • Inside Ubuntu (WSL): curl http://localhost:8080
  • From Windows (browser or PowerShell): http://localhost:8080

Thanks to WSL2’s localhost forwarding, Windows traffic to localhost is routed
into the WSL network, making containers instantly accessible without extra configuration.


Step 5 – Run Multi-Container Applications with Docker Compose

The Docker Compose plugin is already installed as part of the package above. Check the version:

docker compose version

Create a docker-compose.yml for a WordPress + MySQL stack:

version: "3.9"
services:
  db:
    image: mysql:8.0
    environment:
      MYSQL_ROOT_PASSWORD: rootpass
      MYSQL_DATABASE: wordpress
      MYSQL_USER: wpuser
      MYSQL_PASSWORD: wppass
    volumes:
      - db_data:/var/lib/mysql

  wordpress:
    image: wordpress:latest
    ports:
      - "8080:80"
    environment:
      WORDPRESS_DB_HOST: db:3306
      WORDPRESS_DB_USER: wpuser
      WORDPRESS_DB_PASSWORD: wppass
      WORDPRESS_DB_NAME: wordpress

volumes:
  db_data:

Start the services:

docker compose up -d

Once the containers are running, open http://localhost:8080 in your Windows browser
to access WordPress. The containers are managed entirely inside WSL2,
but networking feels seamless.


Maintenance: Cleaning Up Docker Data

Over time, Docker accumulates images, stopped containers, volumes, and networks.
This can take up significant disk space inside your WSL distribution.
Here are safe maintenance commands to keep your environment clean:

Remove Unused Objects

docker system prune -a --volumes
  • -a: removes all unused images, not just dangling ones
  • --volumes: also removes unused volumes

Reset Everything (Dangerous)

If you need to wipe your Docker environment completely (images, containers, volumes, networks):

docker stop $(docker ps -aq) 2>/dev/null
docker rm -f $(docker ps -aq) 2>/dev/null
docker volume rm $(docker volume ls -q) 2>/dev/null
docker network rm $(docker network ls -q) 2>/dev/null
docker image rm -f $(docker image ls -q) 2>/dev/null

⚠️ Use this only if you want to start fresh. All data will be removed.


Conclusion

By running Docker Engine directly inside WSL2, you gain a powerful, lightweight, and license-free Docker environment that integrates seamlessly with Windows 11. Your containers are accessible from both Linux and Windows, Docker Compose works out of the box, and maintenance is straightforward with prune commands.

This approach is particularly well-suited for developers who want the flexibility of Docker without the overhead of Docker Desktop. With WSL2 and Ubuntu 24.04, you get the best of both worlds: Linux-native Docker with Windows accessibility.

[DevoxxUK2024] Exploring the Power of AI-Enabled APIs by Akshata Sawant

Akshata Sawant, a Senior Developer Advocate at Salesforce, delivered an insightful presentation at DevoxxUK2024, illuminating the transformative potential of AI-enabled APIs. With a career spanning seven years in API development and a recent co-authored book on MuleSoft for Salesforce developers, Akshata expertly navigates the convergence of artificial intelligence and application programming interfaces. Her talk explores how AI-powered APIs are reshaping industries by enhancing automation, data analysis, and user experiences, while also addressing critical ethical and security considerations. Through practical examples and a clear framework, Akshata demonstrates how these technologies synergize to create smarter, more connected systems.

The Evolution of APIs and AI Integration

Akshata begins by likening APIs to a waiter, facilitating seamless communication between disparate systems, such as a customer ordering food and a kitchen preparing it. This analogy underscores the fundamental role of APIs in enabling interoperability across applications. She traces the evolution of APIs from the cumbersome Enterprise JavaBeans (EJB) and SOAP-based systems to the more streamlined REST APIs, noting their pervasive adoption across industries. The advent of AI has further accelerated this evolution, leading to what Akshata terms “API sprawling,” where APIs are integral to integration ecosystems. She introduces three key aspects of AI-enabled APIs: consuming pre-built AI APIs, using AI to streamline API development, and embedding AI models into custom APIs to enhance functionality.

Practical Applications of AI-Enabled APIs

The first aspect Akshata explores is the use of pre-built AI APIs, which are readily available from providers like Google Cloud and Microsoft Azure. These APIs, encompassing generative AI, text, language, image, and video processing, allow developers to integrate advanced capabilities without building complex models from scratch. For instance, Google Cloud’s AI APIs offer use-case-specific endpoints that can be embedded into applications, enabling rapid deployment of intelligent features. Akshata highlights the accessibility of these APIs, which come with pricing models and trial options, making them viable for businesses seeking to enhance automation or data processing. She engages the audience by inquiring about their experience with such APIs, emphasizing their growing relevance in modern development.

The second dimension involves leveraging AI to accelerate API development. Akshata describes the API management lifecycle—designing, simulating, publishing, and documenting APIs—as a complex, iterative process. AI tools can simplify these stages, particularly in generating OpenAPI specifications and documentation. She provides an example where a simple prompt to an AI model produces a comprehensive OpenAPI specification for an order management system, streamlining a traditionally time-consuming task. Additionally, AI-driven intelligent document processing can scan invoices or purchase orders, extract relevant fields, and generate REST APIs with GET and POST methods, complete with auto-generated documentation. This approach significantly reduces manual effort and enhances efficiency.

Embedding AI into Custom APIs

The third aspect focuses on embedding AI models, such as large language models (LLMs) or custom co-pilot solutions, into APIs to create sophisticated applications. Akshata showcases Salesforce’s Einstein Assistant, which integrates with OpenAI’s models to process natural language requests. For example, querying “customer details for Mark” triggers an API call that matches the request to predefined actions, retrieves relevant data, and delivers a response. This seamless integration exemplifies how AI can elevate APIs beyond mere data transfer, enabling dynamic, context-aware interactions. Akshata emphasizes that such embeddings allow developers to create tailored solutions that enhance user experiences, such as personalized customer service or automated workflows.

Ethical and Security Considerations

While celebrating the potential of AI-enabled APIs, Akshata candidly addresses their challenges. She underscores the importance of ethical considerations, such as ensuring unbiased AI outputs and protecting user privacy. Security is another critical concern, as integrating AI into APIs introduces vulnerabilities that must be mitigated through robust authentication and data encryption. Akshata’s balanced perspective highlights the need for responsible development practices to maximize benefits while minimizing risks, ensuring that AI-driven solutions remain trustworthy and secure.


[OxidizeConf2024] Continuous Compliance with Rust in Automotive Software

Introduction to Automotive Compliance

The automotive industry, with its intricate blend of mechanical and electronic systems, demands rigorous standards to ensure safety and reliability. Vignesh Radhakrishnan from Thoughtworks delivered an insightful presentation at OxidizeConf2024, exploring the concept of continuous compliance in automotive software development using Rust. He elucidated how the shift from mechanical to software-driven vehicles has amplified the need for robust compliance processes, particularly in adhering to standards like ISO 26262 and Automotive SPICE (ASPICE). These standards are pivotal in ensuring that automotive software meets stringent safety and quality requirements, safeguarding drivers and passengers alike.

Vignesh highlighted the transformation in the automotive landscape, where modern vehicles integrate complex software for features like adaptive headlights and reverse assist cameras. Unlike mechanical components with predictable failure patterns, software introduces variability that necessitates standardized compliance to maintain quality. The presentation underscored the challenges of traditional compliance methods, which are often manual, disconnected from development workflows, and conducted at the end of the development cycle, leading to inefficiencies and delayed feedback.

Continuous Compliance: A Paradigm Shift

Continuous compliance represents a transformative approach to integrating safety and quality assessments into the software development lifecycle. Vignesh emphasized that this practice involves embedding compliance checks within the development pipeline, allowing for immediate feedback on non-compliance issues. By maintaining documentation close to the code, such as requirements and test cases, developers can ensure traceability and accountability. This method not only streamlines the audit process but also reduces the mean-time-to-recovery when issues arise, enhancing overall efficiency.

The use of open-source tools like Sphinx, a Python documentation generator, was a focal point of Vignesh’s talk. Sphinx facilitates bidirectional traceability by linking requirements to code components, enabling automated generation of audit-ready documentation in HTML and PDF formats. Vignesh demonstrated a proof-of-concept telemetry project, showcasing how Rust’s cohesive toolchain, including Cargo and Clippy, integrates seamlessly with these tools to produce compliant software artifacts. This approach minimizes manual effort and ensures that compliance is maintained iteratively with every code commit.

Rust’s Role in Simplifying Compliance

Rust’s inherent features make it an ideal choice for automotive software development, particularly in achieving continuous compliance. Vignesh highlighted Rust’s robust toolchain, which includes tools like Cargo for building, testing, and formatting code. Unlike C or C++, where developers rely on disparate tools from multiple vendors, Rust offers a unified, developer-friendly environment. This cohesiveness simplifies the integration of compliance processes into continuous integration (CI) pipelines, as demonstrated in Vignesh’s example using CircleCI to automate compliance checks.

Moreover, Rust’s emphasis on safety and ownership models reduces common programming errors, aligning well with the stringent requirements of automotive standards. By leveraging Rust’s capabilities, developers can produce cleaner, more maintainable code that inherently supports compliance efforts. Vignesh’s example of generating traceability matrices and architectural diagrams using open-source tools like PlantUML further illustrated how Rust can enhance the compliance process, making it more accessible and cost-effective.

Practical Implementation and Benefits

In his demonstration, Vignesh showcased a practical implementation of continuous compliance using a telemetry project that streams data to AWS. By integrating Sphinx with Rust code, he illustrated how requirements, test cases, and architectural designs could be documented and linked automatically. This setup allows for real-time compliance assessments, ensuring that software remains audit-ready at all times. The use of open-source plugins and tools provides flexibility, enabling adaptation to various input sources like Jira, further streamlining the process.

The benefits of this approach are manifold. Continuous compliance fosters greater accountability within development teams, as non-compliance issues are identified early. It also enhances flexibility by allowing integration with existing project tools, reducing dependency on proprietary solutions. Vignesh cited the Ferrocene compiler as a real-world example, where similar open-source tools have been used to generate compliance artifacts, demonstrating the feasibility of this approach in large-scale projects.


[DevoxxUK2024] Game, Set, Match: Transforming Live Sports with AI-Driven Commentary by Mark Needham & Dunith Danushka

Mark Needham, from ClickHouse’s product team, and Dunith Danushka, a Senior Developer Advocate at Redpanda, presented an innovative experiment at DevoxxUK2024, showcasing an AI-driven co-pilot for live sports commentary. Inspired by the BBC’s live text commentary for sports like tennis and football, their solution automates repetitive summarization tasks, freeing human commentators to focus on nuanced insights. By integrating Redpanda for streaming, ClickHouse for analytics, and a large language model (LLM) for text generation, they demonstrate a scalable architecture for real-time commentary. Their talk details the technical blueprint, practical implementation, and broader applications, offering a compelling pattern for generative AI in streaming data contexts.

Real-Time Data Streaming with Redpanda

Dunith introduces Redpanda, a Kafka-compatible streaming platform written in C++ to maximize modern hardware efficiency. Unlike Kafka, Redpanda consolidates components like the broker, schema registry, and HTTP proxy into a single binary, simplifying deployment and management. Its web-based console and CLI (rpk) facilitate debugging and administration, such as creating topics and inspecting payloads. In their demo, Mark and Dunith simulate a tennis match by feeding JSON-formatted events into a Redpanda topic named “points.” These events, capturing match details like scores and players, are published at 20x speed using a Python script with the Twisted library. Redpanda’s ability to handle high-throughput streams—hundreds of thousands of messages per second—ensures robust real-time data ingestion, setting the stage for downstream processing.

Analytics with ClickHouse

Mark explains ClickHouse’s role as a column-oriented analytics database optimized for aggregation queries. Unlike row-oriented databases like PostgreSQL, ClickHouse stores columns contiguously, enabling rapid processing of operations like counts or averages. Its vectorized query execution processes column chunks in parallel, enhancing performance for analytics tasks. In the demo, events from Redpanda are ingested into ClickHouse via a Kafka engine table, which mirrors the “points” topic. A materialized view transforms incoming JSON data into a structured table, converting timestamps and storing match metadata. Mark also creates a “matches” table for historical context, demonstrating ClickHouse’s ability to ingest streaming data in real time without batch processing, a key feature for dynamic applications.
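
A hedged sketch of that wiring, using the clickhouse-connect Python client to issue the DDL; the table and column names are assumptions based on the talk:

import clickhouse_connect

client = clickhouse_connect.get_client(host="localhost")

# Kafka engine table mirroring the Redpanda "points" topic:
client.command("""
    CREATE TABLE points_queue (payload String)
    ENGINE = Kafka
    SETTINGS kafka_broker_list = 'localhost:9092',
             kafka_topic_list = 'points',
             kafka_group_name = 'clickhouse-points',
             kafka_format = 'JSONAsString'
""")

# Materialized view that parses each event into the structured table:
client.command("""
    CREATE MATERIALIZED VIEW points_mv TO points AS
    SELECT JSONExtractString(payload, 'player') AS player,
           JSONExtractUInt(payload, 'score')    AS score,
           now()                                AS received_at
    FROM points_queue
""")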

Generating Commentary with AI

The core innovation lies in generating human-like commentary using an LLM, specifically OpenAI’s model. Mark and Dunith design a Streamlit-based web application, dubbed the “Live Text Commentary Admin Center,” where commentators can manually input text or trigger AI-generated summaries. The application queries ClickHouse for recent events (e.g., the last minute or game) using SQL, converts results to JSON, and feeds them into the LLM with a prompt instructing it to write concise, present-tense summaries for tennis fans. For example, a query retrieving the last game’s events might yield, “Zverev and Alcaraz slug it out in an epic five-set showdown.” While effective with frontier models like GPT-4, smaller models like Llama 3 struggled, highlighting the need for robust LLMs. The generated text is published to a Redpanda “live_text” topic, enabling flexible consumption.
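
The query-to-prompt round trip reduces to a few lines; here is a sketch assuming the OpenAI Python SDK and the tables above:

import json
import clickhouse_connect
from openai import OpenAI

ch = clickhouse_connect.get_client(host="localhost")
llm = OpenAI()  # reads OPENAI_API_KEY from the environment

rows = list(ch.query(
    "SELECT * FROM points WHERE received_at > now() - INTERVAL 1 MINUTE"
).named_results())

reply = llm.chat.completions.create(
    model="gpt-4o",  # a frontier model; smaller models struggled in the demo
    messages=[{
        "role": "user",
        "content": "You are a live tennis commentator. Write one concise, "
                   "present-tense summary for tennis fans of these events:\n"
                   + json.dumps(rows, default=str),
    }],
)
print(reply.choices[0].message.content)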

Broadcasting and Future Applications

To deliver commentary to end users, Mark and Dunith employ Server-Sent Events (SSE) via a FastAPI server, streaming Redpanda’s “live_text” topic to a Streamlit web app. This setup mirrors real-world applications like Wikipedia’s recent changes feed, ensuring low-latency updates. The demo showcases commentary appearing in real time, with potential extensions like tweeting updates or storing them in a data warehouse. Beyond sports, Dunith highlights the architecture’s versatility for domains like live auctions, traffic updates, or food delivery tracking (e.g., Uber Eats notifications). Future enhancements include fine-tuning smaller LLMs, integrating fine-grained statistics via text-to-SQL, or summarizing multiple matches for comprehensive coverage, demonstrating the pattern’s adaptability for real-time generative applications.
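
A compact sketch of the delivery leg: a FastAPI endpoint that relays the live_text topic to browsers as Server-Sent Events (the broker address and group id are assumptions):

from confluent_kafka import Consumer
from fastapi import FastAPI
from fastapi.responses import StreamingResponse

app = FastAPI()

def commentary_stream():
    consumer = Consumer({
        "bootstrap.servers": "localhost:9092",
        "group.id": "sse-frontend",
        "auto.offset.reset": "latest",
    })
    consumer.subscribe(["live_text"])
    try:
        while True:
            msg = consumer.poll(1.0)
            if msg is None or msg.error():
                continue
            yield f"data: {msg.value().decode('utf-8')}\n\n"  # SSE framing
    finally:
        consumer.close()

@app.get("/live")
def live():
    return StreamingResponse(commentary_stream(), media_type="text/event-stream")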
