[DevoxxFR2012] Lily: Big Data for Dummies – A Comprehensive Journey into Democratizing Apache Hadoop and HBase for Enterprise Java Developers
Lecturers
Steven Noels stands as one of the most visionary figures in the evolution of open-source Java ecosystems, having co-founded Outerthought in the early 2000s with a mission to push the boundaries of content management, RESTful architecture, and scalable data systems. His flagship creation, Daisy CMS, became a cornerstone for large-scale, multilingual content platforms used by governments and global enterprises, demonstrating that Java could power mission-critical, document-centric applications at internet scale. But Noels’ ambition extended far beyond traditional CMS. Recognizing the seismic shift toward big data in the late 2000s, he pivoted Outerthought—and later NGDATA—toward building tools that would make the Apache Hadoop ecosystem accessible to the average enterprise Java developer. Lily, launched in 2010, was the culmination of this vision: a platform that wrapped the raw power of HBase and Solr into a cohesive, Java-friendly abstraction layer, eliminating the need for MapReduce expertise or deep systems programming.
Bruno Guedes, an enterprise Java architect at SFEIR with over a decade of experience in distributed systems and search infrastructure, brought the practitioner’s perspective to the stage. Having worked with Lily from its earliest alpha versions, Guedes had deployed it in production environments handling millions of records, integrating it with legacy Java EE applications, Spring-based services, and real-time analytics pipelines. His hands-on experience—debugging schema migrations, tuning SolrCloud clusters, and optimizing HBase compactions—gave him unique insight into both the promise and the pitfalls of big data adoption in conservative enterprise settings. Together, Noels and Guedes formed a perfect synergy: the visionary architect and the battle-tested engineer, delivering a presentation that was equal parts inspiration and practical engineering.
Abstract
This article represents an exhaustively elaborated, deeply extended, and comprehensively restructured expansion of Steven Noels and Bruno Guedes’ seminal 2012 DevoxxFR presentation, “Lily, Big Data for Dummies”, transformed into a definitive treatise on the democratization of big data technologies for the Java enterprise. Delivered in a bilingual format that reflected the global nature of the Apache community, the original talk introduced Lily as a groundbreaking platform that unified Apache HBase’s scalable, distributed storage with Apache Solr’s full-text search and analytics capabilities, all through a clean, type-safe Java API. The core promise was radical in its simplicity: enterprise Java developers could build petabyte-scale, real-time searchable data systems without writing a single line of MapReduce, without mastering Zookeeper quorum mechanics, and without abandoning the comforts of POJOs, annotations, and IDE autocompletion.
This expanded analysis delves far beyond the original demo to explore the philosophical foundations of Lily’s design, the architectural trade-offs in integrating HBase and Solr, the real-world production patterns that emerged from early adopters, and the lessons learned from scaling Lily to billions of records. It includes detailed code walkthroughs, performance benchmarks, schema evolution strategies, and failure mode analyses.
EDIT:
Updated for the 2025 landscape, this piece maps Lily’s legacy concepts to modern equivalents—Apache HBase 2.5, SolrCloud 9, OpenSearch, Delta Lake, Trino, and Spring Data Hadoop—while preserving the original vision of big data for the rest of us. Through rich narratives, architectural diagrams, and forward-looking speculation, this work serves not just as a historical archive, but as a practical guide for any Java team contemplating the leap into distributed, searchable big data systems.
The Big Data Barrier in 2012: Why Hadoop Was Hard for Java Developers
To fully grasp Lily’s significance, one must first understand the state of big data in 2012. The Apache Hadoop ecosystem—launched in 2006—was already a proven force in internet-scale companies like Yahoo, Facebook, and Twitter. HDFS provided fault-tolerant, distributed storage. MapReduce offered a programming model for batch processing. HBase, modeled after Google’s Bigtable, delivered random, real-time read/write access to massive datasets. And Solr, built on Lucene, powered full-text search at scale.
Yet for the average enterprise Java developer, this stack was inaccessible. Writing a MapReduce job required:
– Learning a functional programming model in Java that felt alien to OO practitioners (a minimal mapper sketch follows this list).
– Mastering job configuration, input/output formats, and partitioners.
– Debugging distributed failures across dozens of nodes.
– Waiting minutes to hours for job completion.
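To make that ceremony concrete, here is the canonical word-count mapper in the classic Hadoop MapReduce API—a minimal sketch; the reducer and job driver that a complete job also requires are omitted:
import java.io.IOException;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Mapper;

// Emits (word, 1) for every token; a separate Reducer and a Job driver class
// are still needed before anything runs on the cluster.
public class WordCountMapper extends Mapper<LongWritable, Text, Text, IntWritable> {
    private static final IntWritable ONE = new IntWritable(1);
    private final Text word = new Text();

    @Override
    protected void map(LongWritable key, Text value, Context context)
            throws IOException, InterruptedException {
        for (String token : value.toString().split("\\s+")) {
            word.set(token);
            context.write(word, ONE);
        }
    }
}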
HBase, while promising real-time access, demanded:
– Manual row key design to avoid hotspots.
– Deep knowledge of compaction, splitting, and region server tuning.
– Integration with Zookeeper for coordination.
Solr, though more familiar, required:
– Separate schema.xml and solrconfig.xml files.
– Manual index replication and sharding.
– Complex commit and optimization strategies.
The result? Big data remained the domain of specialized data engineers, not the Java developers who built the business logic. Lily was designed to change that.
Lily’s Core Philosophy: Big Data as a First-Class Java Citizen
At its heart, Lily was built on a simple but powerful idea: big data should feel like any other Java persistence layer. Just as Spring Data made MongoDB, Cassandra, or Redis accessible via repositories and annotations, Lily aimed to make HBase and Solr feel like JPA with superpowers.
The Three Pillars of Lily
Steven Noels articulated Lily’s architecture in three interconnected layers:
– The Storage Layer (HBase): Lily used HBase as its primary persistence engine, storing all data as versioned, column-family-based key-value pairs. But unlike raw HBase, Lily abstracted away row key design, column family management, and versioning policies. Developers worked with POJOs, and Lily handled the mapping.
– The Indexing Layer (Solr): Every mutation in HBase triggered an asynchronous indexing event to Solr. Lily kept the two systems in near-real-time sync, ensuring that search results reflected the latest data within milliseconds. This was achieved through a message queue (Kafka or RabbitMQ) and idempotent indexing.
– The Java API Layer: The crown jewel was Lily’s type-safe, annotation-driven API. Developers defined their data model using plain Java classes:
@LilyRecord
public class Customer {
@LilyId
private String id;
@LilyField(family = "profile")
private String name;
@LilyField(family = "profile")
private int age;
@LilyField(family = "activity", indexed = true)
private List<String> recentSearches;
@LilyFullText
private String bio;
}
The @LilyRecord annotation told Lily to persist this object in HBase. @LilyField specified column families and indexing behavior. @LilyFullText triggered Solr indexing. No XML. No schema files. Just Java.
The Lily Repository: Spring Data, But for Big Data
Lily’s LilyRepository interface was modeled after Spring Data’s CrudRepository, but with big data superpowers:
public interface CustomerRepository extends LilyRepository<Customer, String> {
List<Customer> findByName(String name);
@Query("age:[* TO 30]")
List<Customer> findYoungCustomers();
@Query("bio:java AND recentSearches:hadoop")
List<Customer> findJavaHadoopEnthusiasts();
}
Behind the scenes, Lily:
– Translated method names to HBase scans.
– Converted @Query annotations to Solr queries.
– Executed searches across sharded SolrCloud clusters.
– Returned fully hydrated POJOs.
Bruno Guedes demonstrated this in a live demo:
CustomerRepository repo = lily.getRepository(CustomerRepository.class);
repo.save(new Customer("1", "Alice", 28, Arrays.asList("java", "hadoop"), "Java dev at NGDATA"));
List<Customer> results = repo.findJavaHadoopEnthusiasts();
The entire operation—save, index, search—took under 50ms on a 3-node cluster.
Under the Hood: How Lily Orchestrated HBase and Solr
Lily’s magic was in its orchestration layer. When a save() was called:
1. The POJO was serialized to HBase Put operations.
2. The mutation was written to HBase with a version timestamp.
3. A change event was published to a message queue.
4. A Solr indexer consumed the event and updated the search index.
5. Near-real-time consistency was guaranteed via HBase’s WAL and Solr’s soft commits.
For reads:
– findById → HBase Get.
– findByName → HBase scan with secondary index.
– @Query → Solr query with HBase post-filtering.
This dual-write, eventual consistency model was a deliberate trade-off for performance and scalability.
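To make the save path concrete, the following is a hypothetical sketch of the dual-write pattern described above, written against the raw HBase client API; the EventPublisher abstraction is an illustrative stand-in for the Kafka or RabbitMQ queue and this is not Lily’s actual implementation:
import org.apache.hadoop.hbase.TableName;
import org.apache.hadoop.hbase.client.Connection;
import org.apache.hadoop.hbase.client.Put;
import org.apache.hadoop.hbase.client.Table;
import org.apache.hadoop.hbase.util.Bytes;

// Hypothetical dual-write: persist to HBase first, then emit a change event
// that an asynchronous Solr indexer consumes.
public class CustomerWriter {
    private final Connection hbase;          // shared HBase connection
    private final EventPublisher publisher;  // illustrative stand-in for the message queue

    public CustomerWriter(Connection hbase, EventPublisher publisher) {
        this.hbase = hbase;
        this.publisher = publisher;
    }

    public void save(String id, String name) throws java.io.IOException {
        try (Table table = hbase.getTable(TableName.valueOf("customer"))) {
            Put put = new Put(Bytes.toBytes(id));   // row key
            put.addColumn(Bytes.toBytes("profile"), Bytes.toBytes("name"), Bytes.toBytes(name));
            table.put(put);                         // durable write (WAL + MemStore)
        }
        publisher.publish("customer-changed", id);  // change event for the indexer
    }

    public interface EventPublisher {
        void publish(String topic, String recordId);
    }
}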
Schema Evolution and Versioning: The Enterprise Reality
One of Lily’s most enterprise-friendly features was schema evolution. In HBase, adding a column family requires manual admin intervention. In Lily, it was automatic:
// Version 1
@LilyField(family = "profile")
private String email;
// Version 2
@LilyField(family = "profile")
private String phone; // New field, no migration needed
Lily stored multiple versions of the same record, allowing old code to read new data and vice versa. This was critical for rolling deployments in large organizations.
Production Patterns and Anti-Patterns
Bruno Guedes shared war stories from production:
– Hotspot avoidance: Never use auto-incrementing IDs. Use hashed or UUID-based keys (a key-salting sketch follows this list).
– Index explosion: @LilyFullText on large fields → Solr bloat. Use @LilyField(indexed = true) for structured search.
– Compaction storms: Schedule major compactions during low traffic.
– Zookeeper tuning: Increase tick time for large clusters.
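A minimal sketch of the first point—salting row keys with a hash prefix so that consecutive IDs land on different regions. The exact hashing scheme is illustrative, not something prescribed by Lily:
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

// Prefixing the natural key with a few hex characters of its hash spreads
// writes across region servers and avoids hotspots.
public final class RowKeys {
    public static String salted(String naturalId) {
        try {
            byte[] digest = MessageDigest.getInstance("MD5")
                    .digest(naturalId.getBytes(StandardCharsets.UTF_8));
            String prefix = String.format("%02x%02x", digest[0], digest[1]);
            return prefix + "-" + naturalId;   // e.g. "a3f1-order-1000042"
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException("MD5 not available", e);
        }
    }
}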
The Lily Ecosystem in 2012
Lily shipped with:
– Lily CLI for schema inspection and cluster management.
– Lily Maven Plugin for deploying schemas.
– Lily SolrCloud Integration with automatic sharding.
– Lily Kafka Connect for streaming data ingestion.
Lily’s Legacy After 2018: Where the Ideas Live On
EDIT:
Although Lily itself was archived in 2018, its core concepts continue to thrive in modern tools.
The original HBase POJO mapping is now embodied in Spring Data Hadoop.
Lily’s Solr integration has evolved into SolrJ + OpenSearch.
The repository pattern that Lily embraced is carried forward by Spring Data R2DBC.
Schema evolution, once a key Lily feature, is now handled by Apache Atlas.
Finally, Lily’s near-real-time search capability lives on through the Elasticsearch Percolator.
Conclusion: Big Data Doesn’t Have to Be Hard
Steven Noels closed with a powerful message:
“Big data is not about MapReduce. It’s not about Zookeeper. It’s about solving business problems at scale. Lily proved that Java developers can do that—without becoming data engineers.”
EDIT:
In 2025, as lakehouse architectures, real-time analytics, and AI-driven search dominate, Lily’s vision of big data as a first-class Java citizen remains more relevant than ever.
Links
[DevoxxFR2012] MongoDB and Mustache: Toward the Death of the Cache? A Comprehensive Case Study in High-Traffic, Real-Time Web Architecture
Lecturers
Mathieu Pouymerol and Pierre Baillet were the technical backbone of Fotopedia, a photo-sharing platform that, at its peak, served over five million monthly visitors using a Ruby on Rails application that had been in production for six years. Mathieu, armed with degrees from École Centrale Paris and a background in building custom data stores for dictionary publishers, brought a deep understanding of database design, indexing, and performance optimization. Pierre, also from Centrale and with experience at Cambridge, had spent nearly a decade managing infrastructure, tuning Tomcat, configuring memcached, and implementing geoDNS systems. Together, they faced the ultimate challenge: keeping a legacy Rails monolith responsive under massive, unpredictable traffic while maintaining content freshness and developer velocity.
Abstract
This article presents an exhaustively detailed expansion of Mathieu Pouymerol and Pierre Baillet’s 2012 DevoxxFR presentation, “MongoDB et Mustache, vers la mort du cache ?”, reimagined as a definitive case study in high-traffic web architecture and the evolution of caching strategies. The Fotopedia team inherited a Rails application plagued by slow ORM queries, complex cache invalidation logic, and frequent stale data. Their initial response—edge-side includes (ESI), fragment caching, and multi-layered memcached—bought time but introduced fragility and operational overhead. The breakthrough came from a radical rethinking: use MongoDB as a real-time document store and Mustache as a logic-less templating engine to assemble pages dynamically, eliminating cache for the most volatile content.
This analysis walks through every layer of their architecture: from database schema design to template composition, from CDN integration to failure mode handling. It includes performance metrics, post-mortem analyses, and lessons learned from production incidents. Updated for 2025, it maps their approach to modern tools: MongoDB 7.0 with Atlas, server-side rendering with HTMX, edge computing via Cloudflare Workers, and Spring Boot with Mustache, offering a complete playbook for building cache-minimized, real-time web applications at scale.
The Legacy Burden: A Rails Monolith Under Siege
Fotopedia’s core application was built on Ruby on Rails 2.3, a framework that, while productive for startups, began to show its age under heavy load. The database layer relied on MySQL with aggressive sharding and replication, but ActiveRecord queries were slow, and joins across shards were impractical. The presentation layer rendered 15–20 ERB partials per page, each with its own caching logic. The result was a cache dependency graph so complex that a single user action—liking a photo—could invalidate dozens of cache keys across multiple servers.
The team’s initial strategy was defense in depth:
– Varnish at the edge with ESI for including dynamic fragments.
– Memcached for fragment and row-level caching.
– Custom invalidation daemons to purge stale cache entries.
But this created a house of cards. A missed invalidation led to stale comments. A cache stampede during a traffic spike brought the database to its knees. As Pierre put it, “We were not caching to improve performance. We were caching to survive.”
The Paradigm Shift: Real-Time Data with MongoDB
The turning point came when the team migrated dynamic, user-generated content—photos, comments, tags, likes—to MongoDB. Unlike MySQL, MongoDB stored data as flexible JSON-like documents, allowing embedded arrays and atomic updates:
{
"_id": "photo_123",
"title": "Sunset",
"user_id": "user_456",
"tags": ["paris", "sunset"],
"likes": 1234,
"comments": [
{ "user": "Alice", "text": "Gorgeous!", "timestamp": "2013-04-01T12:00:00Z" }
]
}
This schema eliminated joins and enabled single-document reads for most pages. Updates used atomic operators:
db.photos.updateOne(
{ _id: "photo_123" },
{ $inc: { likes: 1 }, $push: { comments: { user: "Bob", text: "Nice!" } } }
);
Indexes on user_id, tags, and timestamp ensured sub-millisecond query performance.
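For Java backends—the direction the 2025 mapping below points to—the same indexes can be declared through the MongoDB Java driver. A hedged sketch; the connection string, database, and collection names are assumptions:
import com.mongodb.client.MongoClient;
import com.mongodb.client.MongoClients;
import com.mongodb.client.MongoCollection;
import com.mongodb.client.model.Indexes;
import org.bson.Document;

// Declares the three indexes mentioned above; names are illustrative.
MongoClient client = MongoClients.create("mongodb://localhost:27017");
MongoCollection<Document> photos = client.getDatabase("fotopedia").getCollection("photos");
photos.createIndex(Indexes.ascending("user_id"));
photos.createIndex(Indexes.ascending("tags"));        // multikey index over the tags array
photos.createIndex(Indexes.descending("timestamp"));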
Mustache: The Logic-Less Templating Revolution
The second pillar was Mustache, a templating engine that enforced separation of concerns by keeping business logic out of templates entirely—only sections for iteration and conditionals remained:
{{#photo}}
<h1>{{title}}</h1>
<img src="{{url}}" alt="{{title}}" />
<p>By {{user.name}} • {{likes}} likes</p>
<ul class="comments">
{{#comments}}
<li><strong>{{user}}</strong>: {{text}}</li>
{{/comments}}
</ul>
{{/photo}}
Because templates contained no business logic, they could be cached indefinitely in Varnish. Only the data changed—and that came fresh from MongoDB on every request.
data = mongo.photos.find(_id: params[:id]).first
html = Mustache.render(template, data)
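For a Java team replicating this pattern (as the 2025 section suggests with Spring Boot and Mustache), the render step could look like the following sketch using the mustache.java library; the inline template and data are illustrative, not Fotopedia’s code:
import com.github.mustachejava.DefaultMustacheFactory;
import com.github.mustachejava.Mustache;
import com.github.mustachejava.MustacheFactory;
import java.io.StringReader;
import java.io.StringWriter;
import java.util.Map;

// Compile the logic-less template once, then render it with fresh data on every request.
MustacheFactory factory = new DefaultMustacheFactory();
Mustache template = factory.compile(new StringReader("{{#photo}}<h1>{{title}}</h1>{{/photo}}"), "photo");
StringWriter html = new StringWriter();
template.execute(html, Map.of("photo", Map.of("title", "Sunset"))).flush();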
The Hybrid Architecture: Cache Where It Makes Sense
The final system was a hybrid of caching and real-time rendering:
– Static assets (CSS, JS, images) → CDN with long TTL.
– Static page fragments (headers, footers, sidebars) → Varnish ESI with 1-hour TTL.
– Dynamic content (photo, comments, likes) → MongoDB + Mustache, no cache.
This reduced cache invalidation surface by 90% and average response time from 800ms to 180ms.
2025: The Evolution of Cache-Minimized Architecture
EDIT:
The principles pioneered by Fotopedia are now mainstream:
– Server-side rendering with HTMX for dynamic updates.
– Edge computing with Cloudflare Workers to assemble pages.
– MongoDB Atlas with change streams for real-time UIs.
– Spring Boot + Mustache for Java backends.
Links
[DevoxxFR2012] Client/Server Apps with HTML5 and Java
Lecturer
James Ward embodies the archetype of the polyglot developer evangelist, bringing a wealth of experience that spans nearly two decades of Java development alongside deep expertise in web technologies and cloud platforms. Having started coding in Java as early as 1997, James initially cut his teeth on server-side frameworks before joining Adobe, where he championed Flex as a rich client technology and contributed to Java Community Process specifications including JSR 286, 299, and 301. His tenure at Adobe honed his ability to bridge desktop-like experiences with web delivery, a theme that permeates his later work. By 2012, James had transitioned to Heroku as a Developer Advocate, where he focused on democratizing cloud deployment and promoting modern, API-driven architectures. Much like his passion for mountain climbing—which he often analogizes to the challenges of software development—James approaches technical problems with a blend of strategic planning and relentless iteration, seeking elegant solutions amid complex terrain. His presentations are characteristically hands-on, featuring live coding demonstrations that translate abstract concepts into tangible artifacts, making him a trusted voice in the Java and web development communities.
Abstract
James Ward articulates a compelling vision for the resurgence of client/server architectures in web development, leveraging the browser as a sophisticated client powered by HTML5, JavaScript, CSS, and complementary tools, while employing Java-based servers to deliver lightweight, API-centric services. Through an end-to-end live coding session, James demonstrates how to orchestrate a modern stack comprising jQuery for DOM manipulation, LESS for dynamic styling, Twitter Bootstrap for responsive design, CoffeeScript for concise scripting, and the Play Framework for robust backend services. He extends the discussion to cloud-native deployment strategies, utilizing Heroku for application hosting and Amazon CloudFront as a Content Delivery Network to optimize static asset delivery. The presentation meticulously analyzes the methodological advantages of this decoupled approach—enhanced responsiveness, independent release cycles, and superior scalability—while addressing practical concerns such as asset management through WebJars and performance optimization. James positions this architecture as the future of web applications, particularly for mobile-first, global audiences, offering profound implications for development velocity, maintenance overhead, and user experience in an increasingly heterogeneous device landscape.
The Renaissance of Client/Server: From Server-Rendered Monoliths to Decoupled Experiences
James Ward opens with a historical reflection on web architecture evolution, observing that after a decade dominated by server-side rendering frameworks such as JSP, JSF, and Ruby on Rails templates, the pendulum is swinging back toward a client/server model reminiscent of early thin-client applications—but now enriched by the browser’s matured capabilities. In this paradigm, the browser assumes responsibility for rendering rich, interactive interfaces using HTML5, CSS3, and JavaScript, while the server is reduced to a stateless API provider delivering JSON or other data formats over HTTP. This shift, James argues, is propelled by the proliferation of smartphones and tablets, which demand native-like responsiveness and offline functionality that server-rendered pages struggle to deliver efficiently. HTML5 standards—local storage, Web Workers, Canvas, and progressive enhancement—enable applications to function seamlessly across devices without native code, while responsive design principles ensure adaptability to varying screen sizes. James contrasts this with traditional approaches where server-side templates intertwine presentation and logic, creating tight coupling that complicates maintenance and slows iteration. By decoupling concerns, developers can evolve the user interface independently of backend changes, a flexibility that proves invaluable in agile environments where user feedback drives rapid UI refinements.
Front-End Ecosystem: Orchestrating Productivity with CoffeeScript, LESS, and Bootstrap
Delving into the client-side stack, James Ward conducts a live coding demonstration that showcases how modern tools dramatically amplify developer productivity while maintaining code quality. He begins with CoffeeScript, a language that compiles to JavaScript and eliminates much of the verbosity and pitfalls associated with raw JS syntax. By writing expressions such as square = (x) -> x * x, CoffeeScript generates clean, idiomatic JavaScript, reducing boilerplate and enhancing readability. James emphasizes that CoffeeScript’s significant whitespace and functional programming influences encourage a more declarative style, which aligns naturally with event-driven browser programming. Complementing this, LESS extends CSS with variables, mixins, and nested rules, transforming style sheets into programmable artifacts. For instance, defining @brand-color: #428bca; and reusing it across selectors eliminates duplication and facilitates theme switching. Twitter Bootstrap enters the equation as a comprehensive UI framework, providing pre-styled components—navigation bars, modals, grids—and a responsive grid system based on media queries. James demonstrates how a simple <div class="container"> with Bootstrap classes automatically adapts layout from desktop to mobile, obviating custom media query sprawl. Within the Play Framework, these assets are served through a unified pipeline that compiles CoffeeScript and LESS on-the-fly during development and minifies them for production, ensuring optimal performance without manual intervention. This orchestrated ecosystem, James asserts, enables small teams to deliver polished, professional interfaces in a fraction of the time required by hand-crafted HTML and CSS.
Backend as API: Play Framework’s Elegance in Service Design
On the server side, James Ward positions the Play Framework as an exemplary choice for building lightweight, stateless APIs that complement rich clients. Play’s Scala-based syntax offers concise controller definitions, where a route such as GET /api/tasks controllers.Tasks.list maps directly to a method returning JSON via Ok(Json.toJson(tasks)). This simplicity belies powerful features: asynchronous request handling via Akka actors, built-in JSON serialization, and seamless WebSocket support for real-time updates. James live-codes a task management endpoint that accepts POST requests with JSON payloads, validates them using Play’s form mapping, and persists to an in-memory store—illustrating how quickly a functional API can be prototyped. Client-side consumption is equally straightforward; jQuery’s $.ajax or the Fetch API retrieves data and dynamically renders it using Bootstrap templates. James highlights Play’s hot-reloading capability, where code changes trigger instant server restarts during development, eliminating the compile-deploy cycle that plagues traditional Java web applications. For persistence, while the demo uses in-memory storage, James notes seamless integration with relational databases via Anorm or JPA, and NoSQL options through third-party modules, underscoring Play’s versatility across data models.
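As a point of reference, here is a sketch of what the same endpoint might look like with Play’s Java API rather than Scala; the in-memory task list and class name are illustrative, not taken from the demo:
import play.libs.Json;
import play.mvc.Controller;
import play.mvc.Result;
import java.util.List;
import java.util.concurrent.CopyOnWriteArrayList;

// Corresponding route entry (conf/routes):  GET /api/tasks  controllers.Tasks.list()
public class Tasks extends Controller {
    private static final List<String> TASKS = new CopyOnWriteArrayList<>();

    // Returns the task list serialized as a JSON array.
    public Result list() {
        return ok(Json.toJson(TASKS));
    }
}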
Cloud-Native Deployment: Heroku and CDN Synergy
Deployment emerges as a cornerstone of James Ward’s vision, and he demonstrates the effortless path from local development to global production using Heroku and Amazon CloudFront. A simple heroku create followed by git push heroku master triggers an automated build process that compiles assets, runs tests, and deploys the application to a dynamically scaled cluster of dynos. Heroku’s add-on ecosystem provides managed PostgreSQL, Redis, and monitoring without operational overhead, embodying Platform-as-a-Service ideals. For static assets—JavaScript, CSS, images—James configures CloudFront as a CDN, caching content at edge locations worldwide to reduce latency and offload server load. Configuration involves setting cache headers in Play and pointing DNS to the CloudFront distribution, a process that takes minutes yet yields significant performance gains. James emphasizes versioning strategies: semantic versioning for APIs combined with cache-busting query parameters for assets ensures smooth upgrades without stale content issues. This cloud-native approach not only accelerates time-to-market but also aligns cost with usage, as Heroku scales dynos automatically and CloudFront bills per byte transferred.
Asset Management Revolution: WebJars and Dependency Convergence
A particularly innovative contribution James Ward introduces is WebJars, a convention for packaging client-side libraries—jQuery, Bootstrap, lodash—as standard JAR files consumable via Maven or Gradle. By declaring <dependency><groupId>org.webjars</groupId><artifactId>bootstrap</artifactId><version>5.3.0</version></dependency> in a POM, developers integrate front-end assets into the same dependency resolution pipeline as server-side libraries, eliminating the chaos of manually downloading and versioning scripts. Play’s asset pipeline automatically extracts these resources to the classpath, making them available via routes.Assets.at("lib/bootstrap/js/bootstrap.min.js"). James demonstrates creating a custom WebJar the night before the talk, packaging both minified and source versions, and stresses the importance of avoiding the historical mistake of scattering JARs in source trees. This unification streamlines builds, enables reproducible environments, and facilitates security patching through centralized dependency updates.
Methodological and Architectural Implications for Modern Web Development
James Ward synthesizes the architectural benefits of this client/server separation, noting that independent release cycles allow frontend teams to iterate on user experience without backend coordination, and vice versa. Responsive design via Bootstrap future-proofs applications against new device form factors, while HTML5’s progressive enhancement ensures graceful degradation on older browsers. Performance considerations—minification, concatenation, CDN caching—combine to deliver sub-second load times even on mobile networks. James addresses testing strategies: Jasmine for client-side unit tests, Specs2 for server-side, and Selenium for end-to-end flows, all integrated into the build pipeline. API versioning through URL paths or headers maintains backward compatibility as schemas evolve. The implications are profound: development velocity increases dramatically, maintenance burden decreases through modularization, and user satisfaction rises with fluid, native-like experiences. For enterprises transitioning from legacy portals, James advocates gradual migration—exposing existing services as JSON APIs while incrementally replacing UI with modern client code—minimizing risk while capturing immediate benefits.
Future Trajectories and Community Considerations
Looking ahead, James Ward anticipates continued maturation of HTML5 standards, broader adoption of Web Components for encapsulation, and potential successors to JavaScript such as Dart or WebAssembly, though he expresses skepticism about near-term displacement given browser ecosystem inertia. Tools like GWT, which compile Java to JavaScript, are acknowledged as viable alternatives for Java-centric teams, but James personally favors direct control over client-side code for maximum flexibility. The presentation closes with an invitation to contribute to WebJars and engage with the Play community, reinforcing open-source collaboration as a catalyst for innovation.
Links:
[DevoxxFR2012] Input/Output: 16 Years Later – Advancements in Java’s I/O Capabilities
Lecturer
Jean-Michel Doudoux has pursued a lifelong passion for software development and technological exploration, starting from his early experiences with a Commodore 64. His professional path includes extensive work in SSII settings and independent projects involving multiple programming languages. Embracing Java since its 1.0 release, Jean-Michel has become a prominent educator in the field, authoring two comprehensive tutorials distributed under the GNU FDL license. One focuses on Eclipse, spanning roughly 600 pages, while the other, which he continues to update, covers Java in over 100 chapters and 2400 pages. As co-founder of the Lorraine JUG and a member of YaJUG, he actively engages with the Java community through organization of events and sharing of resources. His website hosts these tutorials and related materials.
Abstract
Jean-Michel Doudoux provides an in-depth review of input/output (I/O) evolution in Java, from the initial java.io package in Java 1.0 to the sophisticated NIO.2 framework introduced in Java 7. He dissects the shortcomings of earlier APIs in handling files, asynchronous operations, and metadata, showcasing how NIO.2 introduces efficient, extensible solutions. With extensive code demonstrations and comparative evaluations, Doudoux covers path operations, attribute management, permissions, directory navigation, and event notifications. The analysis reveals methodological improvements for performance and cross-platform consistency, with significant ramifications for data-intensive applications, server-side processing, and modern software engineering practices.
Early Java I/O: Streams and Initial Constraints
Jean-Michel Doudoux commences by outlining the foundational I/O mechanisms in Java, rooted in the java.io package since version 1.0. This API centered on streams for byte or character data, offering basic functionality for reading and writing but suffering from synchronous blocking that hindered scalability in networked or multi-threaded scenarios. The File class served as the primary interface for filesystem interactions, yet it was limited to simple operations like existence checks, deletion, and renaming, without support for symbolic links or detailed metadata.
Doudoux highlights how paths were treated as mere strings, prone to errors from platform-specific separators—forward slashes on Unix-like systems versus backslashes on Windows. Creating directories required manual verification of parent existence, and attribute access like last modification time necessitated platform-dependent native code, compromising portability. These constraints often forced developers to resort to external libraries for routine tasks, such as recursive directory deletion or efficient copying.
The introduction of NIO in Java 1.4 marked a step forward with channels and buffers for non-blocking I/O, particularly beneficial for socket communications. However, Doudoux notes that filesystem capabilities remained underdeveloped; RandomAccessFile allowed position-based access, but asynchronous file operations were absent. This historical overview underscores the need for a more comprehensive API, setting the foundation for NIO.2’s contributions.
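To appreciate the gap NIO.2 closes, consider how a routine task such as recursive deletion looked with the original File API—a minimal sketch:
import java.io.File;

// Pre-NIO.2 recursive delete: manual recursion, no symlink awareness,
// and only a boolean to signal failure.
static void deleteRecursively(File target) {
    File[] children = target.listFiles();   // null for plain files or on I/O error
    if (children != null) {
        for (File child : children) {
            deleteRecursively(child);
        }
    }
    if (!target.delete()) {
        throw new IllegalStateException("Could not delete " + target);
    }
}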
NIO.2’s Core Abstractions: Filesystems, Paths, and Providers
Doudoux proceeds to the heart of NIO.2, encapsulated in the java.nio.file package, which abstracts filesystems through the FileSystem interface and its implementations. This design allows uniform treatment of local disks, ZIP archives, or even custom virtual filesystems, greatly improving code reusability across environments.
Path objects, created via Paths.get(), represent locations independently of the underlying filesystem. Doudoux explains relativization—computing paths between locations—and resolution—combining paths—while normalization eliminates redundancies like “.” or “..”. For instance, resolving a relative path against an absolute one yields a complete location, mitigating common string concatenation pitfalls.
Path base = Paths.get("/projects/devoxx");
Path relative = Paths.get("slides/part3.pdf");
Path fullPath = base.resolve(relative); // /projects/devoxx/slides/part3.pdf
Path cleaned = Paths.get("/projects/./devoxx/../archive").normalize(); // /projects/archive
Filesystem providers extend this flexibility: the default handles local files, but FileSystems.newFileSystem can mount alternatives such as ZIP archives. Doudoux codes an example:
Map<String, String> env = new HashMap<>();
env.put("create", "true"); // Create if absent
URI uri = URI.create("jar:file:/data/archive.zip");
FileSystem zipFs = FileSystems.newFileSystem(uri, env);
This enables treating archives as navigable directories, useful for bundled resources or logs. Custom providers could integrate remote storage like FTP or cloud services, expanding Java’s reach in distributed systems.
Managing File Attributes, Permissions, and Security
NIO.2 revolutionizes attribute access, Doudoux asserts, through Files methods and attribute views. BasicFileAttributes exposes universal properties like creation time, size, and directory status, while platform-specific views like PosixFileAttributes add owner and group details.
Reading attributes uses getAttribute() or readAttributes():
Path file = Paths.get("/logs/app.log");
BasicFileAttributes attrs = Files.readAttributes(file, BasicFileAttributes.class);
long size = attrs.size();
Instant lastModified = attrs.lastModifiedTime().toInstant();
Permissions leverage AclFileAttributeView for Windows ACLs or PosixFilePermissions for Unix-like systems:
Path secureFile = Paths.get("/config/secrets.properties");
PosixFileAttributeView posixView = Files.getFileAttributeView(secureFile, PosixFileAttributeView.class);
posixView.setPermissions(PosixFilePermissions.fromString("rw-------"));
UserPrincipalLookupService lookup = FileSystems.getDefault().getUserPrincipalLookupService();
posixView.setOwner(lookup.lookupPrincipalByName("admin"));
Doudoux stresses platform awareness: not all attributes are supported everywhere, requiring fallback strategies. This granularity enhances security in enterprise applications, allowing fine-tuned access control without external tools.
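A minimal sketch of such a fallback: query which attribute views the default filesystem supports before touching POSIX-specific APIs:
import java.nio.file.FileSystems;
import java.util.Set;

Set<String> views = FileSystems.getDefault().supportedFileAttributeViews();
if (views.contains("posix")) {
    // Safe to use PosixFileAttributeView and PosixFilePermissions here.
} else if (views.contains("acl")) {
    // Fall back to AclFileAttributeView, typically on Windows.
} else {
    // Restrict the code to BasicFileAttributes, which every provider supports.
}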
Efficient Directory Operations and Event Monitoring
Directory traversal in NIO.2 employs Files.walk() for streams or walkFileTree() with FileVisitor for customized walks. Doudoux details visitor callbacks—preVisitDirectory, visitFile, postVisitDirectory, visitFileFailed—enabling actions like filtering or error recovery.
For copying directories:
Path sourceDir = Paths.get("/old/project");
Path targetDir = Paths.get("/new/project");
Files.walkFileTree(sourceDir, EnumSet.of(FileVisitOption.FOLLOW_LINKS), Integer.MAX_VALUE, new SimpleFileVisitor<Path>() {
@Override
public FileVisitResult preVisitDirectory(Path dir, BasicFileAttributes attrs) throws IOException {
Files.createDirectories(targetDir.resolve(sourceDir.relativize(dir))); // tolerates an already existing target root
return FileVisitResult.CONTINUE;
}
@Override
public FileVisitResult visitFile(Path file, BasicFileAttributes attrs) throws IOException {
Files.copy(file, targetDir.resolve(sourceDir.relativize(file)));
return FileVisitResult.CONTINUE;
}
});
This handles symlinks and errors gracefully. Monitoring uses WatchService: register paths for events (create, delete, modify), poll for keys, process events.
WatchService watcher = FileSystems.getDefault().newWatchService();
Path monitoredDir = Paths.get("/watched/folder");
monitoredDir.register(watcher, StandardWatchEventKinds.ENTRY_CREATE, StandardWatchEventKinds.ENTRY_MODIFY, StandardWatchEventKinds.ENTRY_DELETE);
while (true) {
WatchKey key = watcher.take();
for (WatchEvent<?> event : key.pollEvents()) {
Path changed = (Path) event.context();
// React to changed
}
boolean valid = key.reset();
if (!valid) break;
}
Doudoux notes event granularity varies by OS, requiring application-level filtering.
Asynchronous I/O Channels: Non-Blocking for Scalability
NIO.2’s asynchronous channels—AsynchronousFileChannel, AsynchronousSocketChannel—facilitate non-blocking operations, Doudoux explains. Use futures for blocking waits or CompletionHandlers for callbacks.
AsynchronousFileChannel asyncChannel = AsynchronousFileChannel.open(path, StandardOpenOption.READ);
ByteBuffer buf = ByteBuffer.allocate(4096);
Future<Integer> readFuture = asyncChannel.read(buf, 0);
int bytesRead = readFuture.get(); // Blocks until complete
Handlers:
asyncChannel.read(buf, 0, null, new CompletionHandler<Integer, Void>() {
@Override
public void completed(Integer result, Void attachment) {
// Process buffer
}
@Override
public void failed(Throwable exc, Void attachment) {
// Handle error
}
});
Doudoux analyzes scalability: servers handle thousands of connections without thread blocking, ideal for high-throughput systems like web servers or data pipelines.
Broader Impacts on Java Ecosystems and Development Practices
Doudoux reflects on NIO.2’s transformative effects: it standardizes operations previously requiring hacks, promoting cleaner code. For portability, abstracted filesystems ease multi-platform development; for performance, asynchronous I/O reduces latency in I/O-bound apps.
In enterprises, this means efficient log monitoring or data migration. Doudoux acknowledges low-level aspects require care but praises extensibility for future integrations like cloud storage.
Overall, NIO.2 modernizes Java I/O, aligning it with contemporary demands for efficiency and flexibility.
Links:
[DevoxxFR2012] There Is No Good Business Model: Rethinking Domain Modeling for Service-Oriented Design and Implementation
Lecturer
Grégory Weinbach has cultivated more than twenty years of experience in software development, spanning a diverse spectrum of responsibilities that range from sophisticated tooling and code generation frameworks to agile domain modeling and the practical application of Domain Driven Design principles. His professional journey reflects a deliberate pursuit of polyvalence, enabling him to operate fluidly across the entire software development lifecycle—from gathering nuanced user requirements to implementing robust, low-level solutions. Grégory maintains a discerning and critical perspective on prevailing methodologies, whether they manifest as Agile practices, Model-Driven Architecture, Service-Oriented Architecture, or the contemporary Software Craftsmanship movement, always prioritizing the fundamental question of “why” before addressing the mechanics of “how.” He is a frequent speaker at various technical forums, including the Paris Java User Group and all five editions of the MDDay conference, and regularly conducts in-depth seminars for enterprise clients on pragmatic modeling techniques that balance theoretical rigor with real-world applicability.
Abstract
Grégory Weinbach delivers a provocative and intellectually rigorous examination of a widely held misconception in software design: the notion that a “good” domain model must faithfully mirror the intricacies of the underlying business reality. He argues persuasively that software systems do not replicate the business world in its entirety but rather operationalize specific, value-delivering services within constrained computational contexts. Through a series of meticulously constructed case studies, comparative analyses, and conceptual diagrams, Grégory demonstrates how attempts to create comprehensive, “truthful” business models inevitably lead to bloated, inflexible codebases that become increasingly difficult to maintain and evolve. In contrast, he advocates for a service-oriented modeling approach where domain models are deliberately scoped, context-bound artifacts designed to support concrete use cases and implementation requirements. The presentation delves deeply into the critical distinction between business models and domain models, the strategic use of bounded contexts as defined in Domain Driven Design, and practical techniques for aligning technical architectures with organizational service boundaries. The implications of this paradigm shift are profound, encompassing reduced developer cognitive load, enhanced system evolvability, accelerated delivery cycles, and the cultivation of sustainable software architectures that remain resilient in the face of changing business requirements.
The Fallacy of Universal Truth: Why Business Reality Cannot Be Fully Encapsulated in Code
Grégory Weinbach commences his discourse with a bold and counterintuitive assertion: the persistent belief that effective software modeling requires a direct, isomorphic mapping between code structures and real-world business concepts represents a fundamental and pervasive error in software engineering practice. He elucidates that while business models—typically expressed through process diagrams, organizational charts, and natural language descriptions—serve to communicate and analyze human activities within an enterprise, domain models in software exist for an entirely different purpose: to enable the reliable, efficient, and maintainable execution of specific computational tasks. Attempting to construct a single, monolithic model that captures the full complexity of a business domain inevitably results in an unwieldy artifact that attempts to reconcile inherently contradictory perspectives, leading to what Weinbach terms “model schizophrenia.” He illustrates this phenomenon through a detailed examination of a retail enterprise scenario, where a unified model encompassing inventory management, customer relationship management, financial accounting, and regulatory compliance creates a labyrinthine network of interdependent entities. A modification to inventory valuation rules, for instance, might inadvertently cascade through customer segmentation logic and tax calculation modules, introducing subtle bugs and requiring extensive regression testing across unrelated functional areas.
Bounded Contexts as Cognitive and Architectural Boundaries: The Domain Driven Design Solution
Building upon Eric Evans’ foundational concepts in Domain Driven Design, Weinbach introduces bounded contexts as the primary mechanism for resolving the contradictions inherent in universal modeling approaches. A bounded context defines a specific semantic boundary within which a particular model and its associated ubiquitous language hold true without ambiguity. He argues that each bounded context deserves its own dedicated model, even when multiple contexts reference conceptually similar entities. For example, the notion of a “customer” within a marketing analytics context—characterized by behavioral attributes, segmentation tags, and lifetime value calculations—bears little structural or behavioral resemblance to the “customer” entity in a legal compliance context, which must maintain immutable audit trails, contractual obligations, and regulatory identifiers. Weinbach presents a visual representation of these distinct contexts, showing how the marketing model might employ lightweight, denormalized structures optimized for analytical queries, while the compliance model enforces strict normalization, versioning, and cryptographic signing. This deliberate separation not only prevents the contamination of precise business rules but also enables independent evolution of each model in response to domain-specific changes, dramatically reducing the blast radius of modifications.
Service-Oriented Modeling: From Abstract Nouns to Deliverable Verbs
Weinbach pivots from theoretical critique to practical prescription by advocating a service-oriented lens for domain modeling, where the primary organizing principle is not the static structure of business entities but the dynamic delivery of specific, value-adding services. He contends that traditional approaches often fall into the trap of “noun-centric” modeling, where developers attempt to create comprehensive representations of business objects loaded with every conceivable attribute and behavior, resulting in god classes that violate the single responsibility principle and become impossible to test or modify. Instead, he proposes that models should be constructed around concrete service verbs—”process payment,” “generate invoice,” “validate shipment”—with each model containing only the minimal set of concepts required to fulfill that service’s contract. Through a logistics case study, Weinbach demonstrates how modeling the “track shipment” service yields a streamlined aggregate consisting of a shipment identifier, a sequence of timestamped status events, and a current location, purposefully omitting unrelated concerns such as inventory levels or billing details. This focused approach not only produces cleaner, more maintainable code but also naturally aligns technical boundaries with organizational responsibilities, facilitating clearer communication between development teams and business stakeholders.
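A hypothetical sketch of the streamlined aggregate this yields; all names are illustrative rather than taken from the talk:
import java.time.Instant;
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

// The "track shipment" service needs nothing beyond an identifier and a status history;
// billing and inventory concerns stay in their own bounded contexts.
public final class Shipment {
    public record StatusEvent(Instant at, String status, String location) {}

    private final String shipmentId;
    private final List<StatusEvent> history = new ArrayList<>();

    public Shipment(String shipmentId) {
        this.shipmentId = shipmentId;
    }

    public void recordStatus(String status, String location) {
        history.add(new StatusEvent(Instant.now(), status, location));
    }

    // The current location is derived from the latest event rather than stored redundantly.
    public String currentLocation() {
        return history.isEmpty() ? "UNKNOWN" : history.get(history.size() - 1).location();
    }

    public List<StatusEvent> history() {
        return Collections.unmodifiableList(history);
    }
}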
The Human Factor: Reducing Cognitive Load and Enhancing Team Autonomy
One of the most compelling arguments Weinbach advances concerns the human dimension of software development. Universal models, by their very nature, require developers to maintain a vast mental map of interrelationships and invariants across the entire system, dramatically increasing cognitive load and the likelihood of errors. Service-oriented, context-bound models, conversely, allow developers to focus their attention on a well-defined subset of the domain, mastering a smaller, more coherent set of concepts and rules. This reduction in cognitive complexity translates directly into improved productivity, fewer defects, and greater job satisfaction. Moreover, when technical boundaries mirror organizational boundaries—such as when the team responsible for order fulfillment owns the order processing context—they gain true autonomy to evolve their domain model in response to business needs without coordinating with unrelated teams, accelerating delivery cycles and fostering a sense of ownership and accountability.
Practical Implementation Strategies: From Analysis to Code
Weinbach concludes his presentation with a comprehensive set of practical guidelines for implementing service-oriented modeling in real-world projects. He recommends beginning with event storming workshops that identify key business events and the services that produce or consume them, rather than starting with entity relationship diagrams. From these events, teams can derive bounded contexts and their associated models, using techniques such as context mapping to document integration patterns between contexts. He demonstrates code examples showing how anti-corruption layers translate between context-specific models when integration is required, preserving the integrity of each bounded context while enabling necessary data flow. Finally, Weinbach addresses the challenging task of communicating these principles to non-technical stakeholders, who may initially resist the idea of “duplicating” data across models. He explains that while information duplication is indeed undesirable, data duplication across different representational contexts is not only acceptable but necessary when those contexts serve fundamentally different purposes.
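A small, hypothetical illustration of such an anti-corruption layer: the compliance context receives only the fields it needs, translated at the boundary. All type names are invented for the example:
// Marketing-context view of a customer, rich in analytical attributes.
record MarketingCustomer(String customerId, String legalName, String segment, double lifetimeValue) {}

// Compliance-context view: minimal and audit-oriented.
record ComplianceCustomer(String customerId, String legalName) {}

// The anti-corruption layer keeps the compliance model ignorant of marketing concepts.
class MarketingToComplianceTranslator {
    ComplianceCustomer translate(MarketingCustomer source) {
        return new ComplianceCustomer(source.customerId(), source.legalName());
    }
}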
Links:
[DevoxxFR2012] — Rev:2, Add twitter handle
ALTER TABLE speaker ADD COLUMN twitter varchar(255);
These scripts run automatically on startup, supporting continuous delivery.
## RESTful API Development
James Ward constructs JSON endpoints:
public class ApiController extends Controller {
public Result speakers() {
List<Speaker> speakers = Speaker.find.all();
return ok(Json.toJson(speakers));
}
}
Play’s `Json.toJson` uses Jackson for serialization; additional modules, such as ReactiveMongo for document persistence, are enabled in `application.conf`:
play.modules.enabled += "play.modules.reactivemongo.ReactiveMongoModule"
## Form Handling and Validation
Nicolas demonstrates form binding:
public class FormsController extends Controller {
static Form<Speaker> speakerForm = Form.form(Speaker.class);
public Result create() {
Form<Speaker> filled = speakerForm.bindFromRequest();
if (filled.hasErrors()) {
return badRequest(views.html.create.render(filled));
} else {
Speaker speaker = filled.get();
speaker.save();
return redirect(routes.Application.index());
}
}
}
HTML forms use Play’s template engine:
@form(routes.FormsController.create()) {
@inputText(speakerForm("name"), '_label -> "Name")
@inputText(speakerForm("twitter"), '_label -> "Twitter")
}
## Frontend Development with HTML5 and JavaScript
Guillaume Bort integrates Twitter Bootstrap and jQuery for responsive design, demonstrating real-time features with WebSockets:
public class StreamController extends Controller {
public WebSocket<String> updates() {
return WebSocket.whenReady((in, out) -> {
// Push updates
});
}
}
## Cloud Deployment to Heroku
James Ward executes deployment:
heroku create devoxx-play-demo
git push heroku master
heroku addons:create heroku-postgresql
Play’s `Procfile` declares the startup command:
web: target/universal/stage/bin/conference-app -Dhttp.port=${PORT}
Implications for Modern Web Development
The presenters conclude that Play 2.0’s integrated toolchain and cloud-ready architecture dramatically accelerate development cycles while producing applications that scale effortlessly in elastic environments.
Links:
[DevoxxFR2012] Java on Amazon Web Services: Building Scalable, Resilient Applications in the Cloud
Lecturer
Carlos Conde operates as a Solutions Architect within Amazon Web Services’ European team, where he partners with enterprises to design, migrate, and optimize Java applications on the AWS platform. His technical journey began over a decade ago when he founded IWorks, a company focused on Java web development atop legacy mainframe systems, followed by co-founding Newaxe, which delivered a Java-based ERP monitoring and management platform. After years as an independent consultant helping Fortune 500 companies modernize their stacks, Carlos joined AWS to bring cloud-native principles to the broader Java community. His expertise spans the full application lifecycle—from initial architecture through production operations—with a particular emphasis on cost optimization, high availability, and security.
Abstract
Carlos Conde provides an exhaustive guide to developing, deploying, and operating Java applications on Amazon Web Services, demonstrating how the platform’s breadth of services enables developers to build systems that are simultaneously scalable, resilient, and cost-effective. He introduces AWS Elastic Beanstalk as a fully managed Platform as a Service solution that abstracts infrastructure complexity, the AWS SDK for Java for programmatic service access, and the AWS Toolkit for Eclipse for seamless IDE integration. Through detailed live demonstrations and real-world case studies from companies like Viadeo and Netflix, Conde explores deployment strategies, database patterns, content delivery, monitoring, and disaster recovery. The presentation addresses hybrid cloud scenarios, data sovereignty requirements under European regulations, and advanced cost management techniques, concluding with a vision of serverless Java and containerized workloads that redefine operational excellence.
The AWS Java Ecosystem: Tools for Every Stage of Development
Carlos Conde opens by mapping the AWS service landscape to the Java developer’s workflow. At the foundation lies Amazon EC2, offering virtual servers with pre-built Java AMIs (Amazon Machine Images) that include OpenJDK, Tomcat, and Spring Boot. For developers seeking higher abstraction, AWS Elastic Beanstalk provides a PaaS experience where applications are deployed via simple commands:
eb init -p java-8 my-java-app
eb create production-env --instance_type t3.medium
eb deploy
Behind the scenes, Beanstalk provisions EC2 instances, Elastic Load Balancers, Auto Scaling groups, and CloudWatch alarms, while allowing full customization through .ebextensions configuration files. Conde demonstrates deploying a Spring Boot application that automatically scales from 2 to 10 instances based on CPU utilization, with zero-downtime blue/green deployments.
The AWS SDK for Java serves as the programmatic bridge to over 200 AWS services. Conde writes a service that stores user profiles in Amazon DynamoDB:
DynamoDbClient client = DynamoDbClient.builder()
    .region(Region.EU_WEST_3)
    .build();
Map<String, AttributeValue> item = new HashMap<>();
item.put("userId", AttributeValue.builder().s(userId).build());
item.put("email", AttributeValue.builder().s(email).build());
item.put("createdAt", AttributeValue.builder().n(Long.toString(timestamp)).build());
PutItemRequest request = PutItemRequest.builder()
    .tableName("Users")
    .item(item)
    .build();
client.putItem(request);
DynamoDB’s single-digit millisecond latency and automatic scaling make it ideal for high-throughput workloads. For relational data, Amazon RDS offers managed PostgreSQL, MySQL, or Oracle with automated backups, patching, and multi-AZ replication.
Content Delivery and Global Reach
For static assets, Conde integrates Amazon S3 with CloudFront:
S3Client s3 = S3Client.builder().build();
s3.putObject(PutObjectRequest.builder()
    .bucket("my-app-assets")
    .key("css/style.css")
    .acl(ObjectCannedACL.PUBLIC_READ)
    .build(), RequestBody.fromFile(Paths.get("style.css")));
CloudFront’s 200+ edge locations cache content close to users, reducing latency from 180ms to 30ms for European customers. He demonstrates invalidating cache after deployments using the AWS CLI.
Cost Optimization Strategies
Conde presents a multi-layered approach to cost control. Reserved Instances provide up to 75% savings for predictable workloads, while Savings Plans offer flexibility across EC2, Lambda, and Fargate. For batch processing, Spot Instances deliver 90% discounts:
RunInstancesRequest request = RunInstancesRequest.builder()
    .instanceMarketOptions(InstanceMarketOptionsRequest.builder()
        .marketType(MarketType.SPOT)
        .spotOptions(SpotMarketOptions.builder()
            .maxPrice("0.10")
            .instanceInterruptionBehavior(InstanceInterruptionBehavior.TERMINATE)
            .build())
        .build())
    .build();
He uses AWS Cost Explorer to visualize spending and set budget alerts.
High Availability and Disaster Recovery
Conde designs a multi-AZ architecture with RDS read replicas, ElastiCache for Redis caching, and S3 cross-region replication. He demonstrates failover using Route 53 health checks that automatically reroute traffic if a region fails.
Security and Compliance
Security is baked into every layer. AWS Identity and Access Management (IAM) enforces least-privilege access, while AWS KMS manages encryption keys. Conde enables S3 server-side encryption and RDS TDE (Transparent Data Encryption) with a single click.
Hybrid and Sovereign Cloud Deployments
For European data residency, Conde deploys entirely within the Paris region (eu-west-3). For hybrid scenarios, AWS Direct Connect establishes dedicated network connections to on-premises data centers, and AWS Outposts brings AWS services into private facilities.
Monitoring, Logging, and Observability
Amazon CloudWatch collects metrics, logs, and events. Conde instruments a Spring Boot application with Micrometer:
@Timed(value = "order.processing.time", description = "Time taken to process order")
public Order processOrder(OrderRequest request) {
// Business logic
}
AWS X-Ray traces requests across services, identifying latency bottlenecks.
Real-World Case Studies
Conde shares Viadeo’s migration of their social platform to AWS, achieving 99.99% availability and reducing infrastructure costs by 60%. Netflix’s global streaming architecture leverages AWS for transcoding, personalization, and content delivery at petabyte scale.
The Future of Java on AWS
Conde previews AWS Lambda for serverless Java, AWS Fargate for containerized workloads without server management, and AWS Graviton2 processors offering 40% better price-performance for Java applications.
Implications for Java Enterprises
Carlos Conde concludes that AWS transforms Java development from infrastructure-constrained to innovation-focused. By leveraging managed services, developers build faster, operate more reliably, and scale globally—all while maintaining strict control over costs and compliance.
Links:
[DevoxxFR2012] Manipulation of Bytecode: Let’s Democratize the Black Magics!
Lecturer
Frédéric Le Mouël holds the position of associate professor at INSA Lyon within the Telecommunications Department and CITI Lab, while also serving as a member of the INRIA Amazones research team. His academic background includes a PhD from University of Rennes 1 focused on adaptive middleware for mobile computing environments. Frédéric’s research spans component-oriented architectures, service-oriented computing, and dynamic adaptation mechanisms, with particular interest in bytecode manipulation as an enabling technology for runtime adaptation. The JooFlux project, which he presents, represents a culmination of these research threads.
Abstract
Frédéric Le Mouël challenges the perception of bytecode manipulation as an esoteric practice reserved for framework authors and programming language implementers, demonstrating its practical utility across the software development spectrum from application development to system instrumentation. The presentation systematically demystifies Java bytecode and the JVM’s execution model before examining multiple manipulation tools including ASM, AspectJ, Byteman, and the innovative JooFlux framework. Through live demonstrations and architectural analyses, Le Mouël illustrates how bytecode transformation enables aspect-oriented programming, runtime monitoring, adaptive systems, and performance optimization without source code modification. The session concludes with philosophical reflections on the power and responsibility accompanying low-level JVM access.
Java Bytecode Fundamentals
Java bytecode serves as the intermediate representation executed by the JVM, compiled from Java source through javac. Le Mouël examines the stack-based execution model:
// Source
public int add(int a, int b) {
return a + b;
}
// Bytecode
0: iload_1
1: iload_2
2: iadd
3: ireturn
Each instruction operates on the operand stack, with local variables accessed through indexed loads and stores. The JVM’s verification process ensures type safety before execution.
ASM: The Foundation of Bytecode Manipulation
ASM provides a low-level API for generating and transforming class files. Le Mouël demonstrates method instrumentation:
public class AddTraceTransformer extends ClassVisitor {
    public AddTraceTransformer(ClassVisitor cv) {
        super(Opcodes.ASM9, cv);   // delegate to the underlying visitor
    }

    @Override
    public MethodVisitor visitMethod(int access, String name,
            String desc, String signature, String[] exceptions) {
        MethodVisitor mv = super.visitMethod(access, name, desc, signature, exceptions);
        if ("process".equals(name)) {
            return new AddTraceAdapter(mv);   // instrument only the targeted method
        }
        return mv;
    }
}

class AddTraceAdapter extends MethodVisitor {
    AddTraceAdapter(MethodVisitor mv) {
        super(Opcodes.ASM9, mv);
    }

    @Override
    public void visitCode() {
        mv.visitCode();
        // Inject System.out.println("Entering method") at the start of the method body.
        mv.visitFieldInsn(GETSTATIC, "java/lang/System", "out", "Ljava/io/PrintStream;");
        mv.visitLdcInsn("Entering method");
        mv.visitMethodInsn(INVOKEVIRTUAL, "java/io/PrintStream", "println", "(Ljava/lang/String;)V", false);
    }
}
This transformation inserts tracing without modifying source code.
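A minimal sketch of wiring this visitor into a transformation pipeline with ASM’s ClassReader and ClassWriter, assuming the delegating constructor shown above; how the resulting bytes are loaded (custom ClassLoader, java.lang.instrument agent) is left open:
import org.objectweb.asm.ClassReader;
import org.objectweb.asm.ClassWriter;

public class TransformDemo {
    // Takes the original class bytes and returns the traced version.
    public static byte[] addTracing(byte[] originalBytes) {
        ClassReader reader = new ClassReader(originalBytes);
        ClassWriter writer = new ClassWriter(reader, ClassWriter.COMPUTE_MAXS);
        reader.accept(new AddTraceTransformer(writer), 0);   // run the visitor chain
        return writer.toByteArray();
    }
}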
AspectJ: High-Level AOP Through Weaving
AspectJ compiles aspects into bytecode transformations. Le Mouël shows transaction management:
@Aspect
public class TransactionAspect {
@Around("execution(* com.bank.*.*(..))")
public Object manageTransaction(ProceedingJoinPoint pjp) throws Throwable {
beginTransaction();
try {
Object result = pjp.proceed();
commitTransaction();
return result;
} catch (Throwable t) {
rollbackTransaction();
throw t;
}
}
}
The AspectJ weaver transforms target classes at compile-time or load-time.
Byteman: Runtime Rule Injection
Byteman enables dynamic rule injection into running JVMs:
RULE Trace method entry
CLASS com.example.Service
METHOD process
AT ENTRY
IF true
DO traceln("Entering process")
ENDRULE
Le Mouël demonstrates hot-patching production systems for debugging.
JooFlux: Research Framework for Dynamic Adaptation
JooFlux introduces a registry-based approach to bytecode manipulation:
JooFlux.register(new Interceptor() {
public void intercept(Call call) {
if (call.getMethodName().equals("process")) {
System.out.println("Intercepted: " + call.getArguments());
call.proceed();
}
}
});
The framework maintains original bytecode for potential restoration.
Links:
[DevoxxFR2012] Nagios checks via NRPE
package "nagios-nrpe-server"
He demonstrates real-time dashboards that correlate application metrics with infrastructure health, enabling rapid incident response and informed scaling decisions.
Economic and Organizational Impact: Beyond Technical Excellence
Bertrand Paquet concludes with a quantitative analysis of the pipeline’s impact: deployment frequency increased from weekly to multiple times daily, mean time to recovery decreased from hours to minutes, and infrastructure costs became predictable and scalable. He argues that these technical achievements enable organizational agility—new features can be tested in production with real users within hours of conception, creating tight feedback loops that drive product excellence.
Links:
[DevoxxFR2012] Update user subscription status
end
end
Roux advocates testing pricing models through A/B experiments on payment pages, measuring conversion rates across different tiers. He discusses freemium approaches where core functionality remains free while premium features drive revenue, using analytics to identify which capabilities users value sufficiently to pay for.
Cultural Implications: Democratizing Innovation Through Methodological Discipline
Camille Roux concludes with a broader reflection on how these practices fundamentally democratize technological innovation. By reducing the barrier to entry from months of development to days of validation and prototyping, the methodology empowers a wider spectrum of creators—solo developers, domain experts without coding expertise, and intrapreneurs within large organizations—to test ideas systematically. He argues that this shift represents not merely a technical evolution but a cultural transformation, where failure becomes inexpensive learning rather than career-ending catastrophe, and where success emerges from disciplined experimentation rather than visionary genius.