Technical documents
Find the latest technical documents published by members of our collaboration below.
Title | Date | File |
---|---|---|
Forecasting the LINAC3 Ion Beam Current
In the last decade or so, the main attraction at CERN has been the Large Hadron Collider (LHC), which was indispensable to the discovery of the Higgs boson, as well as that of many other particles.[1] The LINAC3 is an integral part of the CERN accelerator complex, as it provides the initial ion b |
11-02-22 | |
Grafana for WinCC-OA based SCADA systems at CERN
WinCC-Open Architecture is a toolkit for creating Supervisory Control and Data Acquisition (SCADA) applications. There are about 650 instances of WinCC OA deployed across the accelerator control systems and experiments at CERN. |
01-10-21 | |
The ATLAS Detector as a Muon Fixed-Target Experiment: Using Generative Models to Simulate Muonic Force Carriers
Greater luminosities of future Large Hadron Collider runs will demand an unprecedented number of event simulations. Computationally that would be an extremely demanding task. Hence new approaches for such undertakes are required. |
01-10-21 | |
Automated Benchmarking of Algorithms for Quantum Systems
We have developed a software platform, ABAQUS (Automated Benchmarking of Algorithms for Quantum Systems), that can be used to benchmark the performance of both software frameworks and hardware devices used in the simulation of quantum computers. |
01-10-21 | |
Performance Visualization of ROOT/IO on HPC Storage Systems
HPC systems are becoming ever more important as a data processing resource for the LHC experiments. HPC sites typically use storage systems different from the well-understood HEP storage systems. |
01-10-21 | |
Implementation of Projects for Periodic Task Execution
On Unix systems (like Linux), the cron daemon allows users to periodically execute specific tasks. |
01-10-21 | |
JEEDY – ORDS Management API
The number of functionalities covered by the Oracle database increases from release to release. Thus also the possibilities to communicate with it from outside. More and more systems use HTTP requests in the form of REST API calls to interact with other systems. |
01-10-21 | |
Quantum GANs for 𝒕𝒕̅𝑯(𝒃𝒃̅) Process Data Generation
In this report, we present the Deep Learning generative model GAN for the Higgs bosont 𝒕𝒕̅𝑯(𝒃𝒃̅) process data generation. Initially, a classical GAN model is considered, with Convolutional layers, Batch Normalization layers, and a Leaky ReLU activation function. |
01-10-21 | |
BioDynaMo Lab
BioDynaMo is an open-source, high-performance, and modular agent-based simulation platform. The main goal of this project is to make BioDynaMo more accessible to the public. |
01-10-21 | |
Evaluation of HPC simulation tools for efficient and cost-effective resource provisioning
At the beginning of my internship, I was introduced to a few tools used at CERN. These tools include the HPCBatch cluster, where I was going to develop the project. |
01-10-21 | |
Noise influence on Quantum Machine Learning models' performance
We analyse how the training and performance of VQC models is affected by noise inherent to NISQ devices. In particular, we study the influence of three different types of quantum hardware noise: measurement errors, single qubit gate errors, and two-qubit gate errors (e.g., CNOT gate). |
01-10-21 | |
Support for Mobile Devices CERN Search Portal
The CERN Search as a Service project provides a platform for providing indexing and search functionality to a wide range of CERN information sources such as Indico, EDMS, SharePoint, Drupal and CERN’s Web pages. The content of these indexes is available via the CERN Search Portal. [1] |
01-10-21 | |
Performance Evaluation of TimescaleDB for Storage of Historical Data from WinCC OA SCADA Systems
This project was completed in the scope of NextGen Archiver (NGA) for WinCC OA SCADA systems. The NGA is a new archiver for WinCC OA that uses a pluggable architecture to support multiple database technologies. |
01-10-21 | |
Abstraction of user storage mechanisms for heterogeneous REANA scientific pipelines
REANA is an open-source reusable research data analysis platform, that allows researchers to run their analyses in remote compute clouds. The analyses use containerised environments and rely on declarative computational workflow specifications. |
01-10-21 | |
Control Interface for the Digital Memory Platform
CERN produces a huge amount of data, such as experimental results, digital documents and multimedia content, which needs to be stored, archived and preserved for many years to come. |
01-10-21 | |
Travelling Holidayman Problem
The Travelling Salesman Problem is a well-known problem where it is required to figure out the shortest route to visit all the stops. Even though it was formulated almost over a hundred years ago, nowadays it still has relevance and interest. |
01-10-21 | |
Deep Learning techniques for signal processing and event reconstruction in DUNE
DUNE - Deep Underground Neutrino Experiment - will be an experiment based in the USA whose main goal will be to study long-baseline neutrino oscillations from an accelerator beam. |
01-10-21 | |
Accelerating HEP Workloads on Kubernetes
Deploying a new open-source based service is an extensive and challenging process, from both technical and user-support perspective. |
01-10-21 | |
Machine Learning for 40 MHZ Scouting at CMS
The Level 1 (L1) trigger at CMS uses coarse-grained information to search for signatures of interesting physics. L1 scouting is a new paradigm for data collection at CMS which could help in the early identification of promising potential signals, independently of any trigger selection bias. |
01-10-21 | |
Platform for Reproducible Analysis
This report investigates how to create reproducible analyses by providing a systemic approach for general users to follow best research practises. It bases on REANA, a dedicated tool for analysis reproducibility developed by CERN. |
01-10-21 | |
Managing Kubernetes Clusters in a multi cloud environment
Nowadays production applications of the Database Applications and Reporting services section (IT-DB-DAR) are running in Kubernetes, taking advantage of the benefits of this technology. |
01-10-21 | |
Benchmarking BioDynaMo
The purpose of this project was to test and analyse the BioDynaMo project’s performance. In order to achieve this, BioDynaMo uses its own benchmark tools. Benchmarking is highly important because it allows us to test the code performance and monitor the effect of the changes. |
01-10-21 | |
EOS Storage Resource Monitor
This report describes how the EOS Storage Resource Monitor was created. It explains which tools were used and the reasoning behind these choices. It describes the deployment model and the visualisations that are displayed. |
01-10-21 | |
Open Source Cloud Costing Framework
In the last years the amount of resources that are needed to conclude experiments at CERN grew rapidly, and some computations began to be performed in clouds. To control them we need some tools for cloud costing. |
01-10-21 | |
Data Lake as a Service for Open Science
The project is aimed at deploying the ESCAPE DataLake-as-a-Service over the course of the summer. The DataLake-as-a-Service is a service that allows end-users to interact with the ESCAPE Data Lake in an easily-understandable and user-friendly way. |
01-10-21 | |
PORTABILITY FOR THE PATATRACK PIXEL TRACKS AND VERTICES RECONSTRUCTION
The purpose of this report is to explain in detail the improvements to the Alpaka version of the Patatrack pixel track and vertex reconstruction repository in the form of caching allocators for allocating and reusing device and host memory as well as smart pointers as an interface for memory mana |
01-10-21 | |
CERN Quantum Technology Initiative Strategy and Roadmap
Quantum technologies have the potential to revolutionise science and society as early as the next five to ten years but require resources that are not mainstream today. |
07-10-21 | |
Higgs analysis with quantum classifiers
We have developed two quantum classifier models for the ttH¯ (bb¯) classification problem, both of which fall into the category of hybrid quantumclassical algorithms for Noisy Intermediate Scale Quantum devices (NISQ). |
15-04-21 | |
On a poset of quantum exact promise problems
Two of the most well-known quantum algorithms, those introduced by Deutsch–Jozsa and Bernstein–Vazirani, can solve promise problems with just one function query, showing an oracular separation with deterministic classical algorithms. |
01-06-21 | |
Dual-Parameterized Quantum Circuit GAN Model in High Energy Physics
Generative models, and Generative Adversarial Networks (GAN) in particular, are being studied as possible alternatives to Monte Carlo simulations. It has been proposed that, in certain circumstances, simulation using GANs can be sped-up by using quantum GANs (qGANs). |
29-03-21 | |
Quantum Generative Adversarial Networks in a Continuous-Variable Architecture to Simulate High Energy Physics Detectors
Deep Neural Networks (DNNs) come into the limelight in High Energy Physics (HEP) in order to manipulate the increasing amount of data encountered in the next generation of accelerators. |
26-01-21 | |
Embedding of particle tracking data using hybrid quantum-classical neural networks
The High Luminosity Large Hadron Collider (HL-LHC) at CERN will involve a significant increase in complexity and sheer size of data with respect to the current LHC experimental complex. |
12-11-21 | |
Hybrid Quantum Classical Graph Neural Networks for Particle Track Reconstruction
The Large Hadron Collider (LHC) at the European Organisation for Nuclear Research (CERN) will be upgraded to further increase the instantaneous rate of particle collisions (luminosity) and become the High Luminosity LHC (HL-LHC). |
26-09-21 | |
A Serverless Cloud Integration For Quantum Computing
Starting from the idea of Quantum Computing which is a concept that dates back to 80s, we come to the present day where we can perform calculations on real quantum computers. |
05-07-21 | |
Convolutional LSTM models to estimate network traffic
Network utilisation efficiency can, at least in principle, often be improved by dynamically re-configuring routing policies to better distribute ongoing large data transfers. |
23-08-21 | |
High energy physics calorimeter detector simulation using generative adversarial networks with domain related constraints
Generative Adversarial Networks (GANs) have gained notoriety by generating highly realistic images. The present work explores GAN for simulating High Energy Physics detectors, interpreting detector output as three-dimensional images. |
02-08-21 | |
Fast Simulation of a High Granularity Calorimeter by Generative Adversarial Networks
We present the 3DGAN for the simulation of a future high granularity calorimeter output as three-dimensional images. |
09-09-21 | |
Pandemic drugs at pandemic speed: infrastructure for accelerating COVID-19 drug discovery with hybrid machine learning-and physics-based simulations on high-performance computers
The race to meet the challenges of the global pandemic has served as a reminder that the existing drug discovery process is expensive, inefficient and slow. |
12-10-21 | |
MadFlow: automating Monte Carlo simulation on GPU for particle physics processes
We present MadFlow, a first general multi-purpose framework for Monte Carlo (MC) event simulation of particle physics processes designed to take full advantage of hardware accelerators, in particular, graphics processing units (GPUs). |
18-06-21 | |
Convolutional LSTM models to estimate network traffic
Network utilisation efficiency can, at least in principle, often be improved by dynamically re-configuring routing policies to better distribute ongoing large data transfers. |
06-07-21 | |
HEPiX Benchmarking Solution for WLCG Computing Resources
The HEPiX Benchmarking Working Group has developed a framework to benchmark the performance of a computational server using the software applications of the High Energy Physics (HEP) community. This framework consists of two main components, named HEP-Workloads and HEPscore. |
14-12-21 | |
HEP workloads at HPC
As part of CERN-GEANT-PRACE-SKA collaboration and in the context of EGI-ACE (Advanced Computing for the European Open Science Cloud ) collaborators are working towards enabling efficient HPC use for Big Data sciences. |
29-11-21 | |
Quantum Classifiers Hybrids with Advanced Data Compression Methods for Higgs Identification on Noisy Simulations
The advantage of quantum computers over classical devices lies in the possibility of using quantum superposition effects of n qubits to perform exponential computations in parallel. |
29-11-21 | |
Study on impacts of quantum noises on qGAN training
In classical deep learning, a number of studies have proven that noise plays a crucial role in the training of neural networks. Artificial noises are often injected in order to make the model more robust, faster converging, and stable. |
29-11-21 | |
Running the Dual-PQC GAN on noisy simulators and real quantum hardware
In an earlier work [1], we introduced dual-Parameterized Quantum Circuit (PQC) Generative Adversarial Networks (GAN), an advanced prototype of quantum GAN, which consists of a classical discriminator and two quantum generators that take the form of PQCs. |
29-11-21 | |
Ensemble Generative Models for Calorimeter Simulations
Foreseen increasing demand for simulations of particle transport through detectors in High Energy Physics motivated the search for faster alternatives to Monte Carlo based simulations. |
29-11-21 | |
Accelerating GAN training using highly parallel hardware on public cloud
With the increasing number of Machine and Deep Learning applications in High Energy Physics, easy access to dedicated infrastructure represents a requirement for fast and efficient R&D. |
08-11-21 | |
Deep Learning Strategies for ProtoDUNE Raw Data Denoising
In this work, we investigate diferent machine learning-based strategies for denoising raw simulation data from the ProtoDUNE experiment. The ProtoDUNE detector is hosted by CERN and it aims to test and calibrate the technologies for DUNE, a forthcoming experiment in neutrino physics. |
02-03-21 | |
Quantum Machine Learning for HEP detectors simulations
Quantum Machine Learning (qML) is one of the most promising and very intuitive applications on near-term quantum devices which possess the potential to combat computing resource challenges faster than traditional computers. |
06-07-21 | |
Benchmark of Generative Adversarial Networks for Fast HEP Calorimeter Simulations
Highly precise simulations of elementary particles interaction and processes are fundamental to accurately reproduce and interpret the experimental results in High Energy Physics (HEP) detectors and to correctly reconstruct the particle flows. |
07-01-22 | |
Physics Validation of Novel Convolutional 2D Architectures for Speeding Up High Energy Physics Simulations
The precise simulation of particle transport through detectors remains a key element for the successful interpretation of high energy physics results. However, Monte Carlo based simulation is extremely demanding in terms of computing resources. |
19-05-21 | |
An in silico hybrid continuum-/agent-based procedure to modelling cancer development: Interrogating the interplay amongst glioma invasion, vascularity and necrosis
A paper that develops a three-dimensional in silico hybrid model of cancer, which describes the multi-variate phenotypic behaviour of tumour and host cells. |
01-01-21 | |
GPU Acceleration of 3D Agent-Based Biological Simulations
Researchers in biology are faced with the tough challenge of developing high-performance computer simulations of their increasingly complex agent-based models. |
30-04-21 | |
BioDynaMo: a modular platform for high-performance agent-based simulation
Motivation: Agent-based modeling is an indispensable tool for studying complex biological systems. However, existing simulation platforms do not always take full advantage of modern hardware and often have a field-specific software design. |
16-09-21 | |
Platform for Reproducible Analyses
This report investigates how to create reproducible analyses by providing a systemic approach for general users to follow best research practises. It bases on REANA, a dedicated tool for analysis reproducibility developed by CERN. |
29-09-21 | |
Exploring hybrid quantum-classical neural networks for particle tracking
The High Luminosity Large Hadron Collider (HL-LHC) at CERN will involve a significant increase in the complexity and sheer size of data with respect to the current LHC experimental complex. |
01-09-20 | |
Performance of Particle Tracking Using a Quantum Graph Neural Network
The Large Hadron Collider (LHC) at the European Organisation for Nuclear Research (CERN) will be upgraded to further increase the instantaneous rate of particle collisions (luminosity) and become the High Luminosity LHC. |
25-01-21 | |
Quantum Track Reconstruction Algorithms for non-HEP applications
The expected increase in simultaneous collisions creates a challenge for accurate particle trackreconstruction in High Luminosity LHC experiments. Similar challenges can be seen in non-HEPtrajectory reconstruction use-cases, where tracking and track evaluation algorithms are used. |
28-07-21 | |
HIOS: Heterogeneous I/O for Scale
The project HIOS aims to investigate ways of providing efficient interfaces for I/O intensive applications. |
07-06-21 | |
Quantum Gate Pattern Recognition and Circuit Optimization for Scientific Applications
There is no unique way to encode a quantum algorithm into a quantum circuit. With limited qubit counts, connectivities, and coherence times, circuit optimization is essential to make the best use of near-term quantum devices. |
02-12-21 | |
Event Classifcation with Quantum Machine Learning in High‑Energy Physics
We present studies of quantum algorithms exploiting machine learning to classify events of interest from background events, one of the most representative machine learning applications in high-energy physics. |
03-01-21 | |
Validation of Deep Convolutional Generative Adversarial Networks for High Energy Physics Calorimeter Simulations
In particle physics the simulation of particle transport through detectors requires an enormous amount of computational resources, utilizing more than 50% of the resources of the CERN Worldwide Large Hadron Collider Grid. |
25-03-21 | |
Reduced Precision Strategies for Deep Learning: A High Energy Physics Generative Adversarial Network Use Case
Deep learning is finding its way into high energy physics by replacing traditional Monte Carlo simulations. However, deep learning still requires an excessive amount of computational resources. |
18-03-21 | |
The True Random Privacy Project
True Random Privacy (TRP) project, developed during the Random Power hackathon 2020, aims to create a new differential privacy solution for images, embedding a state-of-the-art features description technique. |
05-11-20 | |
Machine Learning applications on OpenStack log data analysis
A massive amount of data is generated by the Openstack cloud services in the format of service logs. Besides timestamps and log level fields, these logs contain additional information useful for pattern analysis. |
31-08-19 | |
Automation Tools for Invenio
Invenio is an open source framework, initially developed at CERN, but with many external users and contributors at this moment and prospects of growing even more in the future. Its nature as a digital |
31-08-19 | |
Graph Neural Network Inference on FPGA
Graph Neural Network possess prospect in track reconstruction for the Large Hadron Collider use-case due to high dimensional and sparse data. |
31-08-20 | |
Summer-student report: Automation Tools for Invenio
Invenio is an open source framework, initially developed at CERN, but with many external users and contributors at this moment and prospects of growing even more in the future. Its nature as a digital |
31-08-19 | |
Summer-student report: Neuromorphic Computing in High Energy Physics
At particle colliders, more data are produced than what the experiments can store for further analysis. This is why the incoming collisions are processed in real time by a so-called trigger system. At the |
31-08-19 | |
Summer-student report: IMPLEMENTATION OF A QUANTUM PERCEPTRON IN INTEL-QS
With the pervasiveness of high-speed computers and processors, computer companies are looking for new technologies to incorporate into their products and use as a competitive advantage in the market. Two modern and rapidly growing techniques are quantum computing and the use |
31-08-19 | |
Summer-student report: Performance monitoring using intel performance counters for HEP applications
The HPC service at CERN provides linux batch infrastructure to run high performance computing applications that require MPI clusters.The HPC cluster is therefore dedicated to run MPI programs. |
31-08-19 | |
Summer-student report: EOS Winston: Expert Systems for Automated Diagnosis and Remediation
This report describes EOS Winston, an event driven alerting and mitigation automation platform. Through the use of expert rules and online anomaly detection algorithms, it catches events which |
31-08-19 | |
Summer-student report: Portable Early Prediction of Sepsis from Clinical Data on Intel Myriad X
Sepsis is a life-threatening condition where microbes present in the blood stream cause an unregulated immune response from the body which can result in tissue damage, multi-organ failure |
31-08-19 | |
Summer-student-report: Deep I/O Performance Analysis of CernVM-FS using Modern Linux Tools
This report describes performance analysis of the CernVM-FS FUSE which is a software distribution service used in high-energy physics research. The performance analysis was conducted in both kernel |
31-08-19 | |
Summer-student report: EOS Integration into OpenStack Manila
The purpose of this report is to provide a brief overview of what OpenStack is, focusing on the advantages of the integration of its Manila component at CERN. Furthermore, this document briefly |
31-08-19 | |
Summer-student-report: Continuous integration for containerized scientific workflows
On this project, we decided to implement two solutions that integrate REANA and GitLab. They vary on two main points. The first one is the amount of configuration necessary to set up the integration, |
31-08-19 | |
Summer-student-report: Building effective Restful APIs with Oracle Rest Data Services 19
In 2005, the first installation of the Oracle HTML DB came out in production. Very soon the CERN developer community adopted the technology, using it in all the areas of the organization, from administrative applications to accelerators control system. |
31-08-19 | |
Summer-student-report: Web - UI development IoT Security Framework
The IoT security framework is a computer security platform designed to assess the risks of various heterogeneous IoT devices. The framework is currently being developed at CERN and analyses different IoT devices connected to CERN’s General Purpose Network (GPN). The GPN mostly |
31-08-19 | |
Calorimetry with Deep Learning: Particle Simulation and Reconstruction for Collider Physics
Using detailed simulations of calorimeter showers as training data, we investigate the use of deep learning algorithms for the simulation and reconstruction of particles produced in high-energy physics collisions. |
14-12-19 | |
Summer-student-report: Evaluation of Erasure Coding and other features of Hadoop 3
Hadoop ecosystem is distributed computing platform for Big Data solutions by comprising autonomous components such as HDFS, Spark, YARN etc. HDFS is a Hadoop Distributed File System for data storage. Current HDFS supports 3x replication for data fault-tolerance. When a |
31-08-19 | |
Summer-student-report: Big Data Analysis and Machine Learning at Scale with Oracle Cloud Infrastructure
This work has successfully deployed two different use cases of interest for High Energy Physics using cloud resources: CMS Big data reduction: This use case consists in running a data reduction workloads for |
31-08-19 | |
Summer-student-report: Function-as-a-Service on Kubernetes using Knative
The CERN Cloud Infrastructure team provides compute resources as a service to teams across CERN. Users can provision resources to process experiment data, host web applications, and accomplish other computing tasks.
|
31-08-19 | |
Summer-student-report: Benchmarking and optimising large scale parallel workflows
The main idea of this project is to carry out performance analysis on the RDataFrame class within the ROOT operational framework. For this purpose, scalability analysis are performed on the execution |
31-08-19 | |
Summer-student-report: Anomaly Detection in the Elasticsearch Service
The Elasticsearch Service is a distributed search and analytics engine widely used across CERN. Currently, issues in the service are resolved manually after being detected through internal monitoring by service |
31-08-19 | |
Summer-student-report: Benchmarking tools for NextGen Archiver for WinCC OA
On this project we focused on benchmarking Influx against Oracle database. One of the primary reason is ETM/Seimens were already working on Influx database backend. To perform benchmarking using the Query Benchmark Tool we needed to have same data |
31-08-19 | |
Summer-student-report: Performance study of parquet codecs
This report describes the work carried out to study and evaluate the performance and footprint of different parquet compression codecs on data retrival and analytics scenarios Parquet is a standard-de-facto and the data format used to persist |
31-08-19 | |
Summer-student-report: Improving BioDynaMo build system
When developing new programs or scientific libraries most of the efforts are focused on providing efficient algorithms, the state-of-the-art techniques and maximum flexibility. However, in order for a |
31-08-19 | |
Summer-student-report: Evaluate ElastAlert for IT-DB use cases
The Database Services Group (IT-DB) is responsible for providing database and middleware services to the laboratory. For these services, it is necessary to provide proper monitoring solutions to different user |
31-08-19 | |
Summer-student-report: Real-Time Server Monitoring and CNN Inference on FPGA
Neutrinos are subatomic particles, very similar to an electron, but without any electrical charge and a very negligible rest mass. They are the most abundant and perhaps the most mysterious matter particles in the universe!
|
31-08-19 | |
Summer-student-report: Using deep learning for particle identification and energy estimation in CMS HGCAL L1 trigger
In run 4 of the LHC, the extreme high luminosity is expected to generate an enormous pileup of up to 200 proton-proton collisions for each bunch crossing. This has to be read out at 750 kHz with a maximum |
31-08-19 | |
Summer-student-report: Apache Spark on Hadoop YARN & Kubernetes for Scalable Physics Analysis
Big Data Technologies popularity continues to increase each year. The vast amount of data produced at the LHC experiments, which will increase further after the upgrade to HL-LHC, makes the exploration of new ways to perform physics |
31-08-18 | |
Summer-student-report: HGCAL Fast Simulation with Deep Learning
This project uses Wasserstain Generative Adversatial Networks (WGANs) to supply the demand for large simulation samples in the event of the CMS Phase II Upgrade. The distributions of real |
31-08-18 | |
Summer-student-report: Achieve a 0-downtime CERN Database infrastructure
At CERN we have many systems which provide critical services and scheduling downtime for them is quite difficult. Live kernel patching is a technique which aims to update the system without |
31-08-18 | |
Summer-student-report: Introducing heterogeneous farms in the CMS framework
The High Luminosity upgrade scheduled for 2026 will greatly increase the number of events per collision. Moore’s law will optimistically get a factor 4 performance gain, not enough to handle the |
31-08-18 | |
Summer-student-report: Java Mission Control Evaluation
This reports summarises the project I worked on during my internship with the IT-DB-IMS team. This report will detail my efforts to configure various technologies to work with Java Mission Control, the |
31-08-18 | |
Summer-student-report: Efficient unpacking of required software from CERNVM-FS
In recent times a tool for efficient unpacking of software work-flows from CernVM File System (CVMFS) into standalone images has become necessary. There are two types of use cases for such images: On the one hand they can be used to deliver |
31-08-18 | |
Summer-student-report: Benchmarking Machine Learning in HEP
The interest on machine learning workloads in the HEP community has increased exponentially in the last years, making more and more important the need of a thorough benchmarking of the most relevant/significant workloads that are going to run on the experiments. The purpose |
31-08-18 | |
Summer-student-report: Evaluating Ceph Deployments with Rook | 31-08-18 | |
Summer-student-report: Scanning Containers for Vulnerabilities on Kubernetes Clusters
On this project, we chose to work with Clair, the tool developed by CoreOS, which uses static analysis to find vulnerabilities in container images. To use Clair, we had to build a Python client, |
31-08-18 | |
Summer-student-report: Benchmarking Kudu and Oracle in typical WinCC OA historical data retrieval use cases
WinCC Open Architecture is a toolkit for creating Supervisory Control and Data Acquisition (SCADA) applications, which is widely used at CERN. Hundreds of controls applications, both in the accelerator complex and the experiments are based on it, |
31-08-18 | |
Summer-student-report: KPIs Dashboard for Invenio-Related Services
The purpose of this report is to document the project I was working on for nine weeks during the summer of 2018. As part of the CERN openlab Summer Student Program 2018 I had the opportunity to work with the Digital Repositories (IT-CDA-DR) section at CERN on developing a |
31-08-18 | |
Summer-student-report: Technical Network Validation Using Open-shift
The interest in using containers to package applications is constantly growing in the software development community, especially with new technologies such as Kubernetes, Open-shift being adopted more frequently as well. This project also based on modularising the currently |
31-08-18 | |
Summer-student-report: Automated Shelter Recognition in Refugee Camps
In June 2018, more than 68.5 Million people across the globe were reported to be fleeing war or persecution. Within the United Nations, UNOSAT is the organ in charge of collecting demo- |
31-08-18 | |
Summer-student-report: Develop streaming pipelines and analytics solutions for CERN's IoT Platform
There are two very popular concepts that we hear in the world of technology, Big Data and Internet of Things. Big data is referring to a data which size, complexity and velocity is really high and is difficult to capture, pre-process and analyze it with |
31-08-18 | |
Summer-student-report: Distributed BioDynaMo
Computer simulations have become a very powerful tool for scientific research. In order to fa- cilitate research in computational biology, the BioDynaMo project aims at a general platform for |
31-08-18 | |
Summer-student-report: GPGPU Accelerated Beam Dynamics Interfacing PyHEADTAIL with SixTrackLib
Simulations of beam dynamics vastly profit from parallelisation with high performance computing tech- niques. The two simulation libraries SixTrackLib and PyHEADTAIL are GPGPU accelerated. The former |
31-08-18 | |
Summer-student-report: Optimization of Data Transfer for 100 Gb/s Ethernet
In 2019 the LHCb experiment will go through an important upgrade, that will improve performance in many fields. One oh these fields is the DAQ system: it consists of a big flow of data that comes |
31-08-18 | |
Summer-student-report: Employing HPC for Heterogeneous HEP Data Processing
One of the most time consuming algorithms that is currently employed for the reconstruction of High Energy Physics (HEP) workflows is the local energy reconstruction. The time spent to execute this algorithm constitutes 24% of the total processing time, thus achieving substantial |
31-08-18 | |
Summer-student-report: POSEIDON - Analyzing the secrets of the Trident Node monitoring
Improving the performance of an application is an important objective carried out from the application conception until its deprecation. Developers are constantly trying to improve the performance of their |
31-08-18 | |
Summer-student-report: yXRootD PyPI distribution and new declarative file access API for XRootD Client
The project described in this report is related to XRootD framework development. It was divided into two parts. First part was about publishing XRootD python bindings called PyXRootD to Python Package Index. This makes PyXRootD installation much easier and resolves problem |
31-08-18 | |
Summer-student-report: Parallel Task Execution
Puppet is a great tool for making changes on systems, and ensuring that those changes happen. But Puppet is not intended to make this happen on many systems at the same time. Puppet is intended for eventual compliance over time. Each agent checks in over a period of time, al- |
31-08-18 | |
Summer-student-report: Thin Element Comparison Between MAD-X and SixTrack
In this report thin, single elements were compared between MAD-X and SixTrack. A testing framework for efficient comparisons between the two tracking codes was developed. A few dif- ferences between the tracking codes were found then documented and two bugs, one in the |
31-08-18 | |
Summer-student-report: OpenStack Infrastructure Optimization Service
CERN operates an OpenStack based private cloud to provide its users with resources on demand. It is one of the largest OpenStack deployments in the world, with more than 300,000 cores over 9,000 hypervisors [1]. |
31-08-18 | |
Summer-student-report: MPI Learn - distributed training
MPI Learn is a framework for the distributed training of neural networks. This platform is aimed at machine learning users, who can use it to train models faster, without dealing with the com- |
31-08-18 | |
Summer-student-report: Function as a Service
Function as a service (FaaS) is a category of cloud computing services that provides a platform allowing customers to develop, run, and manage application functionalities without the complexity of building and maintaining the infrastructure |
31-08-18 | |
Summer-student-report: Natural Language Processing for Scientific Research
The goal of this Openlab project is to create a Smart Data Analytics Platform for Science that will host analytical tools, publish data, share resources, interact with bots, collaborate and build communities of researchers with various backgrounds in a single ecosystem. With |
31-08-18 | |
Summer-student-report: Deep Representation Learning for Trigger Monitoring
We propose a novel neural network architecture called Hierarchical Latent Autoencoder to exploit the underlying hierarchical nature of the CMS Trigger System for data quality monitoring. |
31-08-18 | |
Summer-student-report: Evaluation of Containers for HPC
Some of the main challenges in scientific computing today deal with performance-preserving portability of software and reproducibility of the final results; likewise, with the advent of modern |
31-08-18 | |
Summer-student-report: Information aggregation and analytics for ATLAS Frontier
Squid-Frontier system [1] is currently used to manage access to the COOL database [2]. This system includes many widely distributed computing sites and applications. Clients presented by PanDA (Production ANd Distributed Analysis system, the ATLAS’ |
31-08-18 | |
Summer-student-report: Malware analysis management
Malware Analysis Management (M.A.M.) or the automated sandbox analysis of quarantined malware samples focuses on a detailed analysis of malware samples reaching CERN through email traffic. M.A.M. is a side process of the main email pipeline |
31-08-18 | |
Summer-student-report: REANA - user dashboard for reusable analysis platform
REANA is a reusable analysis platform which offers physicists the ability to structure their research data analysis and run their computational workflows in a containerized computing cloud. |
31-08-18 | |
A New Platform for Large-Scale Biological Simulation
Computer simulations have become a very powerful tool for scientific research. In order to facilitate research in computational biology, the BioDynaMo project aims at a general platform for biological computer simulations, which should be executable on hybrid cloud computing systems. |
28-11-16 | |
From Physics to industry: EOS outside HEP
In the competitive market for large-scale storage solutions the current main disk storage system at CERN EOS has been showing its excellence in the multi-Petabyte high-concurrency regime. |
01-09-17 | |
Exploring RapidIO Technology within a DAQ System Event Building Network
Exploring RapidIO RapidIO (http://rapidio.org/) technology is a packet-switched high-performance fabric, which has been under active development since 1997. The technology is used in all 4G/LTE basestations worldwide. |
10-05-17 | |
RapidIO as a multi-purpose interconnect
RapidIO (http://rapidio.org/) technology is a packet-switched high-performance fabric, which has been under active development since 1997. Originally meant to be a front side bus, it developed into a system level interconnect which is today used in all 4G/LTE base stations world wide. |
20-06-17 | |
A Deep Learning tool for fast detector simulation
Machine Learning techniques have been used in different applications by the HEP community: in this talk, we discuss the case of detector simulation. |
27-06-18 | |
An optimization approach for agent-based computational models of biological development
Current research in the field of computational biology often involves simulations on high-performance computer clusters. It is crucial that the code of such simulations is efficient and correctly reflects the model specifications. |
10-03-18 | |
CERN openlab: Engaging industry for innovation in the LHC Run 3-4 R&D programme
LHC Run3 and Run4 represent an unprecedented challenge for HEP computing in terms of both data volume and complexity. New approaches are needed for how data is collected and filtered, processed, moved, stored and analysed if these challenges are to be met with a realistic budget. |
04-03-17 | |
Extending an asynchronous messaging library using an RDMA-enabled interconnect
As computing power and I/O performance is increasing at an aggressive rate several RDMA enabled interconnect technologies have been entering the market, promising low latency and high throughput. |
20-03-17 | |
1000 things you always want to know about SSO but you never dare to ask | 12-10-17 | |
Exploring RapidIO Technology Within a DAQ System Event Building Network
RapidIO technology is a packet-switched high-performance fabric, which has been under active development since 1997. The technology is used in all 4G/LTE base stations worldwide. |
09-09-17 |