International Journal of Information Technology and Computer Science Applications https://ejurnal.jejaringppm.org/index.php/jitcsa <table border="0" width="100%"> <tbody> <tr> <td align="justify" valign="top"><strong>ISSN Print (2964-3139) based on Decree Number 29643139/II.7.4/SK.ISSN/01/2023 dated January 18, 2023;</strong> <p><strong>ISSN Online (2985-5330) based on Decree Number 29855330/II.7.4/SK.ISSN/02/2023 dated February 15, 2023</strong></p> <p><strong>URL : <a href="https://ejurnal.jejaringppm.org/index.php/jitcsa">https://ejurnal.jejaringppm.org/index.php/jitcsa</a></strong></p> <p><strong>The International Journal of Information Technology and Computer Science Applications (IJITCSA)</strong> is an information technology and computer science publication. Applications from both fields for solving real cases are also welcome. The JITCSA accepts research articles, systematic reviews, literature studies, and other relevant ones. The IJITCSA focuses on several fields of science, including information technology and the like and computer science fields such as artificial intelligence, data science, data mining, machine learning, deep learning, and the like. <br /><br />IJITCSA is published three times a year, in January, May, and September. The first issue in January 2023 had eight articles.</p> </td> </tr> </tbody> </table> Jejaring Penelitian dan Pengabdian Masyarakat en-US International Journal of Information Technology and Computer Science Applications 2964-3139 <h1>Attribution 4.0 International</h1> <div id="deed-body"> <h2 id="rights">You are free to:</h2> <ol> <li><strong> Share </strong> — copy and redistribute the material in any medium or format for any purpose, even commercially.</li> <li><strong> Adapt </strong> — remix, transform, and build upon the material for any purpose, even commercially.</li> <li>The licensor cannot revoke these freedoms as long as you follow the license terms.</li> </ol> <h2 id="terms">Under the following terms:</h2> <ol> <li class="cc-by"><strong> Attribution </strong> — You must give <a id="src-appropriate-credit" href="https://creativecommons.org/licenses/by/4.0/#ref-appropriate-credit"> appropriate credit </a> , provide a link to the license, and <a id="src-indicate-changes" href="https://creativecommons.org/licenses/by/4.0/#ref-indicate-changes"> indicate if changes were made </a> . You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.</li> <li><strong> No additional restrictions </strong> — You may not apply legal terms or <a id="src-technological-measures" href="https://creativecommons.org/licenses/by/4.0/#ref-technological-measures"> technological measures </a> that legally restrict others from doing anything the license permits.</li> </ol> <h2 class="b-header has-text-black padding-bottom-big padding-top-normal" style="font-weight: bold;">Notices:</h2> <p>You do not have to comply with the license for elements of the material in the public domain or where your use is permitted by an applicable <a id="src-exception-or-limitation" href="https://creativecommons.org/licenses/by/4.0/#ref-exception-or-limitation"> exception or limitation </a> .</p> <p>No warranties are given. The license may not give you all of the permissions necessary for your intended use. For example, other rights such as <a id="src-publicity-privacy-or-moral-rights" href="https://creativecommons.org/licenses/by/4.0/#ref-publicity-privacy-or-moral-rights"> publicity, privacy, or moral rights </a> may limit how you use the material.</p> </div> Enhancing Association Rule Mining with Metaheuristic Parameter Optimization: A Transactional Data Analysis in Micro-Enterprise Context https://ejurnal.jejaringppm.org/index.php/jitcsa/article/view/204 <p>Nasi Uduk Mama Ipan is a micro-enterprise that conducts sales through both offline and online platforms. However, only online transaction data is available in analyzable form, while the owner lacks the knowledge to process it. This situation highlights the urgency of leveraging data mining techniques to uncover hidden patterns that can inform effective promotional strategies. This study aims to apply association rule mining using Apriori and FP-Growth algorithms, enhanced through metaheuristic-based hyperparameter tuning, to extract meaningful product bundling insights from transactional data. The research begins with data preprocessing, which involves eliminating irrelevant columns and transforming transactional records into a binary format. Four metaheuristic algorithms—Genetic Algorithm, ACO, PSO, and SA—are employed to determine optimal support and confidence values for both Apriori and FP-Growth. The modeling phase is conducted using Python with the mlxtend.frequent_patterns library, with rules filtered using a lift ratio threshold above 1. Results show that both Apriori and FP-Growth algorithms produce identical bundling recommendations using parameters derived from the Genetic Algorithm. Apriori performs faster, while FP-Growth is more memory-efficient. This study demonstrates that combining association rule mining with metaheuristic optimization can effectively support MSMEs in making data-driven marketing decisions.</p> Ferdy Hartanto Primanda Primanda Tb Ai Munandar Khairunnisa Fadhilla Ramdhania Copyright (c) 2026 Ferdy Hartanto Primanda Primanda, Tb Ai Munandar, Khairunnisa Fadhilla Ramdhania https://creativecommons.org/licenses/by/4.0 2026-02-25 2026-02-25 4 1 1 10 10.58776/ijitcsa.v4i1.204 Public Sentiment Analysis on the Service Quality of PT PLN on X Using Naïve Bayes and K-Nearest Neighbor Algorithms. https://ejurnal.jejaringppm.org/index.php/jitcsa/article/view/212 <p>Quality services since electricity is a primary public need. However, numerous complaints still highlight PLN’s lack of responsiveness, especially on the X platform (formerly Twitter). This study aims to analyze public sentiment toward PLN’s service quality expressed on X and compare the performance of the Naïve Bayes and K-Nearest Neighbor (KNN) algorithms in classifying sentiments into positive, negative, and neutral categories. The research employs the Knowledge Discovery in Databases (KDD) approach, involving data collection through tweet scraping using Tweet-Harvest, preprocessing (case folding, tokenizing, filtering, stemming), transformation with TF-IDF weighting, and data mining using Naïve Bayes and KNN. Evaluation through a confusion matrix shows that Naïve Bayes achieved an accuracy of 87%, outperforming KNN with an accuracy of 86%. These findings provide insights for PLN to better understand public perception and serve as a reference for future sentiment analysis research using machine learning.</p> Nurul Zahra Wowon Priatna Tyastuti Sri Lestari Copyright (c) 2026 Nurul Zahra, Wowon Priatna, Tyastuti Sri Lestari https://creativecommons.org/licenses/by/4.0 2026-02-25 2026-02-25 4 1 11 20 Data Infrastructure Application in Education: An Integrated Architecture for Secure Learning Analytics and Student Performance Prediction https://ejurnal.jejaringppm.org/index.php/jitcsa/article/view/245 <p>Data infrastructure has become a strategic backbone of contemporary education because digital learning environments continuously generate student traces that can be transformed into actionable evidence for teaching, advising, and institutional planning. Yet the practical value of educational data depends on much more than storage capacity. Institutions must integrate heterogeneous sources, manage raw and curated data simultaneously, enforce privacy constraints, and deliver analytics outputs that are operationally useful and ethically defensible. This study develops a layered educational data infrastructure architecture that connects raw learning data, extract-transform-load processes, governance mechanisms, curated analytics repositories, and machine-learning services. This paper includes a reproducible empirical evaluation using the real xAPI-Edu-Data benchmark collected from the Kalboard 360 learning management environment. Three machine-learning models are compared under a common preprocessing pipeline, and an ablation analysis quantifies the incremental value of integrated behavioral, parental, and contextual features. The best-performing model achieves a test macro-F1 of 0.797 and a macro one-vs-rest ROC-AUC of 0.919, while the ablation study shows that the full integrated feature set clearly outperforms demographic-only and behavior-only alternatives. The paper contributes structured architecture, mathematical formalization of integrated learning analytics, and empirical evidence that richer, better-governed data pipelines produce more useful predictive signals for educational decision support.</p> Dinesh Pranav Mukerjea Copyright (c) 2026 Dinesh Pranav Mukerjea https://creativecommons.org/licenses/by/4.0 2026-03-29 2026-03-29 4 1 21 34 10.58776/ijitcsa.v4i1.245 Application of data warehouse and OLAP processes for retail analytics https://ejurnal.jejaringppm.org/index.php/jitcsa/article/view/244 <p>Retail organizations increasingly rely on heterogeneous operational platforms, including point-of-sale systems, customer relationship management applications, cloud data stores, and locally administered databases. Although these platforms are valuable for transaction processing, they often generate fragmented, duplicated, and semantically inconsistent data that constrain enterprise reporting, forecasting, and customer intelligence. This paper substantially extends a conceptual SwiftMart case into a full design-and-evaluation study of a retail data warehouse and Online Analytical Processing (OLAP) framework. The proposed artifact combines a Kimball-style dimensional architecture, a governed extract-transform-load (ETL) pipeline, conformed dimensions, and materialized OLAP summaries for managerial analytics. To ground the case empirically, the framework is evaluated using the open-access UCI Online Retail dataset, which contains 541,909 transaction records from a UK-based online retailer covering 1 December 2010 to 9 December 2011. The experiment transforms raw transactions into a star schema with 524,878 curated fact rows, 19,960 orders, 4,355 customer members, 4,158 product members, and 38 countries. Four representative analytical workloads are benchmarked across three storage designs: a normalized operational data store, a dimensional warehouse, and materialized aggregate tables. The dimensional warehouse reduces mean latency by 42.3% relative to baseline joins, while materialized aggregates reduce latency by approximately 99.9%. A forecasting demonstration on warehouse-generated daily revenue aggregates further shows that a random forest model outperforms a naive benchmark, achieving an RMSE of 23,715.84 versus 34,055.29. The paper contributes an end-to-end reference architecture for retail analytics, together with dimensional design rationale, mathematical formulations, algorithms, empirical results, and implementation guidance relevant to both academic researchers and practitioners.</p> Victorio Palben Medel Copyright (c) 2026 Victorio Palben Medel https://creativecommons.org/licenses/by/4.0 2026-04-03 2026-04-03 4 1 35 46 10.58776/ijitcsa.v4i1.244 A Lakehouse-Oriented Big Data Infrastructure for Educational Analytics: Integrating Administrative and Assessment Data for Early Student Risk Prediction https://ejurnal.jejaringppm.org/index.php/jitcsa/article/view/248 <p>Educational institutions increasingly depend on heterogeneous digital systems, yet many analytics initiatives remain fragmented across student information, registration, assessment, and learning platforms. This paper proposes a lakehouse-oriented big data infrastructure for educational analytics and validates it through a reproducible early-risk prediction study using the Open University Learning Analytics Dataset (OULAD). The study integrates five public OULAD tables student information, course registration, assessment metadata, student assessment submissions, and course presentation metadata into temporally valid feature tables aligned to the student–module–presentation level. We define a windowed feature engineering framework that constructs actionable indicators such as submission rate, weighted completion score, average submission lag, and assessment coverage gap at 30%, 50%, 70%, and 100% of the course timeline. Two supervised classifiers, logistic regression and random forest, are evaluated under a stratified 80/20 protocol. The results show that administrative data alone provides weak discrimination (AUC 0.673), whereas integrated mid-course assessment evidence substantially improves performance. At the 50% course window, the random-forest model achieves an AUC of 0.947, F1 of 0.879, and recall of 0.829; even at the 30% window the model already reaches an AUC of 0.904. These findings demonstrate that the value of educational prediction depends not only on model choice but also on data integration architecture. The paper contributes (i) a lakehouse-oriented reference architecture for higher-education analytics, (ii) a temporally constrained feature engineering strategy for early-warning systems, and (iii) an empirical ablation showing that multi-source integration yields large and operationally meaningful gains.</p> Bhairav Kaphle Biswajit Shrestha Copyright (c) 2026 Bhairav Kaphle, Biswajit Shrestha https://creativecommons.org/licenses/by/4.0 2026-04-13 2026-04-13 4 1 47 58 10.58776/ijitcsa.v4i1.248 Revisiting the IBM Retail Data Warehouse: A Governed One-Column Architecture and Reproducible Open-Dataset Validation for Retail Analytics https://ejurnal.jejaringppm.org/index.php/jitcsa/article/view/247 <p>The IBM Retail Data Warehouse (RDW) correctly recognized the importance of integrated retail data, but it remained largely descriptive, did not formalize the underlying architecture, and lacked a reproducible empirical validation. This paper reconstructs and substantially extends that early proposal into a publication-ready research article. We first synthesize the historical IBM RDW, Retail Data Warehouse Model (RDWM), Retail Services Data Model (RSDM), and Retail Business Solution Template (RBST) concepts with contemporary data warehousing, data governance, and retail analytics literature. We then propose a governed, RDW-informed logical architecture that separates ingestion, quality control, conformed dimensional modeling, analytics marts, and decision-support services. To move beyond conceptual discussion, we instantiate the architecture with an open retail dataset from the UCI Machine Learning Repository containing 541,909 transactions. After governance-oriented preprocessing, the final analytical mart contains 392,692 valid rows, 18,532 orders, 4,338 customers, 3,665 products, and 37 countries. We formulate the transformation and forecasting workflow mathematically, define an end-to-end algorithmic pipeline, and evaluate a retail revenue forecasting task using naive, seasonal naive, linear regression, ridge regression, random forest, and gradient boosting baselines. On the hold-out test window, the best model (linear regression on warehouse-engineered features) achieves an RMSE of 4,302.61 GBP and R<sup>2</sup>=0.9766, while a raw, ungoverned pipeline yields a much weaker RMSE of 10,068.59 GBP. This corresponds to a 57.27% reduction in RMSE attributable to governance and dimensional integration. The results show that the practical value of an RDW-like architecture is not merely organizational; when implemented as a governed analytical platform, it measurably improves reproducibility, interpretability, and forecasting quality.</p> Nayananda Karunaratne Pulasthi Medhananda Copyright (c) 2026 Nayananda Karunaratne, Pulasthi Medhananda https://creativecommons.org/licenses/by/4.0 2026-04-13 2026-04-13 4 1 59 69 10.58776/ijitcsa.v4i1.247 Healthcare Data Integration Through Enterprise Data Warehousing: Architecture, Conformance Pipeline, and Experimental Validation for Readmission Analytics https://ejurnal.jejaringppm.org/index.php/jitcsa/article/view/246 <p>Healthcare organizations operate a fragmented digital landscape in which hospital information systems (HIS), electronic health records (EHR), laboratory systems, billing platforms, and departmental applications are optimized for transaction processing but not for integrated analysis. The resulting interoperability gaps, semantic inconsistency, duplicated records, and uneven data quality constrain enterprise reporting and limit higher-value analytics. This paper substantially proposes implementable enterprise data warehouse architecture, formalizing its data-quality and conformance mechanisms, and validating the design through experimental analytics use case. The proposed framework combines an integration layer for ETL/ELT, conformed dimensions, departmental marts, governance controls, and an analytics layer for OLAP and machine learning. To demonstrate practical value, the paper evaluates the framework on a de-identified inpatient diabetes dataset comprising 101,766 encounters and 50 raw attributes. The experimental pipeline performs profiling, conformance mapping, diagnosis grouping, missing-value treatment, and dimensional modeling before training benchmark readmission models. The best ranking performance is obtained by XGBoost with an AUROC of 0.688 and an AUPRC of 0.235, while threshold tuning improves recall-oriented operational utility. The results show that healthcare warehousing should not be framed merely as centralized storage; rather, it is an architectural mechanism for interoperability, data quality control, reproducible analytics, and decision support. The manuscript concludes with implementation guidance and limitations relevant to hospitals seeking a scalable, governance-aware warehousing program.</p> La Duy Ngôn Copyright (c) 2026 La Duy Ngôn https://creativecommons.org/licenses/by/4.0 2026-04-14 2026-04-14 4 1 70 82 10.58776/ijitcsa.v4i1.246