Below is a comprehensive and scientifically designed SPLK-2002 Study Plan, integrating:
Pomodoro Technique: Focused 25-minute study sessions followed by short breaks to maintain concentration.
Forgetting Curve (Ebbinghaus): Strategic spaced repetition and review to maximize long-term retention.
Structured Objectives: Clear weekly learning goals and daily tasks based on the 20 core exam topics.
| Component | Details |
|---|---|
| Study Technique | Pomodoro (25 min study + 5 min break × 4), 1 long break |
| Review Strategy | Spaced review on Day 3, 7, 14, 30 based on Forgetting Curve |
| Weekly Goal | Cover 3–4 knowledge domains with daily deep dives |
| Weekly Review | End-of-week summary + mock questions (Sat/Sun optional) |
| Materials Needed | Study notes, Splunk lab/sandbox, access to Splunk Web, official docs |
Theme: Splunk Architecture and Planning Fundamentals
Study Days: Day 1 to Day 5
Daily Time Commitment: 3 hours (4 Pomodoro sessions per day + review)
Weekly Objective:
Understand the core architecture and components of Splunk.
Learn how to assess business requirements and plan index design and system resources accordingly.
Build the foundation to support advanced topics in later weeks.
Goal: Understand the structure of Splunk’s distributed architecture and each component's role.
Tasks:
Read a detailed breakdown of Splunk’s key components: Forwarders, Indexers, Search Heads, Cluster Master, Deployer, License Master.
Create a labeled architecture diagram by hand or using a tool (e.g., draw.io).
Write a 1–2 sentence explanation of the role of each component in your own words.
Research the typical data flow (from data ingestion to search visualization) and sketch a flow chart.
Use spaced repetition to create flashcards for each Splunk role.
Expected Output:
A complete architecture diagram.
Short notes on each component’s function.
8–10 flashcards for review.
Goal: Learn how to gather, document, and analyze project requirements before designing a Splunk deployment.
Tasks:
Study common requirement categories: data sources, ingest volume, user roles, use cases.
Write a fictional project brief that includes:
Three data sources (e.g., firewalls, application logs, cloud logs)
Estimated data volume (e.g., 200 GB/day)
User types (SOC team, IT operations, compliance officer)
Use case types (threat detection, log centralization, audit reporting)
List 5 questions you would ask a customer to gather more detailed deployment requirements.
Create a checklist to validate that business needs align with deployment goals.
Expected Output:
One fictional project requirements document.
One project scoping checklist.
5 review questions on requirements analysis.
Goal: Learn how to structure and manage indexes based on data type, retention, and volume.
Tasks:
Review the concept of index buckets (hot, warm, cold, frozen).
Practice calculating retention settings using frozenTimePeriodInSecs.
Create a table with three sample indexes:
Name
Sourcetype
Retention time
Storage path
Analyze when and why to split data across multiple indexes.
Review how data model acceleration may affect indexing performance.
Expected Output:
An index planning table with at least three indexes.
One written paragraph explaining bucket lifecycle.
Flashcards on bucket types and retention settings.
Goal: Understand how to plan CPU, memory, storage, and network resources for a deployment.
Tasks:
Read Splunk guidelines for hardware sizing based on daily ingest.
Write down formulas or rules of thumb (e.g., one indexer per 300GB/day, one SH per 10 users).
Use your Day 2 scenario to calculate required:
Number of indexers
Number of search heads
Network capacity between forwarders and indexers
Storage requirements for hot/warm/cold buckets
Compare SSD vs HDD usage for bucket types.
Prepare a sample hardware plan with basic specs (CPU, RAM, Disk) per node type.
Expected Output:
A capacity plan for your fictional deployment.
A table comparing resource use for different components.
10 review flashcards on sizing best practices.
Goal: Reinforce everything learned in Days 1–4 through review and practical testing.
Tasks:
Review and update flashcards made earlier in the week.
Attempt at least 10 practice questions (multiple choice or written) covering topics from Days 1–4.
Redraw the Splunk architecture diagram from memory, then compare it with the original and correct mistakes.
Write a one-page summary:
Key architecture takeaways
Design considerations
Common mistakes to avoid when planning indexes and resources
Expected Output:
Flashcard accuracy ≥ 80%
Corrected architecture diagram
One-page study journal entry (summary + reflection)
Theme: Clustering, Forwarding, Performance, and Troubleshooting Fundamentals
Study Days: Day 1 to Day 5
Daily Study Duration: 3 hours/day
Study Method: 4 Pomodoros/day (25 min + 5 min break)
Weekly Objective:
Understand the structure and function of Splunk clustering systems (Indexer Cluster and SHC).
Learn how to configure and manage Universal and Heavy Forwarders using the Deployment Server.
Monitor system performance using native tools and logs.
Begin practicing structured troubleshooting using built-in Splunk diagnostic tools.
Goal: Understand the architecture and mechanics of Indexer and Search Head clustering in Splunk.
Tasks:
Read in-depth about Replication Factor (RF) and Search Factor (SF), and their effect on data availability.
Draw diagrams for both Single-site and Multisite Indexer Clusters, labeling key components.
Compare roles: Cluster Master (Manager Node), Peer Node, Deployer, Search Head, SHC Captain.
Create a configuration snippet example showing RF=3, SF=2 setup.
Write a comparison table: single-site vs multisite clustering – features, benefits, limitations.
Expected Output:
Two diagrams showing cluster architectures.
One written explanation of RF vs SF behavior.
Comparison chart with at least 5 differences.
Goal: Master the differences between UF and HF, and learn how to centrally manage forwarder configuration using a Deployment Server.
Tasks:
List all differences between Universal Forwarder (UF) and Heavy Forwarder (HF). Include CPU usage, parsing ability, and use cases.
Write sample configuration files for the following:
inputs.conf to monitor a log file
outputs.conf for load balancing to multiple indexers
deploymentclient.conf to connect to Deployment Server
Write a serverclass.conf file that targets Linux forwarders only.
Diagram the communication between a Deployment Server and 100 UFs.
Explain why indexing on an HF may lead to license overuse.
Expected Output:
A set of 4 complete sample config files.
One written explanation of UF vs HF.
One deployment workflow diagram.
Goal: Learn how to monitor Splunk performance (indexing and searching), identify bottlenecks, and apply basic tuning techniques.
Tasks:
Use Monitoring Console to explore:
Indexing throughput (KB/s)
Search concurrency
Skipped/long-running searches
Read and interpret entries in metrics.log and scheduler.log.
Identify signs of blocked queues (e.g., indexQueue, typingQueue) and link them to causes.
Create a table: performance metric → problem → solution (e.g., high CPU → inefficient SPL).
Write a checklist of best practices for tuning SPL and search performance.
Expected Output:
Completed table of performance symptom → cause → solution.
Screenshot or notes from Monitoring Console.
5 best-practice tips for search performance.
Goal: Begin using core Splunk diagnostic tools to troubleshoot configuration and data pipeline issues.
Tasks:
Use splunk btool to inspect live config layering (props.conf, transforms.conf) and identify override precedence.
Generate a diagnostic bundle using splunk diag. Extract key paths: configs, logs, crash dumps.
Explore the REST API to list and check current search jobs:
Use /services/search/jobs
Get search state, duration, and user
Open and analyze a recent search job’s logs in:
$SPLUNK_HOME/var/run/splunk/dispatch/<sid>/search.logCreate a flowchart: issue symptom → related log → appropriate tool → resolution step.
Expected Output:
One flowchart of troubleshooting steps.
CLI output snippets for btool and diag.
REST API request example and result notes.
Goal: Reinforce and internalize all knowledge from this week’s study, identify weak points, and correct misunderstandings.
Tasks:
Use your flashcards from the week (target: 20+ cards) and test yourself. Track your recall accuracy (aim for ≥ 85%).
Redraw the Indexer Cluster and SHC architecture diagrams from memory, then compare with original versions.
Attempt 10 multiple-choice or case-based questions based on:
Forwarder behavior
Cluster mechanics
Performance metrics
Config files (inputs.conf, outputs.conf, serverclass.conf)
Write a one-page weekly summary:
5 key concepts learned
3 tools mastered
2 potential mistakes to avoid
1 most valuable insight from the week
Expected Output:
Revised and corrected diagrams.
Practice question answer log.
A 1-page self-written review summary.
Updated Anki/flashcards with marked “need review” tags.
Theme: Troubleshooting, Configuration Management, and Search Optimization
Study Days: Day 1 to Day 5
Daily Study Duration: 3 hours/day
Study Method: 4 Pomodoros/day (25 min + 5 min break)
Weekly Objective:
Learn how to clarify and classify problems using a structured approach.
Troubleshoot license and crash-related issues.
Understand how Splunk reads and overrides configuration files.
Detect and resolve search-related problems such as performance, permissions, and errors.
Goal: Develop structured diagnostic thinking by learning how to break down, categorize, and trace issues across Splunk components.
Tasks:
Read about the four-question model:
Who is affected?
What is the symptom?
When did it start?
Where is it happening?
Create a diagnostic checklist based on the model.
Review and classify these issues into categories:
Data Collection
Indexing Delay
Search Failures
Configuration Errors
Practice with 2 fictional scenarios. For each, write:
Problem summary
Clarification steps
Likely root cause
Build a table: issue type → potential component → likely log source → tool to use.
Expected Output:
A full diagnostic checklist template.
Completed case breakdowns.
A reference mapping table from symptom to investigation path.
Goal: Understand how licensing works, what causes violations, and how to troubleshoot Splunk crashes using logs and tools.
Tasks:
Study license architecture: License Master, stacking, daily volume tracking.
Simulate (on paper or lab) two licensing issues:
UF misconfigured to index data
HF double-indexing before forwarding
Read splunkd.log entries showing licensing violations. Write down message patterns.
Explore common crash causes:
Memory leaks
File descriptor exhaustion
Corrupt binaries
Practice reading crash logs and explain when you would use:
Restart
Reinstall
--repair_kvstore
Expected Output:
One simulated license violation scenario documented.
Chart: problem → log pattern → action.
Flashcards on license components and crash handling flow.
Goal: Learn how to manage and troubleshoot .conf files using btool and understand Splunk’s configuration precedence model.
Tasks:
Review precedence order for configuration files (system/local > app/local > app/default > system/default).
Use btool to analyze the active settings in props.conf and inputs.conf.
Simulate a conflict: app-level sourcetype setting overrides a global setting. Use btool to trace the source.
Write config snippets for:
inputs.conf (monitor file)
props.conf (set sourcetype)
transforms.conf (rename a field)
Practice using the splunk reload command (document what configs can be reloaded vs what requires restart).
Expected Output:
One summary of config precedence.
At least 3 config snippets tested or written.
btool command outputs annotated.
Goal: Identify the root causes of search failures and slow performance, and improve SPL quality.
Tasks:
Write 3 inefficient search queries and explain why they are problematic (e.g., search *).
Rewrite them using indexed fields and better SPL practices.
Run a search using Splunk Web and analyze using Job Inspector. Record:
Execution phases
Duration
CPU usage
Review search failure types:
Permissions denied
Missing knowledge objects
Long dispatch times
Read and analyze search.log, scheduler.log, and dispatch.log from a recent job.
Expected Output:
3 improved SPL examples.
One full Job Inspector report with analysis.
Summary notes of common search issues and logs involved.
Goal: Reinforce your understanding of Splunk problem-solving, configurations, and search performance through active recall and problem-based review.
Tasks:
Review and update your flashcards (minimum 25) for all topics this week.
Answer 10 multiple-choice or scenario-based practice questions. Focus on:
Config file precedence
Diagnosing license overage
Search error causes
Redraw the config precedence tree from memory, then compare and correct.
Write a 1-page summary:
Most common real-world Splunk issues
Your personal troubleshooting checklist
3 insights or lessons learned
Expected Output:
Corrected config tree diagram.
Practice question results.
1-page reflection summary.
Theme: Deployment Architecture, Enterprise Scaling, and Site-Aware Indexer Clustering
Study Days: Day 1 to Day 5
Daily Study Duration: 3 hours/day
Study Method: 4 Pomodoros/day (25 min + 5 min break)
Weekly Objective:
Understand the Splunk Deployment Server and its challenges.
Design and plan large-scale Splunk environments.
Compare and implement single-site and multisite indexer clustering.
Analyze deployment best practices, high availability, and disaster recovery strategies.
Goal: Learn how the Deployment Server works and how to troubleshoot configuration distribution failures.
Tasks:
Review how the Deployment Server functions: what it manages, how server classes work, and what clients need.
Write a valid serverclass.conf file that groups forwarders by OS type and deployment group.
Simulate a common problem: forwarders not receiving apps. Write a diagnostic process including log paths and likely causes.
Analyze the deploymentserver.log for key phrases like Pushing, Failed, or No matches.
Summarize best practices for managing large numbers of forwarders (e.g., app size, serverclass overlap).
Expected Output:
A working serverclass.conf example.
Troubleshooting checklist for app deployment failures.
Summary of DS best practices (at least 5 items).
Goal: Build the skills to design and evaluate enterprise-grade Splunk deployments for high volume and user load.
Tasks:
Design a fictional deployment for an enterprise ingesting 2 TB/day with 100+ users. Define:
Number of indexers
Number of SHC members
Role separation: Deployer, DS, License Master, Cluster Master
Choose a cloud-based or on-prem model, then list benefits and risks.
Write a capacity plan: storage tiering (hot/warm → SSD, cold → HDD, frozen → S3).
Document your disaster recovery strategy using multisite clustering, or cross-site backups.
Summarize the differences between small vs large-scale deployments in table form.
Expected Output:
Enterprise-scale architecture diagram.
Capacity plan and DR strategy document.
Table: small vs large Splunk environment characteristics.
Goal: Understand how a single-site indexer cluster operates and when it's appropriate.
Tasks:
Redraw a single-site indexer cluster with RF=3, SF=2, and 5 indexers.
Simulate an indexer going offline: what happens to RF and SF, and which logs would reflect it?
List advantages (simplicity, performance) and limitations (no DR, hardware risk).
Review settings in server.conf and indexes.conf that affect cluster setup.
Practice interpreting clustermaster.log for bucket fix-ups and replication failures.
Expected Output:
Labeled single-site cluster diagram.
One written scenario of failure and recovery response.
Config snippet with RF/SF definitions.
Goal: Learn how to design, configure, and monitor a site-aware, multi-datacenter indexer cluster.
Tasks:
Design a multisite cluster with site1 = New York, site2 = London.
Write the server.conf portion for both sites including:
site=site1
site_replication_factor = origin:2, total:3
site_search_factor = origin:1, total:2
Discuss the impact of latency and bandwidth between sites.
Write a procedure for what happens if an entire site goes down (failover, DR mode).
Compare RF/SF behavior in single-site vs multisite environments.
Expected Output:
Two-site configuration sample.
DR failover plan document.
Chart comparing single vs multisite behavior.
Goal: Reinforce week’s knowledge by recreating designs, answering deployment-focused questions, and reflecting on tradeoffs.
Tasks:
Flashcard review: deployment, RF/SF behavior, site config settings. Aim for 85% recall.
Rebuild both your single-site and multisite cluster diagrams from memory, then verify for completeness.
Complete 10 scenario-based practice questions on:
DS failures
RF/SF violations
DR architecture
Write a 1-page technical reflection:
How does site awareness change replication behavior?
What tradeoffs exist between performance and resilience?
Expected Output:
Redrawn architecture diagrams with RF/SF paths.
Deployment question set results.
Technical summary page with use-case examples.
Theme: Indexer and Search Head Cluster Management and Operations
Study Days: Day 1 to Day 5
Daily Study Duration: 3 hours/day
Study Method: 4 Pomodoros/day (25 min + 5 min break)
Weekly Objective:
Operate and manage an indexer cluster: health checks, bucket replication, RF/SF enforcement.
Master the Search Head Cluster (SHC): app deployment, captain election, and troubleshooting sync issues.
Use CLI and log files to monitor cluster health and respond to errors.
Goal: Learn how to use the Cluster Master to monitor, manage, and repair an indexer cluster.
Tasks:
Review roles in an indexer cluster:
Cluster Master (Manager Node)
Peer Nodes (Indexers)
Run the CLI command:
splunk show cluster-status
Analyze RF/SF compliance and peer node sync states
Open and interpret clustermaster.log:
Simulate a rebalance or manual bucket fix-up procedure: document the command and conditions to use it
Write an operational checklist:
What to check daily on Cluster Master
What logs to check after a peer failure
Expected Output:
Annotated output from splunk show cluster-status
One-page checklist for indexer cluster monitoring
Summary table: common cluster issues and actions
Goal: Understand the SHC structure, role of each component, and how knowledge objects are synchronized.
Tasks:
Draw the Search Head Cluster structure with at least 3 SHC members, 1 Deployer, and 1 Captain
Define responsibilities:
Deployer: app/config distribution
Captain: search scheduling, object replication
Cluster members: search execution
Study how knowledge objects (saved searches, lookups, macros) are synchronized
Learn captain election process: what triggers it, and how quorum is maintained
Summarize the differences between SHC and standalone SH from both admin and user perspectives
Expected Output:
Labeled diagram of SHC
Table: SHC vs standalone SH
Written summary of the SHC replication mechanism
Goal: Use the SHC Deployer to safely push apps and configurations to cluster members.
Tasks:
Review the folder structure for app deployment:
$SPLUNK_HOME/etc/shcluster/apps/Prepare a simple custom app with a saved search and a macro, and place it under the apps folder
Run the bundle push command:
splunk apply shcluster-bundle -target https://<captain>:8089 -auth admin:passwordReview the deployment logs: check if the app was distributed successfully
Simulate a rolling restart:
Define when it is required
Write down the proper sequence
Expected Output:
A successfully deployed test app
CLI command summary with notes
Rolling restart procedure document
Goal: Detect and resolve common synchronization failures in the SHC using logs and cluster status.
Tasks:
Use splunk show shcluster-status to check:
App replication status
Captain identity
Member sync state
Open and analyze shclustering.log:
Look for messages related to failed bundle replication
Identify out-of-sync members
Write a list of causes for bundle sync failure:
Bundle too large
Network issues
Role mismatch
Create a health audit checklist:
How to verify app consistency
What to check before pushing a bundle
Practice correcting an out-of-sync SHC node by restarting and checking logs
Expected Output:
Annotated shclustering.log snippets
Health checklist for SHC maintenance
Summary of sync failure causes and solutions
Goal: Consolidate cluster operation knowledge through recall, question-solving, and procedural writing.
Tasks:
Flashcard review (20+ cards):
Cluster roles
CLI commands
Sync procedures
Log locations
Redraw both the Indexer Cluster and Search Head Cluster architectures from memory
Answer 10 scenario-based practice questions on:
Rebalancing buckets
SHC bundle conflicts
Peer node failures
Write a one-page operational summary for production environment:
Daily checks
Deployment practices
Failure recovery steps
Expected Output:
Corrected cluster architecture diagrams
Question results with notes
One operational summary sheet ("Cluster Management in Production")
Theme: KV Store Mastery, Full Review, and Exam Simulation
Study Days: Day 1 to Day 5
Daily Study Duration: 3 hours/day
Study Method: 4 Pomodoros/day (25 min + 5 min break)
Weekly Objective:
Understand and manage KV Store collections, permissions, and lookups.
Reinforce knowledge across all 20 core topics using spaced repetition.
Complete a full-length mock exam and identify remaining weaknesses.
Finalize all diagrams, notes, and flashcards for long-term retention.
Goal: Understand how KV Store works, how to define and manage collections, and how it integrates with lookups and apps.
Tasks:
Read the structure of a KV Store collection: keys, values, JSON format, schema flexibility.
Write a collections.conf file defining a custom collection (e.g., user access table or session tracking).
Use the REST API to perform the following (via cURL or Postman):
GET records
POST new records
DELETE entries
Simulate exposing a KV Store as a lookup using transforms.conf.
Review backup and repair strategies using:
kvstore-to-json.py
splunkd --repair_kvstore
Expected Output:
One working collections.conf + test entries
REST API command set with result snapshots
Summary notes: permissions, use cases, maintenance routines
Goal: Review and reinforce learning from the first two weeks: architecture, planning, clustering, and deployment.
Tasks:
Flashcard review (at least 40 cards) from Week 1 and 2:
Splunk architecture
Index design
Resource sizing
Forwarders
Clustering logic
Reconstruct key diagrams from memory:
Full Splunk deployment map
Forwarder to Indexer to SH data flow
RF/SF behavior example
Do 10 practice questions based on:
Index lifecycle
Deployment best practices
Monitoring Console analysis
Write a study sheet summarizing:
5 Splunk components
3 types of clusters
3 common misconfigurations and their fix
Expected Output:
Rewritten architecture diagrams
Practice question answers and reflection
Personal summary sheet (Week 1–2)
Goal: Reinforce learning from Weeks 3 and 4: troubleshooting, configuration, search, licensing, and multisite design.
Tasks:
Review flashcards (40+) from Week 3 and 4:
Licensing
Crash handling
Configuration precedence
Search issues
Multisite cluster config
Do 10 scenario-based practice questions:
Troubleshooting flow
btool and diag usage
Site replication failure
Review key logs and their use:
splunkd.log
scheduler.log
clustermaster.log
Redraw a multisite cluster config (with RF/SF and site awareness). Explain in writing how failover works.
Expected Output:
Flashcard quiz result log
Practice question notes
Written failover procedure for multisite
Goal: Simulate the actual exam environment, identify weak areas, and plan targeted final revision.
Tasks:
Complete a full-length mock exam (60–75 minutes, 50 questions). Use a quiet environment and no references.
Score your result. Target: 85%+
Analyze all missed or uncertain questions:
Identify which topic each belongs to
Write a note summarizing what was wrong and how to fix it
Rank topics by confidence (High / Medium / Low)
Create a mini study plan for any "Low Confidence" topics on Day 5
Expected Output:
Mock exam result and reflection
Missed question log
Confidence heat map by topic
Goal: Complete the final round of active recall and reinforce memory through summarization and teaching-style review.
Tasks:
Write a two-line summary of each of the 20 core topics
Prepare and “teach” one topic out loud (or record it) — pretend you're teaching someone else
Review top 10 most difficult flashcards from the entire course
Revisit your deployment and cluster diagrams, ensure you can annotate them from memory
Write your own “cheat sheet” — 2–3 pages max, including:
Key commands
File paths
Logs
Conf file functions
Deployment checklists
Expected Output:
20 concise topic summaries
Final flashcard round (100% accuracy goal)
A personal cheat sheet for last-minute exam review