Why compare CMDB and auto-discovery if we already have a network scanner?

A network scanner shows what is visible on the network at the time of the scan; a CMDB must store context: who owns the asset, where it is, which services depend on it and who supports it. You compare them to understand whether the data can be trusted for procurement, security and support — not just to get a “nice” count of discovered devices.

Which inventory accuracy metrics matter most in a pilot?

On a pilot, look at coverage by segment, correctness of key fields, data freshness and duplicate counts. It’s useful to agree up front what you consider an asset and which key you use for uniqueness; otherwise results from different tools won’t be comparable.

How to run a quick and fair pilot for inventory accuracy?

Create a small canonical set of 30–100 diverse assets with verified owner, location and name. Run 2–3 discovery cycles with the same settings, then compare the share found, the share correctly identified without duplicates, attribute correctness and the speed of appearance/disappearance.

What does agent-based mode give you compared to agentless?

Agents typically provide deeper and more stable data about installed software, versions, local users and device health—especially for devices that rarely connect to the corporate network. Agentless is easier to start with and works well for servers and network gear, but depends more on access, segmentation and firewall rules.

When is an agent essentially required?

Agents are almost unavoidable for remote laptops that don’t appear in the office network for weeks, and for precise software inventory on endpoints. They also help when network polling is unstable or blocked and when you need to capture changes between scans rather than one-off snapshots.

In which cases is agentless discovery the better choice?

Agentless discovery is a good fit for servers, network devices, printers and virtualization when WMI/SSH/SNMP access and read-only credentials are available. It yields faster initial results and can be easier to agree with InfoSec if you limit segments, polling frequency and protocols.

Why do duplicates appear in a CMDB and how to reduce them?

Duplicates usually come from IP changes, renaming, multiple NICs, OS reinstalls or parallel data sources that identify the same asset differently. Start by choosing a primary key (serial number, UUID, cloud ID) and verify whether the tool can merge records instead of continually creating new ones.

What access is needed for discovery and how to do it securely?

Grant minimal necessary rights: SNMP read-only, separate service accounts for Windows and Linux (not domain admins or root), read-only accounts for virtualization and minimal roles for cloud keys. Store secrets securely, rotate them, and keep audit logs of who ran scans, from which host and which credentials were used.

How do Lansweeper, Device42, ServiceNow Discovery and NetBox differ in terms of accuracy?

Lansweeper is often chosen for a fast inventory overview; Device42 excels at dependencies, application/infrastructure relationships and DCIM; ServiceNow Discovery makes sense when CMDB is part of ITSM processes and responsibilities are defined; NetBox is a reliable source of truth for network addressing, racks and topology, but typically needs integrations to cover OS, software and cloud resources.

How to calculate a realistic 3-year TCO for comparison?

Count not only licenses but all costs to keep data usable: infrastructure, implementation, maintenance, connector updates, duplicate cleanup and reference-data normalization. A useful final metric is cost per asset per month over 36 months, accounting for asset growth and regulatory security requirements.

CMDB and Asset Auto-Discovery: Accuracy, Modes, and 3-Year TCO

Why bother comparing inventory accuracy

A CMDB is not just a list of devices produced by a network scanner. A scanner shows what it could see at a specific moment. A CMDB must answer more complex questions: what is this asset, who owns it, where is it located, what is its lifecycle, which services depend on it and who is responsible for support. If a CMDB lacks context and relationships, it quickly becomes just another table people don’t trust.

Inventory often diverges from reality. Shadow assets appear (someone plugs in an access point or mini-PC and forgets it), duplicates arise (the same device appears twice because of different names, IPs or serials), records become stale (a laptop was retired but stays listed as active for years) and coverage gaps occur (some subnets aren’t scanned, some hosts are blocked by firewalls, some devices sit off-network). As a result, security, procurement and licensing reports start to disagree.

That’s why you compare CMDB and auto-discovery: to gain trust in the data, not to produce pretty charts. In practice people evaluate Lansweeper, Device42, ServiceNow Discovery and NetBox: each takes a different approach to data collection, normalization and modeling.

A pilot for inventory accuracy should answer clearly:

what percentage of assets is discovered automatically and where blind spots begin (network segments, VPNs, branches);
how well the tool identifies an asset (serial number, model, owner, role);
how many duplicates appear and why;
how quickly changes (relocation, renaming, component replacement) are reflected in the database;
how much effort is needed to get data into working condition.

Don’t expect any system to build a perfect CMDB in a week without naming standards, data owners and an approval process. A pilot will show how much order already exists in your infrastructure and how much needs to be added for the data to be useful for IT and InfoSec.

Criteria: coverage, accuracy, freshness and duplicates

Agree on terms before you compare CMDB and discovery tools. “Asset” usually means anything valuable to IT: a laptop, a server, a printer, a license. A “CI” (configuration item) is a CMDB record describing an asset and its relationships. A “node” is a discovered host or device. A “software instance” is a specific application installation on a node. “Cloud resource” means a VM, disk, load balancer, account or service living outside your network.

Coverage isn’t just “how many were found” but “which items were found and where.” List mandatory zones in advance: wired networks, Wi‑Fi, remote branches, segments behind VPN, clouds, and also OT and specialized equipment (for example, medical devices in a clinic). A common trap is a tool that sees office PCs well but barely sees branches or devices in isolated VLANs. The final picture looks complete, but it isn’t.

Accuracy is the quality of fields that make inventory useful. It’s one thing to detect “some Windows host,” another to correctly determine hostname, owner, location, model, serial number, IP and MAC. In practice accuracy depends on sources: where owner data is held (HR, AD, tickets), how sites and rooms are labeled, and whether naming standards exist.

Freshness shows how quickly the database reflects changes. New devices should appear promptly; retired or off-network devices should vanish quickly, not after months. Test two simple scenarios: connecting a new laptop in a branch and decommissioning an old server. If both events look timely in the CMDB, updates are configured properly.

Duplicate resilience is about uniqueness rules. The same asset can “split” when IPs change, when there are two NICs, after OS reinstall, or because of multiple data sources. Decide in advance what the primary key is: serial number, UUID, MAC+model combination, or cloud ID. Also check whether the tool can merge records rather than just accumulating them.

To compare solutions honestly during a pilot, keep one metrics table and fill it consistently:

coverage by segments (office, branch, Wi‑Fi, cloud, OT);
accuracy for critical fields (model, serial, owner, location);
freshness (time to detect a new asset and time to remove a decommissioned one);
duplicates (share of records that required manual merging);
source transparency (can you quickly see where each field came from?).

This lets you compare suitability of data for support, security and procurement rather than product popularity.

Agent vs agentless: practical differences

Choosing agent-based or agentless discovery affects not only what data you see, but how quickly you can keep it accurate. It’s a core trade-off: depth of data versus ease of deployment.

Agent-based means a small service is installed on the device. It collects more details and does so more reliably, even if a device rarely reaches the corporate network. Agents typically provide better visibility into installed software (versions, components, sometimes usage), local users, updates, disk health and other metrics hard to collect reliably over the network.

Agentless relies on network discovery and standard protocols/interfaces: network scans, WMI for Windows, SSH for Linux, SNMP for network gear, plus cloud, hypervisor and virtualization APIs. It’s convenient for a fast start, but quality depends heavily on credentials, segmentation and how carefully access is configured.

Where an agent is nearly mandatory

Without agents accuracy tends to fall short in four cases: remote laptops and workstations that don’t connect to the office for weeks; devices in isolated segments without stable WMI/SSH access; comprehensive software inventory across all PCs; and tracking changes between scans rather than taking single snapshots.

Where agentless is more convenient

For servers and infrastructure, agentless often wins: it’s easier to approve from a security standpoint and it provides quick initial results. In a pilot this shows immediately: start scanning a subnet and you’ll see most servers, network devices, printers and VMs.

In practice a mixed approach is almost always best. Example: an organization in Kazakhstan has a central data center and branches. For data center servers and networking they use agentless discovery (SNMP/WMI/SSH plus API integrations), and a lightweight agent is installed only on laptops and key workstations where accurate software inventory matters. This reduces deployment burden while covering the most problematic quality gaps.

Step-by-step: run a pilot and measure accuracy

A pilot is not for a slide deck — it’s to see how CMDB and auto-discovery reflect reality in your environment. A common mistake is judging a tool by the total number of discovered devices without verifying what those devices are and whether duplicates exist.

1) Build a small but diverse canonical set

Select 30–100 assets covering different classes: user PCs, servers, network gear, VMs and several “problem” devices (printers, terminals, medical equipment, Wi‑Fi points). The more diversity, the fairer the test.

Record where these assets are, who owns them and how they should be named. A verified sample is more valuable than a larger unverified pool: 50 checked assets beats 500 guessed ones.

2) Configure the pilot to be fair

Agree rules and prepare data sources beforehand. A helpful sequence:

define sources of truth (AD, hypervisor data, procurement records plus manual verification);
split the network into scan zones (office, data center, branches) and explicitly list what is in the pilot;
prepare credentials: Windows (WMI/WinRM), Linux (SSH), SNMP for network, hypervisor access for VMs;
run 2–3 discovery cycles with identical settings and the same time window;
after each cycle record changes: what appeared, what disappeared, what changed and why.

Two or three cycles are needed to evaluate freshness: some systems find assets well but update details with delay; others update attributes quickly but have weaker identification.

3) Reconcile results with a single template

Choose fields and uniqueness rules in advance. A reconciliation template typically includes:

uniqueness: serial number or UUID, and if missing — MAC + hostname + IP (with caveats);
class and role: PC, server, switch, VM, etc.;
key attributes: model, OS, CPU/RAM, network interfaces;
owners and location: department, site, rack or room;
duplicates and ghosts: unverified records, stale hosts, renamed devices.

Then calculate simple metrics: share of the canonical set found, share correctly identified (without duplicates), share of correct attributes and update speed. These figures form a fair basis for comparing tools and calculating 3-year TCO.

Investigating mismatches and improving data quality

Reduce inventory duplicates

We’ll check the root causes of duplicates and propose uniqueness rules for your infrastructure.

Order an audit

After a pilot you usually see the same pattern: some assets aren’t found, some appear twice, and software/version data looks suspiciously neat until you spot-check. The good news: quality improves quickly if you log mismatches, adopt clear remediation steps and iterate.

Start by separating network problems from access problems. Missed discoveries rarely mean the tools are bad. More often they reflect routing and credential limitations.

Common reasons assets are missed or only partly discovered:

network segmentation (VLAN) and no routing or scanning between segments;
firewall rules blocking required ports (WMI/WinRM, SSH, SNMP, etc.);
NAC or other access policies preventing polling;
inadequate credentials (no local admin, no sudo, wrong SNMP community);
agent disabled, removed, outdated or host in a sleep state.

Duplicates are especially visible when consolidating multiple sources (AD, virtualization, network scans, agents). A duplicate usually appears because the system cannot consistently anchor a record to a single identifier.

Typical duplicate causes: IP change, hostname rename, multiple MACs, parallel sources with different rules, or a server seen both as physical and as a VM. Fixing this requires matching rules and reference data. A model reference helps so that “DL380 Gen10”, “HPE DL380G10” and “ProLiant DL380 Gen10” don’t become three different models. The same applies to locations, departments and asset types. In projects where equipment and support are handled by a local integrator, align these reference lists early to avoid manual CMDB cleanup later.

Check software and version accounting separately. If a tool sees installed packages on 10 PCs it doesn’t mean it sees them equally well on servers, terminal farms and isolated segments. Do selective manual checks across device types and compare not only what is installed but which version and when it was updated.

In the pilot report include concrete details, not just percentages. A minimal helpful set:

share of assets found by segment and type (PCs, servers, network, printers);
share of duplicates and the 3–5 most common causes;
share of assets with correct key attributes (serial, model, owner, location);
software accuracy: name and version match on the control sample;
a list of representative errors with short examples: expected vs actual and why.

Turn each common error into a rule (credential fixes, firewall exceptions, naming normalization, matching logic) and accuracy grows by iteration. After 2–3 cycles data is often good enough for CMDB-driven processes, not just dashboards.

Comparing Lansweeper, Device42, ServiceNow Discovery and NetBox

People often compare tools by device counts. For CMDB accuracy what matters is how well data is glued into a single record, how relationships are maintained and who owns data quality.

Strengths of each system

Lansweeper often wins for a quick start. It’s chosen when you need a basic picture in days: which PCs, servers, network devices and printers exist, and which OS and software versions are installed. Limitations surface where you need a deep relationship model, consistent identifiers across sources and stable performance in segmented environments. Dynamic assets (cloud, ephemeral environments) also require discipline around normalization.

Device42 is useful when accuracy means not only “we found the hardware” but “we understand how it’s connected.” Its strengths are dependencies, application-infrastructure relationships and DCIM (racks, placement, power and ports). When an asset has a rack slot, a switch port and a service relationship, the record becomes much more reliable and duplicate resolution is faster.

ServiceNow Discovery makes sense in large organizations where CMDB is part of ITSM (changes, incidents, configuration management). Accuracy here depends on scanning plus agreed processes: which CI classes to maintain, required fields, data owners and exception handling. Without processes and roles, you’ll collect lots of data but quality and freshness will deteriorate because of duplicates and differing views on what counts as an asset.

NetBox is commonly used as a network source of truth: addressing, VLANs, subnets, racks, interfaces and topology. It enforces structure and order, but by itself doesn’t usually cover full auto-discovery of IT: OS, software, serial numbers, cloud resources and runtime state require additional collectors and integrations.

In practice, choices often break down like this:

need a fast asset overview and basic inventory: Lansweeper;
need relationships, DCIM and fewer duplicates via context: Device42;
need CMDB as part of ITSM with governance and data owners: ServiceNow Discovery;
need order in network addressing and racks while pulling data from other sources: NetBox.

Questions to ask before the pilot

Ask vendors/integrators questions that directly affect data quality:

which connectors and collection methods will actually work in your environment (Windows, Linux, network, virtualization, clouds);
how records from different sources are merged and which keys are used (serial, UUID, hostname, IP);
how the tool surfaces quality issues: duplicates, conflicting fields, stale assets;
how relationships are maintained (port, rack, service, dependency) and how that impacts accuracy;
what is included in 3-year TCO: licenses, agents, infrastructure, implementation, support and operational effort.

If you plan implementation in Kazakhstan, clarify who will assist with integrations and operations. GSE.kz, as a system integrator, often joins at the pilot stage to confirm coverage, agree matching rules and estimate ongoing maintenance effort.

Access, security and infrastructure impact

Prepare infrastructure for CMDB

We’ll select servers and infrastructure for your CMDB, scanners and database load.

Request a quote

The main risk in CMDB and discovery projects is often not “accuracy” but credentials and scan load. You can usually start safely: minimal rights, limited scope and scheduled windows, then expand access only where necessary.

Minimal privileges, secrets and audit

Give only the rights needed to answer specific questions. “Which OS and serial” requires less access than “which patches are installed and which services are running.”

A practical minimum typically looks like: SNMP read-only for network devices (no write and avoid default public/private); a dedicated service account for Windows with limited rights (not domain admin); a separate Linux user with read-only access where possible (no permanent root); a read-only account for virtualization (vCenter/Hyper-V); cloud keys with minimal roles (read-only for resource descriptions).

Don’t store secrets in notes or email. Use a secure vault (at least the one built into your chosen tool) and a clear rotation process. Check that the product supports multiple accounts per collection type, connection testing and scheduled secret rotation.

For incident investigation maintain audit trails: who ran scans, from which node, with which credentials, which segments were touched and what failed (timeout, access denied, blocked). Logs should be retained long enough to correlate with InfoSec events and network changes.

Segmentation, load and InfoSec approvals

Scanning can generate noise in networks and on servers, especially with deep queries or frequent cycles. A safe approach: start with one segment, limit concurrency, set scan windows (e.g. nights for servers) and exclude critical systems.

To speed InfoSec approvals, prepare in advance: a description of data sources and protocols to be polled, a list of accounts and their rights, a diagram of scanner placement and routes (which segments are affected), a secret rotation plan, a load-testing plan and stop criteria.

How to calculate 3-year TCO

TCO matters not because of license price alone but because of what you must spend to keep data complete, fresh and usable over 36 months. Compare not just the total, but also cost per asset (for example, tenge per asset per month).

Start with a simple cost structure and separate one-off and recurring expenses. Typically five buckets: licenses/subscriptions (by assets, nodes, modules), infrastructure (servers/VMs, DB, backups, monitoring, test environment), implementation (connectors, credentials, attribute normalization, basic reports), operations (admin, updates, incident response, user support), and training (admins, support, data owners).

Then add hidden costs that often appear after the pilot: connector maintenance (APIs and credentials change), regular duplicate cleanup and data hygiene, matching rule tuning, custom fields and their lifecycle, manual triage of unknown devices (especially in branches and limited-access networks).

For a 3-year calculation split costs into Year 0 and Years 1–3 and assume asset growth. A practical assumption is 10–30% growth over 3 years or use your procurement forecasts.

Model three scenarios: minimal (single site, no agents, basic integrations), baseline (agents on some critical endpoints, integration with tickets/AD, 2–3 sites) and expanded (multiple zones, separate test environment, more integrations, strict access policies).

Example: an organization has 5,000 assets now and expects 6,500 in 3 years. TCO is the sum of all costs over 36 months divided by the average number of assets in the period ((5000 + 6500) / 2). That yields cost per asset per month and makes license models easier to compare.

If you operate in regulated sectors (government, finance, healthcare), separately account for security requirements: segmentation, minimal-privilege accounts, logging and approval time. These aren’t add-ons — they are real parts of TCO.

Common mistakes when choosing CMDB and discovery

Build the DC foundation on GSE

We’ll propose GSE S200 server options for reliable inventory and data-center services.

Select a server

Disappointment usually stems not from the tool but from how comparisons and implementations were done. Results vary widely because networks, permissions and accounting discipline differ.

A typical mistake is comparing systems without a consistent methodology. One pilot may define uniqueness by hostname, another by serial number and a third by MAC. The outcome is predictable: numbers aren’t comparable and “accuracy” becomes a debate about definitions.

What commonly breaks the choice

Problems usually start with:

expecting 100% accuracy without processes (no data owner, no one confirms changes like laptop issuance, server moves or NIC replacement);
relying on one source (only network scans or only procurement records). Scans miss offline and remote assets; procurement misses shadow assets and replacements;
picking an unhelpful pilot zone: either too “showcase” or too chaotic to tell whether errors are tool-related or environment-related;
no agreement on required fields (serial, model, owner, location, criticality, status: in service, in stock, retired);
ignoring remote work and off-network assets: laptops on business trips, branches behind NAT, home Wi‑Fi, cloud VMs.

How it looks in practice

In a distributed organization some laptops are visible only when in the office. Without rules for such devices (agent, VPN policy, owner confirmation cadence), any system will show fluctuating asset counts.

A sign of a mature selection: clear data ownership, documented discrepancy resolution and agreed authoritative sources for different asset types. If you need an independent view on pilot methodology and integration, system integrators like GSE.kz often join at this stage to make the comparison fair and repeatable.

Checklist and next steps

To avoid a debate over “prettier reports,” agree in advance what you consider an asset and which data must be in the CMDB and discovery. The same laptop in AD, MDM and antivirus often appears as three different objects until you define rules.

Before the pilot check basics: which asset types must be covered (workstations, servers, network, printers, VMs, containers, software, cloud resources, peripherals); which sources are authoritative for each field (AD for user, hypervisor for VM, network controller for ports, procurement for model and cost); uniqueness rules and mandatory fields (serial, hostname, MAC, owner, location, criticality, status); where agents are needed and where agentless is enough (segments with closed ports, remote laptops, DMZ servers); integration plan (ITSM, AD, hypervisors, network devices, clouds, procurement).

What to do next week

Start small but realistic: 1 branch + 1 server rack + 20–30 laptops. Include domain PCs and “special” devices that usually get lost (medical equipment, terminals, network printers).

Then proceed stepwise:

Fix a canonical asset list (procurement and AD) and accuracy criteria: found/not found, correct type, correct key fields.
Run discovery in selected modes (agent/agentless) and take a snapshot on the same day.
Investigate mismatches: duplicates, wrong owners, empty serials, incorrect locations. This usually gives more value than swapping tools.
Estimate 3-year TCO: licenses, scanner/agent infrastructure, admin time for maintenance, integration costs and data-cleanup effort.

If the pilot shows you need help with integration, security and infrastructure (servers, workstations, DC), it’s often easier to proceed with a system integrator. In Kazakhstan, for example, GSE.kz combines hardware supply and integration work and offers 24/7 support through a service network, which helps move from pilot to stable operation faster.