LDBC: Creating a Metric for SNB
  • In the Making It Interactive post on the LDBC blog, we discussed composing an interactive Social Network Benchmark (SNB) metric. Now we will look at what this looks like in practice.

    A benchmark is known by its primary metric. An actual benchmark implementation may deal with endless complexity, but the whole point of the exercise is to reduce all of this to an extremely compact form, ideally a number or two.

    For SNB, we suggest clicks per second Interactive at scale (cpsI@ so many GB) as the primary metric. To each scale of the dataset corresponds a rate of update in the dataset's timeline (simulation time). When running the benchmark, the events in simulation time are transposed to a timeline in real time.

    Another way of expressing the metric is therefore an acceleration factor at scale. In the present example, we run a 300 GB database at an acceleration of 1.64; i.e., we covered 97 minutes of simulation time in 58 minutes of real time.
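Stated as code, both formulations of the metric are simple ratios. A minimal sketch, where the function names and the sample figures are ours for illustration and not part of the SNB specification:

```python
def cps_interactive(total_clicks: int, real_seconds: float) -> float:
    """Primary metric: clicks per second Interactive at a given scale (GB)."""
    return total_clicks / real_seconds

def acceleration_factor(simulation_minutes: float, real_minutes: float) -> float:
    """Equivalent view: simulation time covered per unit of real time."""
    return simulation_minutes / real_minutes

# A run that replays two hours of simulation time in one hour of real time
# has an acceleration factor of 2.0.
print(acceleration_factor(120.0, 60.0))  # -> 2.0
```

The two views carry the same information once the scale is fixed: the dataset's simulation timeline dictates a rate of events, so covering it faster means both more clicks per second and a higher acceleration factor.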

    Another key component of a benchmark is the full disclosure report (FDR). This is expected to enable any interested party to reproduce the experiment.

    The system under test (SUT) is Virtuoso running an SQL implementation of the workload at 300 GB (SF = 300). This run gives an idea of what an official report will look like but is not one yet. The implementation differs from the present specification in the following respects:

    • The SNB test driver is not used. Instead, the workload is read from the file system by stored procedures on the SUT. This is done to circumvent latencies in update scheduling in the test driver, which would otherwise keep the SUT from reaching full platform utilization.

    • The workload is extended with two short lookups, namely a person profile view and a post detail view. These are very short and serve to give the test more of an online flavor.

    • The short queries appear in the report as multiple entries. This should not be the case. This inflates the clicks per second number but does not significantly affect the acceleration factor.

    As a caveat, this metric will not be comparable with future ones.

    Aside from the composition of the report, the interesting point is that with the present workload, a 300 GB database keeps up with the simulation timeline on a commodity server, even when running updates. The query frequencies and run times are in the full report. We also produced a graphic showing the evolution of the throughput over a run of one hour:

    [Figure: ldbc-snb-qpm.png, throughput over the one-hour run]

    We see steady throughput except for some slower minutes, which correspond to database checkpoints. (A checkpoint, sometimes called a log checkpoint, is the operation that makes a database state durable outside of the transaction log.) If we run updates only, at full platform utilization, we get an acceleration of about 300x in memory for 20 minutes, followed by 10 minutes of nothing happening while the database is checkpointed. This was measured with six 2 TB magnetic disks. Such behavior is incompatible with an interactive workload. But with a checkpoint every 10 minutes and updates mixed with queries, checkpointing the database does not lead to impossible latencies. Thus, we do not get the TPC-C syndrome, which requires tens of disks or several SSDs per core to run.
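The updates-only arithmetic above can be checked with a small sketch. The helper name is ours; the 300x burst and the 20/10-minute figures are the ones quoted in the text:

```python
def effective_acceleration(burst_accel: float,
                           burst_minutes: float,
                           checkpoint_minutes: float) -> float:
    """Average acceleration over one cycle: a burst of work followed by a
    checkpoint stall during which no simulation time is covered."""
    simulation_covered = burst_accel * burst_minutes
    wall_clock = burst_minutes + checkpoint_minutes
    return simulation_covered / wall_clock

# Updates only: 300x for 20 minutes, then a 10-minute checkpoint stall.
print(effective_acceleration(300.0, 20.0, 10.0))  # -> 200.0
```

The average stays high, but the 10-minute gap during which nothing completes is exactly what makes this unusable for an interactive workload; spreading smaller checkpoints every 10 minutes among the queries removes the stall at a modest cost in average throughput.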

    This is a good thing for the benchmark, as we do not want to require unusual I/O systems for competitive runs. Such a requirement would simply encourage people to ignore this part of the specification and would limit the number of qualifying results.

    The full report contains the details and also serves as a template for later "real" FDRs. The supporting files are divided into test implementation and system configuration. With these materials plus the data generator, one should be able to repeat the results using a Virtuoso Open Source cut from v7fasttrack at github.com, feature/analytics branch.

    In later posts we will analyze the results a bit more and see how much improvement potential we find. The next SNB article will be about the business intelligence and graph analytics areas of SNB.

Published: 2014-11-13T21:09:00Z