• Data - Database

    SSAS: Dimension Relationships in Cubes

    “Dimension relationship” refers to the direct or indirect relationships between dimension and its measure groups in a Cube. Regular Refers to a standard relationship, when a Key column in the dimension is directly joined to fact table. Reference When a Key column in the dimension is indirectly joined to fact table by referencing another dimension. Fact / Degenerate Dimensions constructed from attribute columns in fact tables than from attribute columns in dimension tables. Many-to-Many One dimension is associated with multiple facts Read more: https://docs.microsoft.com/en-us/sql/analysis-services/multidimensional-models-olap-logical-cube-objects/dimension-relationships?view=sql-server-2017 Note: My study notes

  • Technology

    Common RAID levels explained

    RAID 0 – Disk Striping – Used for the storage of noncritical items but which requires fast read-write.– Does not have parity (parity is about checking whether the data has been lost or overwritten on transition)– Does not have redundancy or fault tolerance. i.e., when the drive dies, the data is lost. RAID 1 – Disk Mirroring – Used usually for OS, SQL Engine etc. installation– Two or more disks used to write data and is in parallel– High performance– High availability– No data loss on disk failure RAID 2 RAID 3 RAID 4 RAID 5 – Striping with Parity…

  • Code - Technology

    Programming Puzzle #2 – Leet Converter

    Write a program in a computer language of your choice to convert any given text to “leet format” in real time. Leet (or “1337”), is a system of modified spellings used primarily on the Internet. Input: “Translator” Output:”Tr4nsl4t0r”Input: “leet”, Output: “l33t”Input: “Good Morning”, Output: “G00d M0rn1ng” Evaluation criteria: Code Quality Standards OOAD/Object-Oriented Analysis & Design Application Logic Exception Handling Simplicity and Effectiveness of code Time: 0-30 minutes max.

  • Code - Technology

    Programming Puzzle #1–Find the critical path

    Write a program in a language of your choice to find the critical path from a given set of tasks. A critical path is determined by identifying the longest stretch of dependent activities and measuring the time required to complete them from start to finish. Each circle (A-G) are tasks with specific duration (in Hours). Input: Array of task names and duration given in the diagram. Output 1. Longest path (Critical path) is A+G+B+F+C+D (42Hrs)2. Shortest path is A+B+C+D (26 Hrs)

  • Database - Technology

    Why Cosmos DB may not be apt for building Data Warehouse?

    Well, the question is slightly wrong until the context is specified because it is possible to build Modern Data Warehouse by including Cosmos DB in the architecture. This is too much relevant today because the data is no more straight forward content with human readable entities and relations (structured), but unstructured and/or streaming too. Also the pace of the data flow, or business requirement is becoming near real-time. See a reference architecture below: Image Source: MS Docs Here, in this blog, the context is about Traditional Data Warehouse possibility, where you will be modelling the data, specifying relationships, etc. Let…

  • Data - Database - Technology

    Getting started with Azure Databricks

    Introduction What is Azure Databricks? Azure Databricks is the same Apache Databricks, but a managed version by Azure. This managed service allows data scientists, developers, and analysts to create, analyse and visualize data science projects in cloud. Databricks is a user friendly, analytics platform built on top of Apache Spark. Databricks acts as an UI layer, a WYSIWYG dashboard where you can create clusters, manage notebooks, write code and analyse data without knowing the internals of the system. Apache Spark is a unified analytics engine for large scale data processing and currently it supports popular languages such as Python, Scala,…

  • Database

    SQL Server 2016 Row Level Security (RLS) Implementation

    Row Level Security Capability was introduced with 2016 version SQL Server. Same is available in Azure SQL Database also as of today. This blog will detail a novice example on how to implement. This is the planned implementation flow: About the example Prepare Sample data Create Predicate function Apply Security Policy Test Security 1. Plot For the purpose of example, we will take the case of an imaginary Super market. Let us assume there are Supervisors assigned to each department in the shop and we want each supervisor to see only items he is responsible for. RLS is applied on…

  • Technology

    Latency vs Throughput

    Latency is about how fast a request can travel from one point to another. Example: For sending 100Mb of data, the API will take one hour. Throughput is how much data can travel from one point to another in a specific time frame. Example: in one hour, this API can transmit 100Mb of data